Introduction Example Lineage specific identification SQL type support Query Command Building Build with Apache Maven Build against Different Apache Spark Versions Test with...
What is Apache Hudi Core Concepts to Learn Getting Started Connect With The Community Join in on discussions Come to Office Hours for help Community Calls Contribute Welco...
Catalogs Catalog configuration Using catalogs Replacing the session catalog Using catalog specific Hadoop configuration values Loading a custom catalog SQL Extensions Runtim...
User experience Reliability and performance Open standard Apache Iceberg is an open table format for huge analytic datasets. Iceberg adds tables to compute engines including S...
Introduce multi-catalog How to use Future work Introduce multi-catalog A catalog is a metadata namespace that stores information about databases, tables, views, indexes, users...
Operation Types UPSERT INSERT BULK_INSERT DELETE BOOTSTRAP INSERT_OVERWRITE INSERT_OVERWRITE_TABLE DELETE_PARTITION Configs Writing path Related Resources It may be he...
Spark Tuning Guide Writing General Tips Spark failures Hudi consumes too much space in a temp folder while upsert How to tune shuffle parallelism of Hudi jobs ? GC Tuning ...
Syncing to Unity Catalog Pre-requisites Steps Running sync Register the target table in Unity Catalog Validating the results Conclusion Syncing to Unity Catalog This docum...