Sync Modes Manifest File Benefits of using the new manifest approach: View Over Files (Legacy) Configurations Partition Handling Example Hudi tables can be queried from Goo...
Approaches Use Hudi for new partitions alone Convert existing table to Hudi Using Hudi Streamer Using Spark Datasource Writer Using Spark SQL CALL Procedure Using Hudi CLI C...
Requirements Default Behavior Related Configurations Remote Metastore Database Remote Metastore Server Activate Configurations Via kyuubi-defaults.conf Via hive-site.xml Via...
Spark Tuning Guide Writing General Tips Spark failures Hudi consumes too much space in a temp folder while upsert How to tune shuffle parallelism of Hudi jobs ? GC Tuning ...
Introduce multi-catalog How to use Future work Introduce multi-catalog A catalog is a metadata namespace that stores information about databases, tables, views, indexes, users...
Connector V2 Health SeaTunnel uses a grading system for connectors to help you understand what to expect from a connector: Alpha Beta General Availability (GA) Expec...
Spark DataSource API The hudi-spark module offers the DataSource API to write a Spark DataFrame into a Hudi table. There are a number of options available: HoodieWriteConfig :...