Spark SQL Insert Into Insert Overwrite Update Merge Into Delete From Data Skipping and Indexing Flink SQL Insert Into Update Delete From Setting Writer/Reader Configs F...
Spark DataSource API The hudi-spark module offers the DataSource API to write a Spark DataFrame into a Hudi table. There are a number of options available: HoodieWriteConfig :...
Approaches Use Hudi for new partitions alone Convert existing table to Hudi Using Hudi Streamer Using Spark Datasource Writer Using Spark SQL CALL Procedure Using Hudi CLI C...
Key Generators SimpleKeyGenerator ComplexKeyGenerator NonpartitionedKeyGenerator CustomKeyGenerator Bring your own implementation TimestampBasedKeyGenerator Timestamp is GMT ...
Syncing to Unity Catalog Pre-requisites Steps Running sync Register the target table in Unity Catalog Validating the results Conclusion Syncing to Unity Catalog This docum...
Connector V2 Health SeaTunnel uses a grading system for connectors to help you understand what to expect from a connector: Alpha Beta General Availability (GA) Expec...
Configurations Example DataHub is a rich metadata platform that supports features like data discovery, data obeservability, federated governance, etc. Since Hudi 0.11.0, you c...