Introduce multi-catalog; How to use; Future work. Introduce multi-catalog: A catalog is a metadata namespace that stores information about databases, tables, views, indexes, users...
What is Apache Hudi; Core Concepts to Learn; Getting Started; Connect With The Community; Join in on discussions; Come to Office Hours for help; Community Calls; Contribute; Welco...
The table format (also called simply the format) was first proposed by Iceberg and can be described as follows: it defines the relationship between tables and files, and any engine can query and r...
Support Doris Version; Support Those Engines; Key Features; Description; Sink Options; schema_save_mode[Enum]; data_save_mode[Enum]; save_mode_create_template; Data Type Mapping; Su...
Spark Tuning Guide; Writing; General Tips; Spark failures; Hudi consumes too much space in a temp folder while upsert; How to tune shuffle parallelism of Hudi jobs?; GC Tuning ...
Operation Types: UPSERT, INSERT, BULK_INSERT, DELETE, BOOTSTRAP, INSERT_OVERWRITE, INSERT_OVERWRITE_TABLE, DELETE_PARTITION; Configs; Writing path; Related Resources. It may be he...
Background; Cleaning Retention Policies; Configs; Ways to trigger Cleaning; Inline; Async; Run independently; CLI; Related Resources. Background: Cleaning is a table service emplo...