Architecture Supported table formats Supported engines Iceberg format Paimon format Mixed format User cases Self-managed streaming Lakehouse Stream-and-batch-fused data pipe...
Introduction Self-optimizing mechanism Self-optimizing scheduling policy Quota Balanced Introduction Lakehouse is characterized by its openness and loose coupling, with data...
Table freshness Table watermark Table freshness Data freshness represents timeliness, and in many discussions, freshness is considered one of the important indicators of data q...
A Streaming Data Lake Near Real-Time Ingestion Incremental Processing Pipelines Unified Batch and Streaming Cloud-Native Tables Schema Management ACID Transactions Efficient ...
Metadata Table Supporting Multi-Modal Index in Hudi Metadata table indices Enable Hudi Metadata Table and Multi-Modal Index in write side Use metadata indices for query side im...
How does Hudi ensure atomicity? Does Hudi extend the Hive table layout? What concurrency control approaches does Hudi adopt? Hudi’s commits are based on transaction start time i...