Architecture Supported table formats Supported engines Iceberg format Paimon format Mixed format User cases Self-managed streaming Lakehouse Stream-and-batch-fused data pipe...
System requirements Download the distribution Source code compilation Configuration Configure the service address Configure system database Configure high availability Config...
Ingest into one table Iceberg format Mixed-Iceberg format Ingest Into multiple tables Iceberg format Mixed-Iceberg format CDC stands for Change Data Capture, which is a broa...
Compared with Iceberg format, Mixed-Iceberg format provides more features: Stronger primary key constraints that also apply to Spark OLAP performance that is production-ready fo...
Introduction Self-optimizing mechanism Self-optimizing scheduling policy Quota Balanced Introduction Lakehouse is characterized by its openness and loose coupling, with data...
Iceberg format Paimon format Mixed format Environment preparation Mixed-Hive format Frequently Asked Questions Iceberg format The Iceberg Format can be accessed using the C...
Introduce multi-catalog How to use Future work Introduce multi-catalog A catalog is a metadata namespace that stores information about databases, tables, views, indexes, users...
Table format (aka. format) was first proposed by Iceberg, which can be described as follows: It defines the relationship between tables and files, and any engine can query and r...
Optimizer container Local container Flink container External container Optimizer group Add optimizer group Edit optimizer group Remove optimizer group Optimizer Scale-out a...