Indexing Multi-modal Indexing Index Types in Hudi Global and Non-Global Indexes Configs Spark based configs Flink based configs Indexing Strategies Workload 1: Late arriving...
Format Options How to use Ogg format Kafka uses example Oracle GoldenGate (a.k.a ogg) is a managed service providing a real-time data mesh platform, which uses replication to ...
Ingest into one table Iceberg format Mixed-Iceberg format Ingest Into multiple tables Iceberg format Mixed-Iceberg format CDC stands for Change Data Capture, which is a broa...
Compared with Iceberg format, Mixed-Iceberg format provides more features: Stronger primary key constraints that also apply to Spark OLAP performance that is production-ready fo...
Preparation when using Flink SQL Client Flink’s Python API Adding catalogs. Catalog Configuration Hive catalog Creating a table Writing Branch Writes Reading Type conversi...
Description Using Dependency For Spark/Flink Engine For SeaTunnel Zeta Engine Key Features Options driver [string] user [string] password [string] url [string] query [stri...
What does the Hudi cleaner do? How do I run compaction for a MOR table? What options do I have for asynchronous/offline compactions on MOR table? How to disable all table servic...
Introduction Self-optimizing mechanism Self-optimizing scheduling policy Quota Balanced Introduction Lakehouse is characterized by its openness and loose coupling, with data...