How To Use Spark Dynamic Resource Allocation (DRA) in Kyuubi; The Basics of Dynamic Resource Allocation; How to Enable Dynamic Resource Allocation; Dynamic Resource Allocation w/ Ext...
System requirements; Download the distribution; Source code compilation; Configuration; Configure the service address; Configure system database; Configure high availability; Config...
Upgrading from 1.10 or 2.0 to 2.1; Create ZooKeeper snapshot (optional, but recommended); Rename master Properties, Config Files, and Script References; Pre-Upgrade the property st...
Deploying; Hudi Streamer; Spark Datasource Writer Jobs; Upgrading; Downgrading; Migrating. This section provides all the help you need to deploy and operate Hudi tables at scale. ...
The Basics of AQE; Dynamically Switch Join Strategies; Dynamically Coalesce Shuffle Partitions; Other Tips for Best Practices; How to set spark.sql.adaptive.advisoryPartitionSizeInByt...
Pre-Splitting New Tables; Multiple Ingest Clients; Bulk Ingest; Logical Time for Bulk Ingest; MapReduce Ingest. Accumulo is often used as part of a larger data processing and stor...
Support Those Engines; Key Features; Description; Supported DataSource Info; Source Options; Task Example; Simple; Regex Topic; AWS MSK SASL/SCRAM; AWS MSK IAM; Kerberos Authenticat...
Indexing; Multi-modal Indexing; Index Types in Hudi; Global and Non-Global Indexes; Configs; Spark based configs; Flink based configs; Indexing Strategies; Workload 1: Late arriving...
Description; Using Dependency; For Spark/Flink Engine; For SeaTunnel Zeta Engine; Key Features; Options; driver [string]; user [string]; password [string]; url [string]; query [stri...