Spark SQL Insert Into Insert Overwrite Update Merge Into Delete From Data Skipping and Indexing Flink SQL Insert Into Update Delete From Setting Writer/Reader Configs F...
Configuring and using Scan Executors Configuring and using Scan Prioritizers. Providing hints from the client side. Accumulo scans operate by repeatedly fetching batches of dat...
Spark DataSource API The hudi-spark module offers the DataSource API to write a Spark DataFrame into a Hudi table. There are a number of options available: HoodieWriteConfig :...
Prerequisites Running the Proxy Server Proxy Client Examples Ruby Python Java The Accumulo Proxy allows the interaction with Accumulo with languages other than Java. A prox...
Approaches Use Hudi for new partitions alone Convert existing table to Hudi Using Hudi Streamer Using Spark Datasource Writer Using Spark SQL CALL Procedure Using Hudi CLI C...
Configuring Tracing Instrumenting a Client Tracing from the Shell It can be difficult to determine why some operations are taking longer than expected. For example, you may be ...