Pre-Splitting New Tables Multiple Ingest Clients Bulk Ingest Logical Time for Bulk Ingest MapReduce Ingest Accumulo is often used as part of a larger data processing and stor...
How To Use Spark Dynamic Resource Allocation (DRA) in Kyuubi The Basics of Dynamic Resource Allocation How to Enable Dynamic Resource Allocation Dynamic Resource Allocation w/ Ext...
Indexing Multi-modal Indexing Index Types in Hudi Global and Non-Global Indexes Configs Spark based configs Flink based configs Indexing Strategies Workload 1: Late arriving...
Configuration Encrypting All Tables Per Table Encryption Disabling Crypto Custom Crypto Things to keep in mind Utilities need access to encryption properties Some data will b...
Create a table Using a Hive catalog Using a Hadoop catalog Branching and Tagging Creating branches and tags Committing to branches Reading from branches and tags Replacing an...
Key Image Tags and Examples Caching About database drivers On supporting arm64 AND amd64 Working with Apple silicon The Apache Superset community extensively uses Docker for ...
Upgrading from 1.10 or 2.0 to 2.1 Create ZooKeeper snapshot (optional - but recommended) Rename master Properties, Config Files, and Script References Pre-Upgrade the property st...
The Basics of AQE Dynamically Switch Join Strategies Dynamically Coalesce Shuffle Partitions Other Tips for Best Practises How to set spark.sql.adaptive.advisoryPartitionSizeInByt...
Trouble Shooting Common Issues java.lang.UnsupportedClassVersionError .. Unsupported major.minor version 52.0 org.apache.spark.SparkException: When running with master ‘yarn’ eith...