Using Iceberg in Spark 3 Adding catalogs Creating a table Writing Reading Next steps The latest version of Iceberg is 1.5.2 . Spark is currently the most feature-rich compu...
Introduce multi-catalog How to use Future work Introduce multi-catalog A catalog is a metadata namespace that stores information about databases, tables, views, indexes, users...
Support Iceberg Version Support Those Engines Key features Description Supported DataSource Info Database Dependency Data Type Mapping Source Options Task Example Simple: ...
Syncing to Hive Metastore Pre-requisites Steps Running sync Register the target table in Hive Metastore Conclusion Syncing to Hive Metastore This document walks through the...
Spark Tuning Guide Writing General Tips Spark failures Hudi consumes too much space in a temp folder while upsert How to tune shuffle parallelism of Hudi jobs ? GC Tuning ...
Table format (aka. format) was first proposed by Iceberg, which can be described as follows: It defines the relationship between tables and files, and any engine can query and r...
HA Architecture Key Benefits System-side Deployment Pseudo mode Production mode Client-side Usage How to Hot Upgrade Kyuubi Server As an enterprise-class ad-hoc SQL query s...