Scan planning Iceberg is designed for huge tables and is used in production where a single table can contain tens of petabytes of data. Even ...
Auxiliary SQL Functions for Spark SQL Kyuubi provides several auxiliary SQL functions as a supplement to Spark's Built-in Functions ...
Running Tests Kyuubi can be tested based on Apache Maven and the ScalaTe...
Flink Writes Iceberg supports batch and streaming writes with Apache Flink's DataStream API and Table API. Writing with SQL Iceberg supports both INSERT INTO and INSERT OVERWRIT...
Kyuubi v.s. HiveServer2 Introduction HiveServer2 is a ...
Creating your first interoperable table Using Apache XTable™ (Incubating) to sync your source tables to different target formats involves running sync on your current dataset usi...
Spark Procedures To use Iceberg in Spark, first configure Spark catalogs. Stored procedures are only available when using Iceberg SQL extensions in Spark 3. Usage Procedures c...