Documentation Apache Iceberg is an open table format for huge analytic datasets. Iceberg adds tables to compute engines including Spark, Trino, PrestoDB, Flink, Hive and Impala u...
Kyuubi provides several auxiliary SQL functions as supplement to Spark’s Built-in Functions Name Description Return Type Since kyuubi_version Return the version of K...
Spark Structured Streaming Iceberg uses Apache Spark’s DataSourceV2 API for data source and catalog implementations. Spark DSv2 is an evolving API with different levels of support...
Branching and Tagging Overview Iceberg table metadata maintains a snapshot log, which represents the changes applied to a table. Snapshots are fundamental in Iceberg as they are ...
Flink Queries Iceberg support streaming and batch read With Apache Flink ‘s DataStream API and Table API. Reading with SQL Iceberg support both streaming and batch read in Flink...
Disclaimer Supported Storage System Verified Combination of Spark and storage system HDInsight Spark2.4 on Azure Data Lake Storage Gen 2 Databricks Spark2.4 on Azure Data Lake S...
Support Those Engines Key features Description Sink Options Task Example Changelog 2.2.0-beta 2022-09-26 Socket sink connector Support Those Engines Spark Flink SeaTu...