Syncing to BigLake Metastore This document walks through the steps to register an Apache XTable™ (Incubating) synced Iceberg table in BigLake Metastore on GCP. Pre-requisites S...
Spark DDL To use Iceberg in Spark, first configure Spark catalogs . Iceberg uses Apache Spark’s DataSourceV2 API for data source and catalog implementations. CREATE TABLE Spark...
Introduction Hive on Spark Differences Between Kyuubi and HiveServer2 Performance References Introduction HiveServer2 is a service that enables clients to execute Hive QL qu...
Kyuubi provides several auxiliary SQL functions as supplement to Spark’s Built-in Functions Name Description Return Type Since kyuubi_version Return the version of K...
Getting Started The latest version of Iceberg is 1.8.1 . Spark is currently the most feature-rich compute engine for Iceberg operations. We recommend you to get started with Spar...
Spark Streaming Spark Streaming Structured Streaming reads are based on Hudi’s Incremental Query feature, therefore streaming read can return data for which commits and base fil...
Disclaimer Supported Storage System Verified Combination of Spark and storage system HDInsight Spark2.4 on Azure Data Lake Storage Gen 2 Databricks Spark2.4 on Azure Data Lake S...
Debugging Server Debugging Engine Flink Engine Trino Engine Hive Engine Debugging Apps Spark Engine Flink Engine You can use the Java Debug Wire Protocol to debug Kyuubi ...
Support Those Engines Key features Description Sink Options Task Example Changelog 2.2.0-beta 2022-09-26 Socket sink connector Support Those Engines Spark Flink SeaTu...