Evolution Iceberg supports in-place table evolution . You can evolve a table schema just like SQL — even in nested structures — or change partition layout when data volume chang...
Writing with SQL INSERT OVERWRITE INSERT INTO Upsert to table with primary keys. DELETE FROM UPDATE MERGE INTO Writing with DataFrames Appending data Overwriting data Crea...
Usage Manage kyuubi servers List server Create server Get server Delete server Manage kyuubi engines List engine Get engine Delete engine Usage bin / kyuubi - ctl -- h...
Daft Daft is a distributed query engine written in Python and Rust, two fast-growing ecosystems in the data engineering and machine learning industry. It exposes its flavor of t...
Running Tests Fully Running Tests for a Module Running Tests for a Single Test Kyuubi can be tested based on Apache Maven and the ScalaTest Maven Plugin, please refer to the ...
The Engine Configuration Guide The Engine Configuration Guide Kyuubi aims to bring Spark to end-users who need not qualify with Spark or something else related to the big data a...
Querying from Apache Spark To read an Apache XTable™ (Incubating) synced target table (regardless of the table format) in Apache Spark locally or on services like Amazon EMR, Goog...
Spark Queries To use Iceberg in Spark, first configure Spark catalogs . Iceberg uses Apache Spark’s DataSourceV2 API for data source and catalog implementations. Querying with S...
Querying from Google BigQuery Iceberg tables To read an Apache XTable™ (Incubating) synced Iceberg table from BigQuery , you have two options: Using Iceberg JSON metadata file ...
Aliyun OSS configs Aliyun OSS Credentials Aliyun OSS Libs In this page, we explain how to get your Hudi spark job to store into Aliyun OSS. Aliyun OSS configs There are two c...