Syncing to BigLake Metastore This document walks through the steps to register an Apache XTable™ (Incubating) synced Iceberg table in BigLake Metastore on GCP. Pre-requisites S...
Disclaimer Supported Storage System Verified Combination of Spark and storage system HDInsight Spark2.4 on Azure Data Lake Storage Gen 2 Databricks Spark2.4 on Azure Data Lake S...
Syncing to Unity Catalog This document walks through the steps to register an Apache XTable™ (Incubating) synced Delta table in Unity Catalog on Databricks and open-source Unity C...
To read a OneTable synced target table (regardless of the table format) in Amazon Athena, you can create the table either by: Using a DDL statement as mentioned in the following...
Syncing to Glue Data Catalog This document walks through the steps to register an Apache XTable™ (Incubating) synced table in Glue Data Catalog on AWS. Pre-requisites Source ta...
Configs The following describes the general file layout structure for Apache Hudi. Please refer the tech spec for a more detailed description of the file layouts. Hudi orga...
Features and Limitations Features Apache XTable™ (Incubating) provides users with the ability to translate metadata from one table format to another. Apache XTable™ (Incubatin...
Aliyun OSS configs Aliyun OSS Credentials Aliyun OSS Libs In this page, we explain how to get your Hudi spark job to store into Aliyun OSS. Aliyun OSS configs There are two c...
Querying from Apache Spark To read an Apache XTable™ (Incubating) synced target table (regardless of the table format) in Apache Spark locally or on services like Amazon EMR, Goog...
Spark Streaming Spark Streaming You can write Hudi tables using spark’s structured streaming. Scala // spark-shell // prepare to stream write to new table import org ....