Information about your use of this website is collected using server access logs and a tracking cookie. The collected information consists of the following: The IP address from ...
Hudi Integration Dependencies Configurations Hudi Operations Apache Hudi (pronounced “hoodie”) is the next generation streaming data lake platform. Apache Hudi brings core war...
Trino just like Presto allows you to query table formats like Hudi, Delta and Iceberg tables using connectors. Users do not need additional configurations to work with OneTable syn...
Spark DataSource API Daft Spark DataSource API The hudi-spark module offers the DataSource API to read a Hudi table into a Spark DataFrame. A time-travel query example: val ...
Hudi and Iceberg tables Delta Lake table To read a OneTable synced target table (regardless of the table format) in Amazon Redshift, users have to create an external schema and ...
Presto allows you to query table formats like Hudi, Delta and Iceberg using connectors. The same setup will work for OneTable synced tables as well. For more information and requ...
Spark Streaming Spark Streaming Structured Streaming reads are based on Hudi’s Incremental Query feature, therefore streaming read can return data for which commits and base fil...
Creating your first interoperable table Using Apache XTable™ (Incubating) to sync your source tables in different target format involves running sync on your current dataset usi...
Disclaimer Supported Storage System Verified Combination of Spark and storage system HDInsight Spark2.4 on Azure Data Lake Storage Gen 2 Databricks Spark2.4 on Azure Data Lake S...
Syncing to Hive Metastore This document walks through the steps to register an Apache XTable™ (Incubating) synced table on Hive Metastore (HMS). Pre-requisites Source table(s) ...