Spark DataSource API Daft Spark DataSource API The hudi-spark module offers the DataSource API to read a Hudi table into a Spark DataFrame. A time-travel query example: val ...
Branching and Tagging Overview Iceberg table metadata maintains a snapshot log, which represents the changes applied to a table. Snapshots are fundamental in Iceberg as they are ...
Gobblin General Questions What is Gobblin? What programming languages does Gobblin support? Does Gobblin require any external software to be installed? What Hadoop versions can ...
Next Step More Information To import the custom VDF into Ambari, follow these steps: In the cluster install wizard, Select Version step, click the drop down with the HDP vers...
Java API Quickstart Create a table Tables are created using either a Catalog or an implementation of the Tables interface. Using a Hive catalog The Hive catalog connects to...