Overview Use Cases Historical Tags Audit Branch Usage Overview Iceberg table metadata maintains a snapshot log, which represents the changes applied to a table. Snapshots ar...
Pre-requisites Steps Create BigLake Catalog Create BigLake Database Running sync Validating the results Conclusion This document walks through the steps to register a OneTa...
Key Generators SimpleKeyGenerator ComplexKeyGenerator NonpartitionedKeyGenerator CustomKeyGenerator Bring your own implementation TimestampBasedKeyGenerator Timestamp is GMT ...
Requirements Usage DB-API Use PyHive with Pandas Authentication PyHive is a collection of Python DB-API and SQLAlchemy interfaces for Hive. PyHive can connect with the Kyuub...
Introduction Supported table format Supported column data type How to use Optimize history data Syntax Examples Optimize incremental data To improve query speed, Kyuubi su...
Apache Paimon (Incubating) Integration Dependencies Configurations Apache Paimon (Incubating) Operations Apache Paimon (Incubating) is a streaming data lake platform that suppo...
Support Hive Version Support Those Engines Key Features Description Supported DataSource Info Database Dependency Data Type Mapping Source Options Tips Task Example Simple...
Support Those Engines Description Using Dependency For Spark/Flink Engine For SeaTunnel Zeta Engine Key Features Supported DataSource Info Data Type Mapping Sink Options Ti...
Catalog Configuration Runtime configuration Read options Write options Catalog Configuration A catalog is created and named by executing the following query (replace <catalog...
Table freshness Table watermark Table freshness Data freshness represents timeliness, and in many discussions, freshness is considered one of the important indicators of data q...