Iceberg Java API Tables The main purpose of the Iceberg API is to manage table metadata, like schema, partition spec, metadata, and data files that store table data. Table metad...
Introduction Hadoop and S3 The s3a File System The s3 File System Getting Gobblin to Publish to S3 Signing Up For AWS Setting Up EC2 Launching an EC2 Instance EC2 Package I...
Metrics This feature is available only in Redisson PRO Monitoring systems integration Redisson provides integration the most popular monitoring systems through Micrometer fram...
New in version 1.4.0. Kyuubi provides an frontend service that enables the connectivity and accessibility from MySQL connectors. The document you are visiting now is incompl...
Flink Queries Iceberg support streaming and batch read With Apache Flink ‘s DataStream API and Table API. Reading with SQL Iceberg support both streaming and batch read in Flink...
Spark Configuration Catalogs Spark adds an API to plug in table catalogs that are used to load, create, and manage Iceberg tables. Spark catalogs are configured by setting Spark ...
Object holder Java implementation of Redis or Valkey based RBucket object is a holder for any type of object. Size is limited to 512Mb. Code example: RBucket < AnyObject > buc...
Introduction Dataset Config Management Requirement Data Model Versioning Client library Config Store Current Dataset Config Management Implementation Data model Client appli...
Gobblin General Questions What is Gobblin? What programming languages does Gobblin support? Does Gobblin require any external software to be installed? What Hadoop versions can ...