Configs The following describes the general file layout structure for Apache Hudi. Please refer to the tech spec for a more detailed description of the file layouts. Hudi orga...
Syncing to Glue Data Catalog This document walks through the steps to register an Apache XTable™ (Incubating) synced table in Glue Data Catalog on AWS. Pre-requisites Source ta...
Gobblin Execution Modes Overview One important feature of Gobblin is that it can be run on different platforms. Currently, Gobblin can run in standalone mode (which runs on a sing...
and isEmpty not or cond ifElse allPass anyPass defaultTo both complement either propSatisfies unless when pathSatisfies until xor isNotEmpty and a → b → a | b...
EventMesh Schema Registry (OpenSchema) Overview of Schema and Schema Registry Schema Schema Registry Comparison of Schema Registry in Different Projects Overview of OpenSchema ...
Introduction Hive SerDe Integration Writing to an ORC File Data Flow Extending Gobblin’s SerDe Integration Introduction Gobblin is capable of writing data to ORC files by le...
Introduction Docker Docker Repositories Run the docker image with simple wikipedia jobs Use Gobblin Standalone on Docker for Kafka and HDFS Ingestion Run Gobblin as a Service ...