How Hive Registration Works in Gobblin HiveSpec HiveRegistrationPolicy HiveSerDeManager Predicate and Activity How to Use Hive Registration in Your Gobblin Job Hive Regist...
DDL commands CREATE Catalog Hive catalog This creates an Iceberg catalog named hive_catalog that can be configured using 'catalog-type'='hive', which loads tables from Hive m...
Introduction How it works gobblin-modules/ Gobblin flavor Current flavors and modules What’s next Introduction Gobblin-modules is a way to support customization of the gobb...
Documentation Overview GitHub Wiki Limitations MkDocs ReadTheDocs Additional Information Documentation Overview The documentation for Gobblin is based on ReadTheDocs and Mk...
Overview Redisson offers the ability to run as a standalone node and participate in distributed computing. Such nodes are used to run MapReduce, ExecutorService, ScheduledExecutorServ...
Spark Queries To use Iceberg in Spark, first configure Spark catalogs. Iceberg uses Apache Spark’s DataSourceV2 API for data source and catalog implementations. Querying with S...
Introduction Hive SerDe Integration Writing to an ORC File Data Flow Extending Gobblin’s SerDe Integration Introduction Gobblin is capable of writing data to ORC files by le...
Savepoint and restore with savepoint Use savepoint Use restore with savepoint Savepoint and restore with savepoint A savepoint is created using the checkpoint. A global mirror o...