Devlive 开源社区 本次搜索耗时 0.709 秒,为您找到 65 个相关结果.
  • Introduction

    Documentation Apache Iceberg is an open table format for huge analytic datasets. Iceberg adds tables to compute engines including Spark, Trino, PrestoDB, Flink, Hive and Impala u...
  • State Management and Watermarks

    Managing Watermarks in a Job Basics Task Failures Multi-Dataset Jobs Gobblin State Deep Dive State class hierarchy How States are Used in a Gobblin Job This page has two p...
  • Unity Catalog

    Syncing to Unity Catalog This document walks through the steps to register an Apache XTable™ (Incubating) synced Delta table in Unity Catalog on Databricks and open-source Unity C...
  • Retention

    Introduction Design Overview of Gobblin Config Management Library Retention Constructs DatasetCleaner DatasetFinder ManagedCleanableDatasetFinder ConfigurableCleanableDataset ...
  • BigLake Metastore

    Syncing to BigLake Metastore This document walks through the steps to register an Apache XTable™ (Incubating) synced Iceberg table in BigLake Metastore on GCP. Pre-requisites S...
  • Camus to Gobblin Migration

    Advantages of Migrating to Gobblin Kafka Ingestion Related Job Config Properties Config properties for pulling Kafka topics Config properties for compaction Deployment and Chec...
  • Glue Data Catalog

    Syncing to Glue Data Catalog This document walks through the steps to register an Apache XTable™ (Incubating) synced table in Glue Data Catalog on AWS. Pre-requisites Source ta...
  • Gobblin Modules

    Introduction How it works gobblin-modules/ Gobblin flavor Current flavors and modules What’s next Introduction Gobblin-modules is a way to support customization of the gobb...
  • Hive Distcp

    Introduction Configure Hive Distcp Job Source and target metastores Database and tables to copy Target path computation Conflicting table and partitions treatment Deregisterin...
  • Query based

    Introduction Constructs QueryBasedSource QueryBasedExtractor build readRecord Configuration Introduction QueryBasedSource represents a category of sources whose data...