Devlive 开源社区 本次搜索耗时 0.791 秒,为您找到 370 个相关结果.
  • Architecture

    Gobblin Architecture Overview Gobblin Job Flow Gobblin Constructs Source and Extractor Converter Quality Checker Fork Operator Data Writer Data Publisher Gobblin Task Flow...
  • Procedures

    Spark Procedures To use Iceberg in Spark, first configure Spark catalogs . Stored procedures are only available when using Iceberg SQL extensions in Spark 3. Usage Procedures c...
  • Gobblin CLI

    Gobblin Commands & Execution Modes Gobblin Commands The Distcp Quick App The OneShot Quick App Developing quick apps for the CLI Implementing new Gobblin commands Gobblin Ser...
  • Gobblin on Yarn

    Introduction Architecture Overview The Role of Apache Helix Gobblin Yarn Application Launcher YarnAppSecurityManager LogCopier Gobblin ApplicationMaster YarnService GobblinH...
  • Flink Writes

    Flink Writes Iceberg support batch and streaming writes With Apache Flink ‘s DataStream API and Table API. Writing with SQL Iceberg support both INSERT INTO and INSERT OVERWRIT...
  • Java API

    Iceberg Java API Tables The main purpose of the Iceberg API is to manage table metadata, like schema, partition spec, metadata, and data files that store table data. Table metad...
  • Configuration

    Spark Configuration Catalogs Spark adds an API to plug in table catalogs that are used to load, create, and manage Iceberg tables. Spark catalogs are configured by setting Spark ...
  • Maintenance

    Maintenance Maintenance operations require the Table instance. Please refer Java API quickstart page to refer how to load an existing table. Recommended Maintenance Expire...
  • Flink Queries

    Flink Queries Iceberg support streaming and batch read With Apache Flink ‘s DataStream API and Table API. Reading with SQL Iceberg support both streaming and batch read in Flink...
  • Fork Operator

    Overview of the ForkOperator Using the ForkOperator Basics of Usage Per-Fork Configuration Failure Semantics Performance Tuning Comparison with PartitionedDataWriter Writing...