Devlive Open Source Community: this search took 0.772 seconds and found 197 relevant results.
  • Gobblin CLI

    5858 2025-03-12 《Apache Gobblin 0.17.0》
    Gobblin Commands & Execution Modes Gobblin Commands The Distcp Quick App The OneShot Quick App Developing quick apps for the CLI Implementing new Gobblin commands Gobblin Ser...
  • AWS Datasync

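    DataSync Node Overview Create Task Task Example Node-Specific Parameters Environment Configuration DataSync Node Overview AWS DataSync is an online data transfer service that simplifies, automates, and accelerates moving data between on-premises storage systems and AWS Storage services, and between different AWS Storage services. Components supported by DataSync: Network File ...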
  • Kafka-HDFS Ingestion

    5816 2025-03-18 《Apache Gobblin 0.17.0》
    Getting Started Standalone MapReduce Job Constructs Source and Extractor Writer and Publisher Job Config Properties Metrics and Events Task Level Metrics Task Level Events ...
  • Hive

    5816 2025-03-13 《Apache Iceberg 1.8.1》
    Hive Iceberg supports reading and writing Iceberg tables through Hive by using a StorageHandler. Feature support The following features matrix illustrates the support for diff...
  • Procedures

    5708 2025-03-11 《Apache Iceberg 1.8.1》
    Spark Procedures To use Iceberg in Spark, first configure Spark catalogs. Stored procedures are only available when using Iceberg SQL extensions in Spark 3. Usage Procedures c...
  • File Sizing

    5657 2024-06-30 《Apache Hudi 0.15.0》
    Auto-sizing during writes File sizing for Copy-On-Write (COW) and Merge-On-Read (MOR) tables More details about file sizing for Merge-On-Read(MOR) tables Configs Auto-Sizing Wi...
  • Using Logstore

    5606 2024-06-26 《Apache Amoro 0.6.1》
    Real-Time data in LogStore Overview Prerequisites for using LogStore Double write LogStore and FileStore Due to the limitations of traditional offline data warehouse architect...
  • Creating your first interoperable table

    Creating your first interoperable table Using Apache XTable™ (Incubating) to sync your source tables into different target formats involves running sync on your current dataset usi...
  • Architecture

    5567 2025-03-13 《Apache Gobblin 0.17.0》
    Gobblin Architecture Overview Gobblin Job Flow Gobblin Constructs Source and Extractor Converter Quality Checker Fork Operator Data Writer Data Publisher Gobblin Task Flow...
  • Iceberg

    5516 2024-06-26 《Apache Amoro 0.6.1》
    Iceberg format refers to Apache Iceberg table, which is an open table format for large analytical datasets designed to provide scalable, efficient, and secure data storage and qu...