Devlive 开源社区 本次搜索耗时 0.585 秒,为您找到 219 个相关结果.
  • HTTP

    Introduction Note Constructs HttpOperation AsyncRequestBuilder HttpClient ResponseHandler Build an asynchronous writer AvroHttpWriterBuilder R2RestWriterBuilder Bui...
  • Metrics for Gobblin ETL

    Configuring Metrics and Event emission Operational Metrics Extractor Metrics Converter Metrics Fork Operator Metrics Row Level Policy Metrics Data Writer Metrics Runtime Eve...
  • Writing ORC Data

    Introduction Hive SerDe Integration Writing to an ORC File Data Flow Extending Gobblin’s SerDe Integration Introduction Gobblin is capable of writing data to ORC files by le...
  • Reliability

    Reliability Iceberg was designed to solve correctness problems that affect Hive tables running in S3. Hive tables track data files using both a central metastore for partitions a...
  • Writing Tables

    426 2024-06-27 《Apache Hudi 0.15.0》
    SQL DDL SQL DML Batch Writes Streaming Writes
  • Parquet HDFS

    Description Usage Example Pipeline Configuration Configuration Developer Notes Description An extension to FsDataWriter that writes in Parquet format in the form of either...
  • Exactly Once Support

    Achieving Exactly-Once Delivery with CommitStepStore Scalability 2 can also easily be parallelized where we have each container responsible for a subset of datasets. APIs Thi...
  • Partitioned Writers

    Existing Partition Aware Writers Existing Partitioners Design Implementing a partitioner Implementing a Partition Aware Writer Builder Gobblin allows partitioning output data...
  • Case Studies

    Kafka-HDFS Ingestion Publishing Data to S3 Writing ORC Data Hive Distcp