Devlive 开源社区 本次搜索耗时 1.160 秒,为您找到 472 个相关结果.
  • Docker Integration

    1132 2025-03-19 《Apache Gobblin 0.17.0》
    Introduction Docker Docker Repositories Run the docker image with simple wikipedia jobs Use Gobblin Standalone on Docker for Kafka and HDFS Ingestion Run Gobblin as a Service ...
  • 用户手册

    快速上手 系统使用手册 元数据文档 部署参数分析
  • Gobblin as a Library

    1095 2025-03-17 《Apache Gobblin 0.17.0》
    Using Gobblin as a Library Creating an Embedded Gobblin instance Configuring Embedded Gobblin Running Embedded Gobblin Extending Embedded Gobblin Using Gobblin as a Library ...
  • Query based

    1084 2025-03-24 《Apache Gobblin 0.17.0》
    Introduction Constructs QueryBasedSource QueryBasedExtractor build readRecord Configuration Introduction QueryBasedSource represents a category of sources whose data...
  • Hive Registration

    1081 2025-03-19 《Apache Gobblin 0.17.0》
    How Hive Registration Works in Gobblin HiveSpec HiveRegistrationPolicy HiveSerDeManager Predicate and Activity How to Use Hive Registration in Your Gobblin Job Hive Regist...
  • Gobblin Modules

    1066 2025-03-26 《Apache Gobblin 0.17.0》
    Introduction How it works gobblin-modules/ Gobblin flavor Current flavors and modules What’s next Introduction Gobblin-modules is a way to support customization of the gobb...
  • Daft

    1048 2025-03-13 《Apache Iceberg 1.8.1》
    Daft Daft is a distributed query engine written in Python and Rust, two fast-growing ecosystems in the data engineering and machine learning industry. It exposes its flavor of t...
  • Queries

    1046 2025-03-10 《Apache Iceberg 1.8.1》
    Spark Queries To use Iceberg in Spark, first configure Spark catalogs . Iceberg uses Apache Spark’s DataSourceV2 API for data source and catalog implementations. Querying with S...
  • Docker Demo

    Building interoperable tables using Apache XTable™ (Incubating) This demo walks you through a fictional use case and the steps to add interoperability between table formats using ...
  • Reliability

    Reliability Iceberg was designed to solve correctness problems that affect Hive tables running in S3. Hive tables track data files using both a central metastore for partitions a...