Devlive Open Source Community: this search took 1.672 seconds and found 30 relevant results.
  • Performance

    2191 2024-07-01 《Apache Hudi 0.15.0》
    Optimized DFS Access Performance Optimizations Write Path Bulk Insert Upserts Indexing Read Path Data Skipping Optimized DFS Access Hudi also performs several key storage...
  • Table & Query Types

    2188 2024-06-28 《Apache Hudi 0.15.0》
    Table and Query Types Table Types Query types Copy On Write Table Merge On Read Table Query configs Spark configs Flink Configs Related Resources Table and Query Types H...
  • Scan Executors

    2129 2024-06-22 《Apache Accumulo 2.x》
    Configuring and using Scan Executors Configuring and using Scan Prioritizers. Providing hints from the client side. Accumulo scans operate by repeatedly fetching batches of dat...
  • Structured Streaming

    1921 2024-06-29 《Apache Iceberg 1.5.2》
    Streaming Reads Streaming Writes Partitioned table Maintenance for streaming tables Tune the rate of commits Expire old snapshots Compacting data files Rewrite manifests I...
  • The TTL Of Kyuubi Engines

    1855 2024-07-05 《Apache Kyuubi 1.9.1》
    The Big Contributors Of Resource Waste TTL Types In Kyuubi Engines Configurations Engine TTL Executor TTL For a multi-tenant cluster, its overall resource utilization is a KP...
  • TiDB

    1742 2024-07-05 《Apache Kyuubi 1.9.1》
    TiDB Integration Dependencies Configurations TiDB Operations TiDB is an open-source NewSQL database that supports Hybrid Transactional and Analytical Processing (HTAP) workloa...
  • AWS

    Iceberg AWS Integrations Iceberg provides integration with different AWS services through the iceberg-aws module. This section describes how to use Iceberg with AWS. Enabling ...
  • State Management and Watermarks

    Managing Watermarks in a Job Basics Task Failures Multi-Dataset Jobs Gobblin State Deep Dive State class hierarchy How States are Used in a Gobblin Job This page has two p...
  • Structured Streaming

    Spark Structured Streaming Iceberg uses Apache Spark’s DataSourceV2 API for data source and catalog implementations. Spark DSv2 is an evolving API with different levels of support...
  • Daft

    Daft Daft is a distributed query engine written in Python and Rust, two fast-growing ecosystems in the data engineering and machine learning industry. It exposes its flavor of t...