Devlive 开源社区 本次搜索耗时 0.604 秒,为您找到 115 个相关结果.
  • Troubleshooting

    3052 2024-07-01 《Apache Hudi 0.15.0》
    Writing Tables org.apache.parquet.io.InvalidRecordException: Parquet/Avro schema mismatch: Avro field ‘col1’ not found java.lang.UnsupportedOperationException: org.apache.parquet....
  • Clustering

    3042 2024-06-30 《Apache Hudi 0.15.0》
    Background How is compaction different from clustering? Clustering Architecture Overall, there are 2 steps to clustering Schedule clustering Execute clustering Clustering Use...
  • Design & Concepts

    2997 2024-07-01 《Apache Hudi 0.15.0》
    How does Hudi ensure atomicity? Does Hudi extend the Hive table layout? What concurrency control approaches does Hudi adopt? Hudi’s commits are based on transaction start time i...
  • Deployment

    2776 2024-07-01 《Apache Hudi 0.15.0》
    Deploying Hudi Streamer Spark Datasource Writer Jobs Upgrading Downgrading Migrating This section provides all the help you need to deploy and operate Hudi tables at scale. ...
  • Indexing

    2711 2024-06-28 《Apache Hudi 0.15.0》
    Indexing Multi-modal Indexing Index Types in Hudi Global and Non-Global Indexes Configs Spark based configs Flink based configs Indexing Strategies Workload 1: Late arriving...
  • Creating your first interoperable table

    Pre-requisites Steps Initialize a pyspark shell Create dataset Running sync Conclusion Next steps Using OneTable to sync your source tables in different target format invo...
  • Hive Metastore

    2477 2024-06-30 《Apache Hudi 0.15.0》
    Spark Data Source example Query using HiveQL Use partition extractor properly Hive Sync Tool Hive Sync Configuration Sync modes HMS JDBC HIVEQL Flink Setup Install Hive E...
  • Google BigQuery

    2394 2024-06-30 《Apache Hudi 0.15.0》
    Sync Modes Manifest File Benefits of using the new manifest approach: View Over Files (Legacy) Configurations Partition Handling Example Hudi tables can be queried from Goo...
  • Table Services

    2281 2024-07-01 《Apache Hudi 0.15.0》
    What does the Hudi cleaner do? How do I run compaction for a MOR table? What options do I have for asynchronous/offline compactions on MOR table? How to disable all table servic...
  • Spark Tuning Guide

    2263 2024-07-01 《Apache Hudi 0.15.0》
    Spark Tuning Guide Writing General Tips Spark failures Hudi consumes too much space in a temp folder while upsert How to tune shuffle parallelism of Hudi jobs ? GC Tuning ...