config.yaml description We use a YAML configuration file. It is created automatically after running the answer init command. The default path is /data/conf/config.yaml config.yaml...
The Engine Configuration Guide Kyuubi aims to bring Spark to end-users who need not be qualified in Spark or anything else related to the big data a...
Documentation Overview The documentation for Gobblin is based on ReadTheDocs and Mk...
Flink Connector Apache Flink supports creating an Iceberg table directly without creating an explicit Flink catalog in Flink SQL. That means we can just create an Iceberg table by s...
Introduction Docker Docker Repositories Run the docker image with simple wikipedia jobs Use Gobblin Standalone on Docker for Kafka and HDFS Ingestion Run Gobblin as a Service ...
RisingWave RisingWave is a Postgres-compatible SQL database designed for real-time event streaming data processing, analysis, and management. It can ingest millions of events per...
DDL commands CREATE Catalog Hive catalog This creates an Iceberg catalog named hive_catalog that can be configured using 'catalog-type'='hive', which loads tables from Hive m...
Querying from Apache Spark To read an Apache XTable™ (Incubating) synced target table (regardless of the table format) in Apache Spark locally or on services like Amazon EMR, Goog...
Collect Trace with Jaeger Jaeger, inspired by Dapper and OpenZipkin, is a distributed tracing ...