The Basics of AQE Dynamically Switch Join Strategies Dynamically Coalesce Shuffle Partitions Other Tips for Best Practises How to set spark.sql.adaptive.advisoryPartitionSizeInByt...
Description Using Dependency For Spark/Flink Engine For SeaTunnel Zeta Engine Key Features Options driver [string] user [string] password [string] url [string] query [stri...
Support Iceberg Version Support Those Engines Description Supported DataSource Info Database Dependency Data Type Mapping Sink Options Task Example Simple: Hive Catalog: H...
Using Apache Hadoop resource in Flink on Kubernetes 1. Apache HDFS 1.1 Add the shaded jar 1.2. add core-site.xml and hdfs-site.xml 2. Apache Hive 2.1. Add Hive-related jars 2...
Dependency of elastic writing Write data to Elasticsearch based on the official Using Apache StreamPark™ writes to Elasticsearch 1. Configure policies and connection information 2. Write to Elasticsearch Other configur...
Pre-requisites Steps Initialize a pyspark shell Create dataset Running sync Conclusion Next steps Using OneTable to sync your source tables in different target format invo...
Trouble Shooting Common Issues java.lang.UnsupportedClassVersionError .. Unsupported major.minor version 52.0 org.apache.spark.SparkException: When running with master ‘yarn’ eith...
The Basics of Dynamic Resource Allocation How to Enable Dynamic Resource Allocation Dynamic Resource Allocation w/ External Shuffle Service Dynamic Allocation w/o External Shuffl...
Monitoring Accumulo Monitor SSL Metrics Configuration Metric Names Monitoring Accumulo Monitor The Accumulo Monitor provides a web UI with information on the health and ...