Support Those Engines Key features Description Sink Options Task Example Changelog 2.2.0-beta 2022-09-26 Socket sink connector Support Those Engines Spark Flink SeaTu...
Maintenance Maintenance operations require the Table instance. Please refer to the Java API quickstart page to see how to load an existing table. Recommended Maintenance Expire...
A MySQL, Oracle, PostgreSQL, or Amazon RDS database instance must be running and available to be used by Ranger. The Ranger installation will create two new users (default names: ...
Introduction Hive on Spark Differences Between Kyuubi and HiveServer2 Performance References Introduction HiveServer2 is a service that enables clients to execute Hive QL qu...
Disclaimer Supported Storage System Verified Combination of Spark and storage system HDInsight Spark2.4 on Azure Data Lake Storage Gen 2 Databricks Spark2.4 on Azure Data Lake S...
Overview of the ForkOperator Using the ForkOperator Basics of Usage Per-Fork Configuration Failure Semantics Performance Tuning Comparison with PartitionedDataWriter Writing...
Spark Configuration Catalogs Spark adds an API to plug in table catalogs that are used to load, create, and manage Iceberg tables. Spark catalogs are configured by setting Spark ...
Spark Streaming Structured Streaming reads are based on Hudi’s Incremental Query feature; therefore, a streaming read can return data for which commits and base fil...
Rewrite files action. Iceberg provides an API to rewrite small files into large files by submitting Flink batch jobs. The behavior of this Flink action is the...
JDO defines XML MetaData in jdo files as well as orm files. As always with XML, the metadata must match the defined DTD/XSD for that file type. This section describes the conten...