Daft is a distributed query engine written in Python and Rust, two fast-growing ecosystems in the data engineering and machine learning industry. It exposes its flavor of t...
On each of your hosts: yum and rpm (RHEL/CentOS/Oracle/Amazon Linux); zypper and php_curl (SLES); apt (Debian/Ubuntu); scp, curl, unzip, tar, wget, and gcc*; OpenSSL (v1....
Using Gobblin as a Library. Contents: Creating an Embedded Gobblin instance; Configuring Embedded Gobblin; Running Embedded Gobblin; Extending Embedded Gobblin. ...
When an Rmd document is compiled, all of its metadata in the YAML section will be stored in the list object rmarkdown::metadata. For example, rmarkdown::metadata$title gives you...
Contents: Description; Usage; Example Pipeline Configuration; Configuration; Developer Notes. Description: An extension to FsDataWriter that writes in Parquet format in the form of either...
Contents: Problem Statement; New Design & Details. Problem Statement: The current Gobblin Kafka High Level Consumer uses the Kafka Consumer (0.8) APIs, and Gobblin support for them will be depreca...
Contents: Quick Start; Metric Contexts; Metrics; Events; Reporters. Gobblin Metrics is a metrics library for emitting metrics and events that instrument Java applications. Metrics and events...