A simplified, lightweight ETL Framework based on Apache Spark. Visit Website
Metorikku is a library that simplifies writing and executing ETLs on top of Apache Spark. It is based on simple YAML configuration files and runs on any Spark cluster. The platform also includes a simple way to write unit and E2E tests.