Setl Alternatives

Name: SETL-Framework/setl
Brand: SETL-Framework/setl
SKU: project/SETL-Framework/setl
Rating: 4.48 (172 reviews)

A simple Spark-powered ETL framework that just works 🍺

Categories > Data Processing > Machine Learning

Suggest Alternative

Stars

172

Alternatives

License

apache-2.0

Open Issues

Most Recent Commit

over 2 years ago

Programming Language

Scala

Dependent Repos

Dependent Packages

Total Releases

Latest Release

August 21, 2020

Categories

Machine Learning > Machine Learning

Data Processing > Dataset

Programming Languages > Scala

Data Processing > Pipeline

Data Processing > Data Science

Data Processing > Spark

Data Processing > Data Analysis

Data Processing > Big Data

Data Processing > Etl

Data Processing > Data Engineering

Data Processing > Data Transformation

Repo

Alternatives To SETL-Framework/setl

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
apache/airflow	33,219	0	320	over 2 years ago	169	November 27, 2023	890	apache-2.0	Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
mage-ai/mage-ai	6,324	0	0	over 2 years ago	314	December 06, 2023	189	apache-2.0	Python
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
orchest/orchest	3,876	0	0	about 3 years ago	19	December 13, 2022	125	apache-2.0	TypeScript
Build data pipelines, the easy way 🛠️
DAGWorks-Inc/hamilton	1,139	0	2	over 2 years ago	116	December 05, 2023	136	bsd-3-clause-clear	Jupyter Notebook
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
stitchfix/hamilton	877	0	0	about 3 years ago	10	October 23, 2022	12	bsd-3-clause-clear	Python
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
onepanelio/onepanel	697	0	1	over 3 years ago	64	November 15, 2021	85	apache-2.0	Go
The open source, end-to-end computer vision platform. Label, build, train, tune, deploy and automate in a unified platform that runs on any cloud and on-premises.
elastic/eland	588	0	3	over 2 years ago	30	November 22, 2023	88	apache-2.0	Python
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
grailbio/bigslice	525	0	0	about 3 years ago	13	April 05, 2021	23	apache-2.0	Go
A serverless cluster computing system for the Go programming language
insitro/redun	464	0	1	over 2 years ago	18	November 12, 2023	28	apache-2.0	Python
Yet another redundant workflow engine
Cascading/cascading	321	0	0	over 7 years ago	0			other	Java
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on various cluster computing platforms. Please see https://github.com/cwensel/cascading for access to all WIP branches.

Alternatives To SETL-Framework/setl

Select To Compare

apache/airflow ⭐ 33,219

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

dependent packages 320 total releases 169 most recent commit over 2 years ago downloads badge

mage-ai/mage-ai ⭐ 6,324

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

dependent packages 0 total releases 314 most recent commit over 2 years ago downloads badge

orchest/orchest ⭐ 3,876

Build data pipelines, the easy way 🛠️

dependent packages 0 total releases 19 most recent commit about 3 years ago downloads badge

DAGWorks-Inc/hamilton ⭐ 1,139

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.

dependent packages 2 total releases 116 most recent commit over 2 years ago downloads badge

stitchfix/hamilton ⭐ 877

A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton

dependent packages 0 total releases 10 most recent commit about 3 years ago

onepanelio/onepanel ⭐ 697

The open source, end-to-end computer vision platform. Label, build, train, tune, deploy and automate in a unified platform that runs on any cloud and on-premises.

dependent packages 1 total releases 64 most recent commit over 3 years ago

elastic/eland ⭐ 588

Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch

dependent packages 3 total releases 30 most recent commit over 2 years ago downloads badge

grailbio/bigslice ⭐ 525

A serverless cluster computing system for the Go programming language

dependent packages 0 total releases 13 most recent commit about 3 years ago

insitro/redun ⭐ 464

Yet another redundant workflow engine

dependent packages 1 total releases 18 most recent commit over 2 years ago downloads badge

Cascading/cascading ⭐ 321

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on various cluster computing platforms. Please see https://github.com/cwensel/cascading for access to all WIP branches.

dependent packages 0 total releases 0 most recent commit over 7 years ago

Suggest An Alternative To setl

Alternative Project Comparisons

SETL-Framework/setl vs Airflow

SETL-Framework/setl vs Mage Ai

SETL-Framework/setl vs Orchest

SETL-Framework/setl vs Hamilton

SETL-Framework/setl vs Onepanel

SETL-Framework/setl vs Eland

SETL-Framework/setl vs Bigslice

SETL-Framework/setl vs Redun

SETL-Framework/setl vs Cascading

Popular Etl Projects

pingcap/tidb⭐ 35,604

TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at : https://tidbcloud.com/free-trial

airbytehq/airbyte⭐ 12,918

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

apache/doris⭐ 10,666

Apache Doris is an easy-to-use, high performance and unified analytics database.

dagster-io/dagster⭐ 9,467

An orchestration platform for the development, production, and observation of data assets.

pentaho/pentaho-kettle⭐ 7,194

Pentaho Data Integration ( ETL ) a.k.a Kettle

Popular Machine Learning Projects

tensorflow/tensorflow⭐ 180,196

An Open Source Machine Learning Framework for Everyone

huggingface/transformers⭐ 119,240

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

pytorch/pytorch⭐ 74,794

Tensors and Dynamic neural networks in Python with strong GPU acceleration

netdata/netdata⭐ 66,844

Monitor your servers, containers, and applications, in high-resolution and in real-time!

microsoft/ML-For-Beginners⭐ 63,698

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

Popular Data Processing Categories

Jupyter Notebook

Dataset

Sql

Validation

Pipeline

Translation

Data Science

Classification

Transaction

Scraper