Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Airbyte | 10,857 | 9 hours ago | 90 | June 23, 2022 | 4,700 | other | Python | |||
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes. | ||||||||||
Dagster | 7,616 | 2 | 89 | 19 hours ago | 495 | July 06, 2022 | 1,693 | apache-2.0 | Python | |
An orchestration platform for the development, production, and observation of data assets. | ||||||||||
Mage Ai | 4,819 | 14 hours ago | 9 | June 27, 2022 | 86 | apache-2.0 | Python | |||
š§ The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data. | ||||||||||
Orchest | 3,876 | 4 days ago | 14 | April 06, 2022 | 125 | apache-2.0 | TypeScript | |||
Build data pipelines, the easy way š ļø | ||||||||||
Kestra | 3,492 | a day ago | 28 | August 30, 2022 | 256 | apache-2.0 | Java | |||
Kestra is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and monitoring millions of complex pipelines. | ||||||||||
Awesome Etl | 2,874 | a month ago | 11 | |||||||
A curated list of awesome ETL frameworks, libraries, and software. | ||||||||||
Mara Pipelines | 1,993 | a month ago | 11 | June 11, 2020 | 24 | mit | Python | |||
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow | ||||||||||
Go Streams | 1,344 | 9 | 3 months ago | 18 | February 11, 2022 | 18 | mit | Go | ||
A lightweight stream processing library for Go | ||||||||||
Onepanel | 669 | 1 | 4 months ago | 64 | November 15, 2021 | 85 | apache-2.0 | Go | ||
The open source, end-to-end computer vision platform. Label, build, train, tune, deploy and automate in a unified platform that runs on any cloud and on-premises. | ||||||||||
Goodreads_etl_pipeline | 593 | 3 years ago | mit | Python | ||||||
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. |
Website ⢠Twitter ⢠Linked In ⢠Slack ⢠Documentation
Try Kestra using our live demo.
Kestra is an open-source, event-driven orchestrator that simplifies data operations and improves collaboration between engineers and business users. By bringing Infrastructure as Code best practices to data pipelines, Kestra allows you to build reliable workflows and manage them with confidence.
Thanks to the declarative YAML interface for defining orchestration logic, everyone who benefits from analytics can participate in the data pipeline creation process. The UI automatically adjusts the YAML definition any time you make changes to a workflow from the UI or via an API call. Therefore, the orchestration logic is defined declaratively in code, even if some workflow components are modified in other ways.
Flow
is the main component in Kestra. It's a container for your tasks and orchestration logic.Namespace
is used to provide logical isolation, e.g., to separate development and production environments. Namespaces are like folders on your file system ā they organize flows into logical categories and can be nested to provide a hierarchical structure.Tasks
are atomic actions in a flow. By default, all tasks in the list will be executed sequentially, with additional customization options, a.o. to run tasks in parallel or allow a failure of specific tasks when needed.Triggers
define when a flow should run. In Kestra, flows are triggered based on events. Examples of such events include:
Inputs
allow you to pass runtime-specific variables to a flow. They are strongly typed, and allow additional validation rules.Most tasks in Kestra are available as plugins, but many type of tasks are available in the core library, including a.o. script tasks supporting various programming languages (e.g., Python, Node, Bash) and the ability to orchestrate your business logic packaged into Docker container images.
To create your own plugins, check the plugin developer guide.
Kestra provides a variety of tasks to handle both simple and complex business logic, including:
disabled
to true
.id: hello
namespace: prod
description: Hi from `Kestra` and a **markdown** description.
labels:
owner: john-doe
team: data-engineering
tasks:
- id: hello
type: io.kestra.core.tasks.log.Log
message: Hello world!
description: a *very* important task
disabled: false
timeout: 10M
retry:
type: constant # type: string
interval: PT15M # type: Duration
maxDuration: PT1H # type: Duration
maxAttempt: 5 # type: int
warningOnRetry: true # type: boolean, default is false
- id: parallel
type: io.kestra.core.tasks.flows.Parallel
concurrent: 3
tasks:
- id: task1
type: io.kestra.core.tasks.scripts.Bash
commands:
- 'echo "running {{task.id}}"'
- 'sleep 10'
- id: task2
type: io.kestra.core.tasks.scripts.Bash
commands:
- 'echo "running {{task.id}}"'
- 'sleep 10'
- id: task3
type: io.kestra.core.tasks.scripts.Bash
commands:
- 'echo "running {{task.id}}"'
- 'sleep 10'
triggers:
- id: schedule
type: io.kestra.core.models.triggers.types.Schedule
cron: "*/15 * * * *"
backfill:
start: 2023-06-25T14:00:00Z
You can write workflows directly from the UI. When writing your workflows, the UI provides:
To get a local copy up and running, follow the steps below.
Make sure that Docker is installed and running on your system. The default installation requires the following:
Download the Docker Compose file:
curl -o docker-compose.yml https://raw.githubusercontent.com/kestra-io/kestra/develop/docker-compose.yml
Alternatively, you can use wget https://raw.githubusercontent.com/kestra-io/kestra/develop/docker-compose.yml
.
Start Kestra:
docker-compose up
Open http://localhost:8080
in your browser and create your first flow.
Here is a simple example logging hello world message to the terminal:
id: hello
namespace: prod
tasks:
- id: hello-world
type: io.kestra.core.tasks.log.Log
message: Hello world!
For more information:
Kestra is built on a plugin system. You can find your plugin to interact with your provider; alternatively, you can follow these steps to develop your own plugin.
For a full list of plugins, check the plugins page.
Here are some examples of the available plugins:
This list is growing quickly and we welcome contributions.
If you need help or have any questions, reach out using one of the following channels:
See the open issues for a list of proposed features (and known issues) or look at the project board.
We love contributions, big or small. Check out our contributor guide for details on how to contribute to Kestra.
See our Plugin Developer Guide for details on developing and publishing Kestra plugins.
Apache 2.0 Ā© Kestra Technologies