dagster
Data pipeline orchestrator
An orchestration platform for data pipelines and assets, providing a declarative programming model and integrated lineage and observability.
An orchestration platform for the development, production, and observation of data assets.
12k stars
124 watching
2k forks
Language: Python
last commit: 2 months ago
Linked from 10 awesome lists
analyticsdagsterdata-engineeringdata-integrationdata-orchestratordata-pipelinesdata-scienceetlmetadatamlopsorchestrationpythonschedulerworkflowworkflow-automation
Backlinks from these awesome lists:
-
ethicalml/awesome-production-machine-learning
-
runacapital/awesome-oss-alternatives
-
oxnr/awesome-bigdata
-
igorbarinov/awesome-data-engineering
-
pditommaso/awesome-pipeline
-
kelvins/awesome-mlops
-
gunnarmorling/awesome-opensource-data-engineering
-
vihar/awesome-oss-saas
-
kelvins/awesome-dataops
-
simomay/find-oss
Related projects:
Repository | Description | Stars |
---|---|---|
| A tool to orchestrate applications using a version-controlled dataflow | 1,083 |
| An agile pipeline framework for data engineering teams to track and orchestrate their data processes. | 260 |
| Automates and simplifies the creation of function pipelines for efficient execution of scientific workflows. | 230 |
| A platform for defining and executing scientific pipelines in distributed environments using C++ and Python. | 16 |
| A continuous big data ingestion platform that enables easy creation of data pipelines for various data sources and destinations. | 90 |
| A platform to programmatically author, schedule and monitor complex workflows | 37,580 |
| An orchestration platform for automating DevOps workflows by combining tools and services into a single, GUI-based solution | 339 |
| Helps define and manage data transformations with a modular, self-documenting, and portable framework for directed acyclic graphs (DAGs) of data transformations. | 1,900 |
| A utility and developer library for data streams catching and aggregation | 154 |
| A distributed workflow management system that coordinates services and scripts into complex workflows. | 538 |
| A platform for data-intensive scientific analysis and workflow management | 1,431 |
| A Mesos scheduler that enables deployment and management of long-running applications with high availability and scalability. | 408 |
| A lightweight tool for managing and versioning large data alongside source code in data pipelines | 184 |
| Provides a unified interface for constructing and managing workflows across different workflow engines. | 919 |
| A toolbox for industrial data analytics and stream processing | 614 |