drake
Data workflow manager
A tool that automates the management of data workflows by organizing command execution around data and its dependencies.
Data workflow tool, like a "Make for data"
1k stars
151 watching
110 forks
Language: Clojure
last commit: over 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
apache/oozie | A system to define, manage, schedule, and execute complex data processing workflows across multiple systems using a declarative framework. | 717 |
ivansharamok/dynamicworkflows | A workflow management tool built on top of Sitecore, allowing users to define and manage dynamic workflows. | 2 |
centurylinklabs/dray | An engine for managing container-based workflows | 383 |
rbarrois/django_xworkflows | A library that enables workflow management with Django models | 106 |
richfitz/remake | A package to manage complex analysis workflows by defining a series of tasks and rules for their execution | 340 |
tauffer-consulting/domino | A platform that allows users to create and monitor complex workflows using a graphical interface and reusable Python code units | 155 |
alpha-unito/streamflow | A container-native workflow management system for managing multi-container environments and hybrid workflows. | 54 |
chadian/vorfreude | A development tool for managing project workflows and releases, utilizing Svelte and Ember frameworks. | 7 |
kevin-hanselman/dud | A lightweight tool for managing and versioning large data alongside source code in data pipelines | 184 |
fieldryand/goflow | A lightweight, single-binary DAG scheduler and dashboard for orchestrating workflows with tasks and operators | 396 |
whitaker-io/machine | A library for creating data workflows that can be simple or complex, with features like recursion and memoization. | 159 |
it4innovations/hyperqueue | A tool that automates the execution of complex workflows on HPC clusters by dynamically allocating resources and load-balancing tasks. | 292 |
pytask-dev/pytask | A workflow management system that facilitates reproducible data analyses | 115 |
snowkit/flow | A JavaScript library for declaratively building and managing data flows | 63 |
kanisterio/kanister | A framework for managing application-level data on Kubernetes with focus on data protection and workflow management. | 775 |