drake
Data workflow manager
A tool that automates the management of data workflows by organizing command execution around data and its dependencies.
Data workflow tool, like a "Make for data"
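The "Make for data" idea can be illustrated with a minimal Python sketch. This is not Drake's implementation or its workflow syntax; the `Step` type and the `is_stale` and `run` functions are hypothetical names introduced here. The sketch assumes a Make-style timestamp rule (a step reruns when an output is missing or older than one of its inputs) and assumes steps are already listed in dependency order.

```python
import os
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Step:
    """One workflow step (hypothetical type): builds `outputs` from `inputs`."""
    inputs: List[str]
    outputs: List[str]
    action: Callable[[], None]


def is_stale(step: Step) -> bool:
    """Return True if the step needs to run under a Make-style timestamp rule."""
    # A step with no declared outputs, or with a missing output, always runs.
    if not step.outputs or not all(os.path.exists(p) for p in step.outputs):
        return True
    # Otherwise it runs only if some input is newer than the oldest output.
    newest_input = max((os.path.getmtime(p) for p in step.inputs), default=0.0)
    oldest_output = min(os.path.getmtime(p) for p in step.outputs)
    return newest_input > oldest_output


def run(steps: List[Step]) -> List[int]:
    """Run stale steps in the listed order; return the indices that ran."""
    ran = []
    for i, step in enumerate(steps):
        if is_stale(step):
            step.action()
            ran.append(i)
    return ran
```

Calling `run` twice in a row on unchanged inputs executes nothing the second time, which is the behavior that separates a data-dependency tool from a plain shell script.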
1k stars
151 watching
110 forks
Language: Clojure
Last commit: over 2 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
apache/oozie | A system to define, manage, schedule, and execute complex data processing workflows across multiple systems using a declarative framework. | 713 |
ivansharamok/dynamicworkflows | A workflow management tool built on top of Sitecore, allowing users to define and manage dynamic workflows. | 2 |
centurylinklabs/dray | An engine for managing container-based workflows | 383 |
rbarrois/django_xworkflows | A library that enables workflow management with Django models | 106 |
richfitz/remake | A package to manage complex analysis workflows by defining a series of tasks and rules for their execution | 340 |
tauffer-consulting/domino | A platform that allows users to create and monitor complex workflows using a graphical interface and reusable Python code units | 149 |
alpha-unito/streamflow | A container-native workflow management system for managing multi-container environments and hybrid workflows. | 52 |
chadian/vorfreude | A development tool for managing project workflows and releases, built with the Svelte and Ember frameworks. | 7 |
kevin-hanselman/dud | A lightweight tool for managing and versioning large data alongside source code in data pipelines | 183 |
fieldryand/goflow | A lightweight, single-binary DAG scheduler and dashboard for orchestrating workflows with tasks and operators | 386 |
whitaker-io/machine | A library for creating data workflows that can be simple or complex, with features like recursion and memoization. | 158 |
it4innovations/hyperqueue | A tool that automates the execution of complex workflows on HPC clusters by dynamically allocating resources and load-balancing tasks. | 278 |
pytask-dev/pytask | A workflow management system that facilitates reproducible data analyses | 114 |
snowkit/flow | A JavaScript library for declaratively building and managing data flows | 63 |
kanisterio/kanister | A framework for managing data operations on Kubernetes that abstracts away tedious details and provides a set of cohesive APIs for defining data workflows. | 763 |