drake

Data workflow manager

A tool that automates the management of data workflows by organizing command execution around data and its dependencies.

Data workflow tool, like a "Make for data"
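
To make the "Make for data" idea concrete, here is a minimal Python sketch of organizing steps around data files and their dependencies. It is not drake's syntax or implementation. The `Step` class and the `is_stale` and `run_order` helpers are hypothetical names invented for this illustration, and the timestamp rule is the classic Make convention, assumed here rather than taken from the project.

```python
import os
from dataclasses import dataclass


@dataclass
class Step:
    """Hypothetical workflow step: a command that turns input files into output files."""
    name: str
    inputs: list
    outputs: list


def is_stale(step, mtime=os.path.getmtime, exists=os.path.exists):
    """Return True if the step should run: an output is missing or older than an input."""
    if not all(exists(p) for p in step.outputs):
        return True
    if not step.inputs:
        return False
    newest_input = max(mtime(p) for p in step.inputs)
    oldest_output = min(mtime(p) for p in step.outputs)
    return newest_input > oldest_output


def run_order(steps):
    """Order step names so every producer precedes its consumers (depth-first topological sort)."""
    producer = {out: s for s in steps for out in s.outputs}
    ordered, seen = [], set()

    def visit(step, path=()):
        if step.name in seen:
            return
        if step.name in path:
            raise ValueError(f"dependency cycle at {step.name}")
        for inp in step.inputs:
            if inp in producer:  # input is produced by another step, so run that first
                visit(producer[inp], path + (step.name,))
        seen.add(step.name)
        ordered.append(step.name)

    for s in steps:
        visit(s)
    return ordered
```

A runner built on this sketch would walk `run_order(steps)` and execute only the steps for which `is_stale` returns True, so unchanged data is not recomputed.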

GitHub

1k stars
151 watching
110 forks
Language: Clojure
Last commit: over 2 years ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| apache/oozie | A system to define, manage, schedule, and execute complex data processing workflows across multiple systems using a declarative framework. | 713 |
| ivansharamok/dynamicworkflows | A workflow management tool built on top of Sitecore, allowing users to define and manage dynamic workflows. | 2 |
| centurylinklabs/dray | An engine for managing container-based workflows | 383 |
| rbarrois/django_xworkflows | A library that enables workflow management with Django models | 106 |
| richfitz/remake | A package to manage complex analysis workflows by defining a series of tasks and rules for their execution | 340 |
| tauffer-consulting/domino | A platform that allows users to create and monitor complex workflows using a graphical interface and reusable Python code units | 149 |
| alpha-unito/streamflow | A container-native workflow management system for managing multi-container environments and hybrid workflows. | 52 |
| chadian/vorfreude | A development tool for managing project workflows and releases, utilizing Svelte and Ember frameworks. | 7 |
| kevin-hanselman/dud | A lightweight tool for managing and versioning large data alongside source code in data pipelines | 183 |
| fieldryand/goflow | A lightweight, single-binary DAG scheduler and dashboard for orchestrating workflows with tasks and operators | 386 |
| whitaker-io/machine | A library for creating data workflows that can be simple or complex, with features like recursion and memoization. | 158 |
| it4innovations/hyperqueue | A tool that automates the execution of complex workflows on HPC clusters by dynamically allocating resources and load-balancing tasks. | 278 |
| pytask-dev/pytask | A workflow management system that facilitates reproducible data analyses | 114 |
| snowkit/flow | A JavaScript library for declaratively building and managing data flows | 63 |
| kanisterio/kanister | A framework for managing data operations on Kubernetes that abstracts away tedious details and provides a set of cohesive APIs for defining data workflows. | 763 |