pydra

Dataflow engine

A lightweight Python dataflow engine for building and executing directed acyclic graphs (DAGs) in a scalable manner.

Pydra Dataflow Engine

GitHub

120 stars
14 watching
59 forks
Language: Python
last commit: 3 days ago
Linked from 1 awesome list

brainwebdataflow-enginepython3

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
baffelli/pyperator A Python library for building asynchronous workflows with a directed acyclic graph structure 60
nipy/nipype A framework that simplifies workflow design and interaction between neuroimaging packages 750
danielgerlag/liteflow A Python library for running workflows with pluggable persistence and concurrency providers. 61
wayfair-incubator/dagger A distributed workflow engine for executing long-running business logic in a scalable and resilient way. 55
dagworks-inc/hamilton Helps define and manage data transformations with a modular, self-documenting, and portable framework for directed acyclic graphs (DAGs) of data transformations. 1,861
man-group/mdf A toolkit for expressing programs as directed acyclic graphs and wiring together computations over time-series data. 169
analysiscenter/batchflow A framework for defining and executing data processing and machine learning workflows with support for batch processing, lazy execution, and model training. 201
pydap/pydap A Python library for accessing and manipulating scientific data over the internet using the OPeNDAP protocol. 139
streamlet-dev/tributary A Python library for constructing dataflow graphs with support for reactive and lazy evaluation. 442
johnsonc/lambdo A workflow engine for unifying feature engineering and machine learning operations in data analysis pipelines 1
mehd-io/pypi-duck-flow A project to build data pipelines and visualizations for analyzing Python package download data from PyPi. 148
apache/datafusion-python A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine. 375
d6t/d6tflow A Python library to build and manage complex data science workflows efficiently 952
deqitang/pymatflow Automates preparation and submission of DFT calculations for materials science simulations 5
elyra-ai/elyra An AI-centric extension to JupyterLab Notebooks 1,854