rubra

Pipeline manager

A bioinformatics pipeline system that supports running workflow stages on a distributed compute cluster.

Infrastructure code to support DNA pipeline

GitHub

38 stars
9 watching
18 forks
Language: Python
last commit: over 9 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
natcap/taskgraph A Python library for managing and optimizing computational workflows with parallel processing and data reuse. 21
ssadedin/bpipe A tool for running and managing bioinformatics pipelines by abstracting away low-level details and providing features such as dependency tracking, transactional management, and parallelism. 230
samapriya/planet-gee-pipeline-cli A command-line tool for automating data processing and uploads from Planet's API to Google Earth Engine. 42
galaxyproject/galaxy An integrated framework for data-intensive scientific analysis and workflow management 1,410
montilab/pipeliner A framework for defining and automating bioinformatics pipelines using Nextflow. 44
prodmodel/prodmodel A tool for managing data science pipelines by automating build, testing, and deployment processes while ensuring correctness and performance. 59
seldonio/tempo An MLOps Python library that enables data scientists to deploy and orchestrate machine learning pipelines for production-ready inference. 116
linkedin/brooklin A distributed system for streaming data between heterogeneous systems with high reliability and throughput at scale 920
alexanderrichtertd/plex A comprehensive pipeline management system for VFX, animation, and game production workflows 245
kakaobrain/torchgpipe A PyTorch-based library for efficient training of large neural networks using pipeline parallelism and automatic recomputation of gradients. 817
databiosphere/toil A workflow management system designed to efficiently run pipelines in various environments. 901
druths/xp A tool for creating flexible and self-documenting data science pipelines 56
kevin-hanselman/dud A lightweight tool for managing and versioning large data alongside source code in data pipelines 183
pubs/pubs A command line tool to organize and manage scientific papers' bibliographic data. 271
hyfather/pipeline A package implementing pipelines using goroutines to manage concurrency in Go applications. 58