rubra
Pipeline manager
A bioinformatics pipeline system that supports running workflow stages on a distributed compute cluster.
Infrastructure code to support a DNA pipeline
38 stars
9 watching
18 forks
Language: Python
Last commit: over 9 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
natcap/taskgraph | A Python library for managing and optimizing computational workflows with parallel processing and data reuse. | 21 |
ssadedin/bpipe | A tool for running and managing bioinformatics pipelines by abstracting away low-level details and providing features such as dependency tracking, transactional management, and parallelism. | 230 |
samapriya/planet-gee-pipeline-cli | A command-line tool for automating data processing and uploads from Planet's API to Google Earth Engine. | 42 |
galaxyproject/galaxy | An integrated framework for data-intensive scientific analysis and workflow management. | 1,410 |
montilab/pipeliner | A framework for defining and automating bioinformatics pipelines using Nextflow. | 44 |
prodmodel/prodmodel | A tool for managing data science pipelines by automating build, testing, and deployment processes while ensuring correctness and performance. | 59 |
seldonio/tempo | An MLOps Python library that enables data scientists to deploy and orchestrate machine learning pipelines for production-ready inference. | 116 |
linkedin/brooklin | A distributed system for streaming data between heterogeneous systems with high reliability and throughput at scale. | 920 |
alexanderrichtertd/plex | A comprehensive pipeline management system for VFX, animation, and game production workflows. | 245 |
kakaobrain/torchgpipe | A PyTorch-based library for efficient training of large neural networks using pipeline parallelism and automatic recomputation of gradients. | 817 |
databiosphere/toil | A workflow management system designed to efficiently run pipelines in various environments. | 901 |
druths/xp | A tool for creating flexible and self-documenting data science pipelines. | 56 |
kevin-hanselman/dud | A lightweight tool for managing and versioning large data alongside source code in data pipelines. | 183 |
pubs/pubs | A command line tool to organize and manage scientific papers' bibliographic data. | 271 |
hyfather/pipeline | A package implementing pipelines using goroutines to manage concurrency in Go applications. | 58 |