mara-pipelines

Pipeline framework

A lightweight ETL framework providing a simple way to define and execute data transformation pipelines using declarative Python code.

A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

GitHub

2k stars
56 watching
103 forks
Language: Python
last commit: about 1 year ago
Linked from 3 awesome lists

data, data-integration, etl, pipeline, postgresql, python

Related projects:

Repository | Description | Stars
m3dev/gokart | A framework that solves common problems in machine learning pipeline development and provides an environment for reproducibility and team collaboration. | 319
giacbrd/smartpipeline | A framework for designing and executing concurrent data pipelines with a focus on simplicity and efficiency | 25
paysure/orinoco | A functional composable pipeline framework for Python that separates business logic from implementation. | 11
databiosphere/toil | A workflow management system designed to efficiently run pipelines in various environments. | 901
neuraxio/neuraxle | A machine learning pipeline library that enables the creation of modular and reusable data processing workflows | 610
huggingface/datatrove | A platform-agnostic data processing framework for large-scale text data pipelines | 2,103
calebwin/pipelines | A language and runtime for crafting massively parallel data pipelines | 375
pwwang/pipen | A Python-based workflow automation framework that enables easy creation of data processing pipelines | 105
bjpop/rubra | A bioinformatics pipeline system that supports running workflow stages on a distributed compute cluster. | 38
thbar/kiba | A Ruby-based framework for defining and running reliable, concise, and maintainable ETL jobs | 1,752
sitecore/data-exchange-framework-docs | A documentation project for an ETL tool used in Sitecore to exchange and process data | 1
amphi-ai/amphi-etl | A tool that enables data analysts to create and manage data pipelines with an intuitive interface, generating Python code for deployment anywhere. | 933
renkun-ken/piper | Provides functions and methods to chain operations in R, enhancing readability and maintainability of data pipelines. | 169
acdemiralp/fg | An abstracted rendering pipeline framework describing frames as directed acyclic graphs of render tasks and resources. | 547
ypares/porcupine | A tool that enables data manipulation and analysis pipelines to be flexible, reusable, and reproducible in different environments | 89