lambdo

Data pipeline engine

A workflow engine for unifying feature engineering and machine learning operations in data analysis pipelines

A column-oriented approach to feature engineering. Feature engineering and machine learning: together at last!

GitHub

1 stars
2 watching
18 forks
Language: Python
last commit: about 6 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
asavinov/lambdo A workflow engine that unifies feature engineering and machine learning operations for data analysis. 23
kevin-hanselman/dud A lightweight tool for managing and versioning large data alongside source code in data pipelines 183
ypares/porcupine A tool that enables data manipulation and analysis pipelines to be flexible, reusable, and reproducible in different environments 89
analysiscenter/batchflow A framework for defining and executing data processing and machine learning workflows with support for batch processing, lazy execution, and model training. 201
mehd-io/pypi-duck-flow A project to build data pipelines and visualizations for analyzing Python package download data from PyPi. 148
aronchick/mlops-pipeline Automates the end-to-end machine learning workflow from code commit to model deployment 18
databiosphere/toil A workflow management system designed to efficiently run pipelines in various environments. 901
giacbrd/smartpipeline A framework for designing and executing concurrent data pipelines with a focus on simplicity and efficiency 23
pipefunc/pipefunc Automates and simplifies the creation of function pipelines for efficient execution of scientific workflows. 215
loglabs/mltrace A tool to help analyze and debug machine learning pipelines by tracking the flow of data and components through the pipeline. 468
hhio618/golem-ci A decentralized task pipeline on Golem.network using Python. 5
kubeflow-kale/kale Simplifies the deployment of Kubeflow Pipelines workflows by providing a graphical interface for Data Scientists to define and deploy pipelines directly from JupyterLab. 632
pwwang/pipen A Python-based workflow automation framework that enables easy creation of data processing pipelines 103
danielgerlag/liteflow A Python library for running workflows with pluggable persistence and concurrency providers. 61
scipipe/scipipe A flexible and efficient way to write and run complex workflows using Go programming language 1,075