hamilton

Dataflow framework

Helps define and manage data transformations with a modular, self-documenting, and portable framework for directed acyclic graphs (DAGs) of data transformations.

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

GitHub

2k stars
17 watching
124 forks
Language: Jupyter Notebook
last commit: 7 days ago
Linked from 4 awesome lists

dagdata-analysisdata-engineeringdata-sciencedataframeetletl-frameworketl-pipelinefeature-engineeringhacktoberfestlineagellmopsmachine-learningmlopsorchestrationpandaspythonragsoftware-engineering

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
nipype/pydra A lightweight Python dataflow engine for building and executing directed acyclic graphs (DAGs) in a scalable manner. 120
pipefunc/pipefunc Automates and simplifies the creation of function pipelines for efficient execution of scientific workflows. 215
dagworks-inc/burr A framework for building applications that make decisions using LLMs and state machines. 1,266
rhosocial/go-dag A framework for managing and executing workflows described by directed acyclic graphs. 21
chunelfeng/cgraph A C++ framework for building directed acyclic graph-based parallel computations with dependency and scheduling management 1,779
man-group/mdf A toolkit for expressing programs as directed acyclic graphs and wiring together computations over time-series data. 169
symphony09/ograph A framework for building data pipelines with concurrent execution and dependency management 32
graphprotocol/graph-client A library and toolset for building fast, performant GraphQL-based decentralized applications 174
eclipse-zenoh-flow/zenoh-flow A framework for declarative data flow programming and edge computing 91
daostack/arc A platform providing a modular, upgradeable infrastructure for decentralized autonomous organizations (DAOs) on the Ethereum blockchain. 47
yadage/adage A package to dynamically build and manage directed acyclic graphs (DAGs) of tasks that can be executed in parallel or sequentially. 55
erikbrinkman/d3-dag A library that provides a data structure and algorithms for visualizing directed acyclic graphs 1,454
dagster-io/dagster An orchestration platform for data pipelines and assets, providing a declarative programming model and integrated lineage and observability. 11,699
jexia/semaphore A tool for building and managing complex data flows across multiple protocols 94
google/digitalbuildings Provides tools and an ontology for representing and managing structured information about buildings and equipment. 371