flupy

Data pipeline processor

A library that provides a fluent interface for processing data pipelines in Python without holding large amounts of memory

Fluent data pipelines for python and your shell

GitHub

193 stars

8 watching

15 forks

Language: Python

last commit: almost 2 years ago

collectionsdata-pipelinefluentpython

Related projects:

Repository	Description	Stars
nazar256/parapipe	A library that provides a concurrent, non-blocking buffered pipeline for structuring and scaling applications.	33
pdpipe/pdpipe	Provides a set of pre-defined data processing pipelines for pandas DataFrames.	718
silascutler/malpipe	An ingestion and processing framework for malware and indicator data from various feeds.	104
thephpleague/pipeline	Provides a flexible pipeline pattern implementation to compose sequential stages and process payloads in a composable manner.	965
julienpalard/pipe	A Python library providing a simple and flexible way to process sequences of data using infix notation.	1,965
apache/datafusion-python	A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine.	385
ypares/porcupine	A tool that enables data manipulation and analysis pipelines to be flexible, reusable, and reproducible in different environments	89
databiosphere/toil	A workflow management system designed to efficiently run pipelines in various environments.	901
h2oai/datatable	A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support.	1,821
deltares/pyflwdir	A Python package for fast and efficient hydrological and topographic data processing	78
substantic/rain	A framework for processing large-scale task-based pipelines in a distributed manner	749
raine/ramda-cli	A tool for composing functions into data-processing pipelines to produce desired output.	573
giacbrd/smartpipeline	A framework for designing and executing concurrent data pipelines with a focus on simplicity and efficiency	25
druths/xp	A tool for creating flexible and self-documenting data science pipelines	56
huggingface/datatrove	A platform-agnostic data processing framework for large-scale text data pipelines	2,103