flupy

Data pipeline processor

A library that provides a fluent interface for building data pipelines in Python without holding large amounts of data in memory.

Fluent data pipelines for Python and your shell
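
To make the description concrete, the sketch below shows the general idea of a fluent, lazy pipeline using only the Python standard library. It is not flupy's implementation or API: the `Pipeline` class and its `map`, `filter`, `take`, and `collect` methods are hypothetical names chosen for illustration. Each step wraps a generator, so items are pulled one at a time and no intermediate list is built.

```python
from itertools import count, islice


class Pipeline:
    """Hypothetical fluent wrapper around a lazy iterator (illustration only)."""

    def __init__(self, source):
        # Keep a single iterator; nothing is consumed until collect() runs.
        self._it = iter(source)

    def map(self, fn):
        # Lazily apply fn to each item.
        return Pipeline(fn(x) for x in self._it)

    def filter(self, pred):
        # Lazily keep only the items for which pred(item) is true.
        return Pipeline(x for x in self._it if pred(x))

    def take(self, n):
        # Lazily limit the stream to its first n items.
        return Pipeline(islice(self._it, n))

    def collect(self):
        # Terminal step: pull items through the chain into a list.
        return list(self._it)


# count() is an infinite stream, so this only terminates because every
# step is lazy and take(3) stops pulling after three items.
result = (
    Pipeline(count())
    .map(lambda x: x * x)
    .filter(lambda x: x % 2 == 0)
    .take(3)
    .collect()
)
print(result)
```

Because the source here is infinite, an eager version that built a full list at each step would never finish; the lazy chain only touches as many items as the final step asks for.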

GitHub

193 stars
8 watching
15 forks
Language: Python
Last commit: 4 months ago
collections, data-pipeline, fluent, python

Related projects:

Repository | Description | Stars
nazar256/parapipe | A library that provides a concurrent, non-blocking buffered pipeline for structuring and scaling applications. | 33
pdpipe/pdpipe | Provides a set of pre-defined data processing pipelines for pandas DataFrames. | 718
silascutler/malpipe | An ingestion and processing framework for malware and indicator data from various feeds. | 104
thephpleague/pipeline | Provides a flexible pipeline pattern implementation to compose sequential stages and process payloads in a composable manner. | 965
julienpalard/pipe | A Python library providing a simple and flexible way to process sequences of data using infix notation. | 1,965
apache/datafusion-python | A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine. | 385
ypares/porcupine | A tool that enables data manipulation and analysis pipelines to be flexible, reusable, and reproducible in different environments. | 89
databiosphere/toil | A workflow management system designed to efficiently run pipelines in various environments. | 901
h2oai/datatable | A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. | 1,821
deltares/pyflwdir | A Python package for fast and efficient hydrological and topographic data processing. | 78
substantic/rain | A framework for processing large-scale task-based pipelines in a distributed manner. | 749
raine/ramda-cli | A tool for composing functions into data-processing pipelines to produce desired output. | 573
giacbrd/smartpipeline | A framework for designing and executing concurrent data pipelines with a focus on simplicity and efficiency. | 25
druths/xp | A tool for creating flexible and self-documenting data science pipelines. | 56
huggingface/datatrove | A platform-agnostic data processing framework for large-scale text data pipelines. | 2,103