flupy
Data pipeline processor
A library that provides a fluent interface for building data pipelines in Python, processing items lazily so that large datasets need not be held in memory.
Fluent data pipelines for Python and your shell
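To illustrate what a fluent, memory-light pipeline means in practice, here is a minimal sketch built only on the Python standard library. The `Pipeline` class and its method names are hypothetical and chosen for illustration; they are not flupy's actual API. The sketch assumes the usual approach of wrapping an iterator so that each chained step stays lazy.

```python
from itertools import count, islice
from typing import Callable, Iterable, Iterator


class Pipeline:
    """Hypothetical fluent wrapper around a lazy iterator (not flupy's API)."""

    def __init__(self, source: Iterable):
        self._it: Iterator = iter(source)

    def map(self, fn: Callable) -> "Pipeline":
        # Generator expression: nothing is computed until items are consumed.
        return Pipeline(fn(x) for x in self._it)

    def filter(self, pred: Callable) -> "Pipeline":
        return Pipeline(x for x in self._it if pred(x))

    def take(self, n: int) -> "Pipeline":
        # islice stops after n items, so infinite sources are safe.
        return Pipeline(islice(self._it, n))

    def to_list(self) -> list:
        # Terminal step: only here are items actually pulled through the chain.
        return list(self._it)


# count() is an infinite source; only as many items as needed are generated.
result = (
    Pipeline(count())
    .map(lambda x: x * x)
    .filter(lambda x: x % 2 == 0)
    .take(3)
    .to_list()
)
print(result)  # [0, 4, 16]
```

Because every intermediate step is a generator, only one item is in flight at a time, which is why this style of pipeline can process inputs larger than available memory.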
193 stars
8 watching
15 forks
Language: Python
Last commit: 4 months ago
Tags: collections, data-pipeline, fluent, python
Related projects:
Repository | Description | Stars |
---|---|---|
nazar256/parapipe | A library that provides a concurrent, non-blocking buffered pipeline for structuring and scaling applications. | 33 |
pdpipe/pdpipe | Provides a set of pre-defined data processing pipelines for pandas DataFrames. | 718 |
silascutler/malpipe | An ingestion and processing framework for malware and indicator data from various feeds. | 104 |
thephpleague/pipeline | Provides a flexible pipeline pattern implementation to compose sequential stages and process payloads in a composable manner. | 965 |
julienpalard/pipe | A Python library providing a simple and flexible way to process sequences of data using infix notation. | 1,965 |
apache/datafusion-python | A Python library that provides a data processing and querying framework using the Apache Arrow in-memory query engine. | 385 |
ypares/porcupine | A tool that enables data manipulation and analysis pipelines to be flexible, reusable, and reproducible in different environments. | 89 |
databiosphere/toil | A workflow management system designed to efficiently run pipelines in various environments. | 901 |
h2oai/datatable | A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. | 1,821 |
deltares/pyflwdir | A Python package for fast and efficient hydrological and topographic data processing. | 78 |
substantic/rain | A framework for processing large-scale task-based pipelines in a distributed manner. | 749 |
raine/ramda-cli | A tool for composing functions into data-processing pipelines to produce desired output. | 573 |
giacbrd/smartpipeline | A framework for designing and executing concurrent data pipelines with a focus on simplicity and efficiency. | 25 |
druths/xp | A tool for creating flexible and self-documenting data science pipelines. | 56 |
huggingface/datatrove | A platform-agnostic data processing framework for large-scale text data pipelines. | 2,103 |