 bonobo
 Data processor
 A Python framework for parallelizing data transformations and processing
Extract Transform Load for Python 3.5+
2k stars
 58 watching
 146 forks
 
Language: Python 
Last commit: over 2 years ago
Linked from 1 awesome list
automation, bonobo, data-processing, extract-transform-load, parallelization, python3
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | Provides a unified Python interface to the GBIF API for retrieving biodiversity data and accessing various datasets and resources. | 114 | 
|  | A Python implementation of a high-performance RPC framework with service discovery and traffic management features | 272 | 
|  | A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. | 417 | 
|  | Provides scalable, performant data loading solutions and utilities to be shared by PyTorch domain libraries | 1,149 | 
|  | Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. | 682 | 
|  | Provides tools to access and manipulate SDMX-compliant data in various formats | 130 | 
|  | Provides a declarative way to handle nested data structures in Python | 1,920 | 
|  | A Python library for functional programming that aims to simplify the experience by providing a unified API and operator overloading for common data transformations and operations. | 134 | 
|  | A Python library providing a clean and expressive API for data cleaning by chaining multiple operations together in a logical order. | 1,371 | 
|  | A tool to parse and retrieve biodiversity data from archived files | 45 | 
|  | A Python binding to a C++ NLP tool for Dutch language processing tasks | 47 | 
|  | A Python library that integrates asyncio with multiprocessing for concurrent task execution | 653 | 
|  | A toolbox for processing earth observation data with Python. | 14 | 
|  | A Python framework for stateful stream and event processing with built-in connectors and flexible dataflow capabilities. | 1,585 | 
|  | A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. | 1,821 |