bonobo
Data processor
A Python framework for parallelizing data transformations and processing
Extract, Transform, Load (ETL) for Python 3.5+
2k stars
58 watching
146 forks
Language: Python
last commit: almost 2 years ago
Linked from 1 awesome list
Tags: automation, bonobo, data-processing, extract-transform-load, parallelization, python3
Related projects:
Repository | Description | Stars |
---|---|---|
| Provides a unified Python interface to the GBIF API for retrieving biodiversity data and accessing various datasets and resources. | 114 |
| A Python implementation of a high-performance RPC framework with service discovery and traffic management features | 272 |
| A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. | 417 |
| Provides scalable, performant data loading solutions and utilities to be shared by PyTorch domain libraries | 1,149 |
| Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. | 682 |
| Provides tools to access and manipulate SDMX-compliant data in various formats | 130 |
| Provides a declarative way to handle nested data structures in Python | 1,920 |
| A Python library for functional programming that aims to simplify the experience by providing a unified API and operator overloading for common data transformations and operations. | 134 |
| A Python library providing a clean and expressive API for data cleaning by chaining multiple operations together in a logical order. | 1,371 |
| A tool to parse and retrieve biodiversity data from archived files | 45 |
| A Python binding to a C++ NLP tool for Dutch language processing tasks | 47 |
| A Python library that integrates asyncio with multiprocessing for concurrent task execution | 653 |
| A toolbox for processing earth observation data with Python. | 14 |
| A Python framework for stateful stream and event processing with built-in connectors and flexible dataflow capabilities. | 1,585 |
| A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. | 1,821 |