bonobo

Data processor

A Python framework for parallelizing data transformations and processing

Extract, Transform, Load (ETL) for Python 3.5+
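
To make the extract, transform, load idea concrete, here is a minimal sketch that uses only the Python standard library. It is not bonobo's API: the function names (`extract`, `transform`, `load`, `run_pipeline`) and the in-memory sample rows are hypothetical and chosen only for illustration. It assumes the transform step is independent per row, so rows can be processed concurrently.

```python
from concurrent.futures import ThreadPoolExecutor


def extract():
    """Yield raw rows from a hypothetical in-memory source."""
    yield from ["alpha", "beta", "gamma"]


def transform(row):
    """Normalize a single row; each call may run in a worker thread."""
    return row.upper()


def load(rows):
    """Collect transformed rows into a list (stand-in for a real sink)."""
    return list(rows)


def run_pipeline(max_workers=4):
    # Executor.map applies transform concurrently and yields results
    # in the same order as the input rows.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return load(pool.map(transform, extract()))


result = run_pipeline()
```

A thread pool suits I/O-bound transforms; for CPU-bound work a process pool would be the more usual choice.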

GitHub

2k stars
58 watching
146 forks
Language: Python
last commit: over 1 year ago
Linked from 1 awesome list

automation, bonobo, data-processing, extract-transform-load, parallelization, python3

Related projects:

Repository | Description | Stars
gbif/pygbif | Provides a unified Python interface to the GBIF API for retrieving biodiversity data and accessing various datasets and resources. | 114
apache/dubbo-python | A Python implementation of a high-performance RPC framework with service discovery and traffic management features. | 272
reubano/meza | A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. | 417
pytorch/data | Provides scalable, performant data loading solutions and utilities to be shared by PyTorch domain libraries. | 1,149
apache/pig | Enables data processing and transformation in large files using a high-level language with compile-time optimizations for efficient execution on distributed computing frameworks. | 682
dr-leo/pandasdmx | Provides tools to access and manipulate SDMX-compliant data in various formats. | 130
mahmoud/glom | Provides a declarative way to handle nested data structures in Python. | 1,920
cgarciae/phi | A Python library for functional programming that aims to simplify the experience by providing a unified API and operator overloading for common data transformations and operations. | 134
pyjanitor-devs/pyjanitor | A Python library providing a clean and expressive API for data cleaning by chaining multiple operations together in a logical order. | 1,371
belgianbiodiversityplatform/python-dwca-reader | A tool to parse and retrieve biodiversity data from archived files. | 45
proycon/python-frog | A Python binding to a C++ NLP tool for Dutch language processing tasks. | 47
dano/aioprocessing | A Python library that integrates asyncio with multiprocessing for concurrent task execution. | 653
benmack/eo-box | A toolbox for processing earth observation data with Python. | 14
bytewax/bytewax | A Python framework for stateful stream and event processing with built-in connectors and flexible dataflow capabilities. | 1,585
h2oai/datatable | A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. | 1,821