meza

Data processor

A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility.

A Python toolkit for processing tabular data

GitHub

416 stars
18 watching
32 forks
Language: Python
last commit: 4 months ago
Linked from 1 awesome list

csvdataexcelfeaturedfunctional-programminglibrarypandastabular-dataxlsxxml

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
apache/samza A distributed stream processing framework for handling high-volume data streams with fault tolerance and durability guarantees 819
dkogan/vnlog A toolkit for manipulating tabular ASCII data with normal UNIX tools. 160
dr-leo/pandasdmx Provides tools to access and manipulate SDMX-compliant data in various formats 127
kapolos/pramda A PHP implementation of functional programming concepts to simplify data processing and analysis. 245
columbia-applied-data-science/rosetta Tools and utilities for efficient data processing with a focus on text analysis. 206
alanmarazzi/panthera A Clojure-based library for working with dataframes and numerical computations using Python libraries. 189
pyjanitor-devs/pyjanitor A Python library providing a clean and expressive API for data cleaning by chaining multiple operations together in a logical order. 1,364
ileriayo/ileriayo A software project focused on developing a system for managing and processing complex data flows. 52
h2oai/datatable A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. 1,817
quixio/quix-streams A Python framework for real-time data processing on Apache Kafka streams 1,190
nathanmarz/cascalog A library for data processing and querying on large datasets without the need for Hadoop expertise 1,376
bl3f/yato An orchestrator for DuckDB databases that automates data transformation and integration with other tools. 174
python-bonobo/bonobo A Python framework for parallelizing data transformations and processing 1,589
scicloj/tablecloth A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis. 303
benmack/eo-box A toolbox for processing earth observation data with Python. 14