pyjanitor

Data processor

A Python library providing a clean and expressive API for data cleaning by chaining multiple operations together in a logical order.

Clean APIs for data cleaning. Python implementation of R package Janitor

GitHub

1k stars
18 watching
170 forks
Language: Python
last commit: 6 days ago
Linked from 1 awesome list

cleaning-datadatadata-engineeringdataframehacktoberfestpandaspydata

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
pytorch/data A PyTorch project providing data loading utilities and scalable dataloading solutions 1,133
danielstjules/pjs A tool for filtering, mapping, and reducing data in JavaScript from the command line. 419
pyscripter/pyscripter A feature-rich Python IDE with debugging and editing tools 993
sumana2001/pybull A collection of Python projects and tools for beginners and enthusiasts 31
david-oconnor/pyflow A tool for streamlining Python project setup and dependency management 1,329
reubano/meza A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. 416
svenkreiss/pysparkling A lightweight Python implementation of Spark's RDD and DStream interfaces for improved performance on small datasets 262
doloopwhile/pyjq A Python binding for a JSON processor that allows transforming and filtering structured data 196
ujjwalkarn/datasciencepython A curated list of tutorials and resources for learning Python for data science, machine learning, and other related topics. 5,274
jturner314/py_literal A Rust crate for parsing and formatting Python literals. 16
jason-kerney/peelandslice.java A Java implementation of a self-contained, serverless, and zero-configuration data processing framework 1
deltares/pyflwdir A Python package for fast and efficient hydrological and topographic data processing 75
sparklingpandas/sparklingpandas Enables distributed data analysis using PySpark and Pandas APIs 361
pycontribs/jira A Python library providing easy access to the Jira REST API 1,959
h2oai/datatable A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. 1,817