pyjanitor

Data processor

A Python library providing a clean and expressive API for data cleaning by chaining multiple operations together in a logical order.

Clean APIs for data cleaning. Python implementation of R package Janitor

GitHub

1k stars
18 watching
172 forks
Language: Python
last commit: about 1 month ago
Linked from 1 awesome list

cleaning-datadatadata-engineeringdataframehacktoberfestpandaspydata

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
pytorch/data Provides scalable, performant data loading solutions and utilities to be shared by PyTorch domain libraries 1,149
danielstjules/pjs A tool for filtering, mapping, and reducing data in JavaScript from the command line. 420
pyscripter/pyscripter A feature-rich Python IDE with debugging and editing tools 1,006
sumana2001/pybull A collection of Python projects and tools for beginners and enthusiasts 31
david-oconnor/pyflow A tool for streamlining Python project setup and dependency management 1,329
reubano/meza A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. 417
svenkreiss/pysparkling A lightweight Python implementation of Spark's RDD and DStream interfaces for improved performance on small datasets 262
doloopwhile/pyjq A Python binding for a JSON processor that allows transforming and filtering structured data 195
ujjwalkarn/datasciencepython A curated list of tutorials and resources for learning Python for data science, machine learning, and other related topics. 5,301
jturner314/py_literal A Rust crate for parsing and formatting Python literals. 16
jason-kerney/peelandslice.java A Java implementation of a self-contained, serverless, and zero-configuration data processing framework 1
deltares/pyflwdir A Python package for fast and efficient hydrological and topographic data processing 78
sparklingpandas/sparklingpandas Enables distributed data analysis using PySpark and Pandas APIs 362
pycontribs/jira A Python library providing easy access to the Jira REST API 1,967
h2oai/datatable A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. 1,821