pyjanitor
Data processor
A Python library providing a clean and expressive API for data cleaning by chaining multiple operations together in a logical order.
Clean APIs for data cleaning. A Python implementation of the R package janitor.
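The chaining style described above can be illustrated with a minimal pure-Python sketch. This is not pyjanitor's implementation or its API: the `Table` class, its methods, and the sample rows are hypothetical and use only the standard library. Each step returns a new object, so operations read top to bottom in the order they are applied.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Table:
    """Hypothetical stand-in for a dataframe: a tuple of row dicts."""

    rows: tuple[dict, ...]

    def clean_names(self) -> Table:
        # Normalize column names: trim, lowercase, spaces to underscores.
        def fix(name: str) -> str:
            return name.strip().lower().replace(" ", "_")

        return Table(tuple({fix(k): v for k, v in r.items()} for r in self.rows))

    def remove_empty(self) -> Table:
        # Drop rows in which every value is None.
        return Table(
            tuple(r for r in self.rows if any(v is not None for v in r.values()))
        )

    def rename_column(self, old: str, new: str) -> Table:
        # Rename one column, leaving all others untouched.
        return Table(
            tuple({(new if k == old else k): v for k, v in r.items()} for r in self.rows)
        )


# Illustrative input with messy column names and one fully empty row.
raw = Table((
    {"First Name": "Ada", " Age ": 36},
    {"First Name": None, " Age ": None},
))

# Each method returns a new Table, so the steps chain in a readable order.
cleaned = (
    raw
    .clean_names()
    .remove_empty()
    .rename_column("first_name", "name")
)
```

Because every method returns a fresh `Table` and leaves its input unchanged, `raw` still holds both original rows after the chain runs.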
1k stars
18 watching
172 forks
Language: Python
last commit: about 1 month ago
Linked from 1 awesome list
cleaning-data, data, data-engineering, dataframe, hacktoberfest, pandas, pydata
Related projects:
Repository | Description | Stars |
---|---|---|
pytorch/data | Provides scalable, performant data loading solutions and utilities to be shared by PyTorch domain libraries | 1,149 |
danielstjules/pjs | A tool for filtering, mapping, and reducing data in JavaScript from the command line. | 420 |
pyscripter/pyscripter | A feature-rich Python IDE with debugging and editing tools | 1,006 |
sumana2001/pybull | A collection of Python projects and tools for beginners and enthusiasts | 31 |
david-oconnor/pyflow | A tool for streamlining Python project setup and dependency management | 1,329 |
reubano/meza | A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. | 417 |
svenkreiss/pysparkling | A lightweight Python implementation of Spark's RDD and DStream interfaces for improved performance on small datasets | 262 |
doloopwhile/pyjq | A Python binding for a JSON processor that allows transforming and filtering structured data | 195 |
ujjwalkarn/datasciencepython | A curated list of tutorials and resources for learning Python for data science, machine learning, and other related topics. | 5,301 |
jturner314/py_literal | A Rust crate for parsing and formatting Python literals. | 16 |
jason-kerney/peelandslice.java | A Java implementation of a self-contained, serverless, and zero-configuration data processing framework | 1 |
deltares/pyflwdir | A Python package for fast and efficient hydrological and topographic data processing | 78 |
sparklingpandas/sparklingpandas | Enables distributed data analysis using PySpark and Pandas APIs | 362 |
pycontribs/jira | A Python library providing easy access to the Jira REST API | 1,967 |
h2oai/datatable | A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. | 1,821 |