dataprep

Data prep tool

A Python library for rapidly collecting, cleaning, and visualizing data with minimal code

Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.

GitHub

2k stars
27 watching
206 forks
Language: Python
last commit: 5 months ago
Linked from 2 awesome lists

apisapiwrappercleaningconnectordata-explorationdata-sciencedatacleaningdataconnectordataprepdatapreparationedaexploratory-data-analysiswebconnector

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
hi-primus/optimus A Python library that provides a simple API for data preparation and analysis on various big-data engines 1,481
ibm/data-prep-kit A toolkit for streamlining data preparation for developers building large language model applications 290
capitalone/dataprofiler A Python library to analyze and profile datasets, detecting sensitive data and generating reports. 1,434
data-8/datascience An introductory data science library for Python. 626
iceye-ltd/icecube A Python library designed to organize SAR images and annotations for supervised machine learning applications. 82
pablofrommars/fsharp-notebook An interactive data science tool for F# that provides visualization and export capabilities. 2
pyjanitor-devs/pyjanitor A Python library providing a clean and expressive API for data cleaning by chaining multiple operations together in a logical order. 1,364
cuttlefishh/python-for-data-analysis An introduction to data science using Python and Pandas with Jupyter notebooks 847
idea-fasoc/datasheet-scrubber Automates extraction of key circuit information from PDF datasheets/documents to build a database of commercial off-the-shelf IP. 51
mapbox/postgis-vt-util A set of PostgreSQL functions to assist with vector tile creation and data preparation 273
dgilland/fnc A Python library providing functional programming utilities and tools for working with generators and data structures. 256
capitalone/datacompy A tool for comparing and analyzing data in various formats, such as Pandas DataFrames and Spark DataFrames. 485
dataoneorg/d1_python A collection of Python libraries and tools for interacting with DataONE repositories 17
opendatacube/datacube-core A Python-based platform for integrated gridded data analysis from decades of Earth observation satellite data 514
olirice/flupy A library that provides a fluent interface for processing data pipelines in Python without holding large amounts of memory 193