rosetta

Data processor

Tools and utilities for efficient data processing with a focus on text analysis.

Tools, wrappers, etc. for data science, with a concentration on text processing.

GitHub

206 stars
22 watching
47 forks
Language: Jupyter Notebook
Last commit: about 2 years ago
Linked from 2 awesome lists


Related projects:

Repository | Description | Stars
ceos-seo/data_cube_notebooks | A collection of Jupyter Notebooks for analyzing satellite data using the Open Data Cube algorithm and functions. | 55
data-8/datascience | An introductory data science library for Python. | 626
reubano/meza | A lightweight toolkit for processing tabular data with a focus on functional programming and PyPy compatibility. | 416
scicloj/tablecloth | A dataset manipulation library built on top of tech.ml.dataset, providing a simplified API for data processing and analysis. | 303
supercowpowers/data_hacking | A repository of interactive exercises and projects demonstrating the application of data analysis and machine learning techniques to security-related data sets. | 775
spreads/spreads | A high-performance library for real-time data processing and time series manipulation. | 427
romainchor/datascience | A collection of projects and notebooks focused on data science, machine learning, and research, showcasing various techniques and tools. | 0
comet-ml/kangas | A tool for exploring and visualizing large-scale multimedia data. | 1,041
shanky-21/data_visualization | Provides a platform for data visualization using Jupyter Notebook. | 41
techascent/tech.ml.dataset | A Clojure library for efficient tabular data processing and analysis. | 681
stamusnetworks/suricata-analytics | Provides resources and tools for analyzing Suricata data. | 27
bluenote10/nimdata | A data manipulation and analysis library built on top of the Nim programming language. | 341
linealabs/lineapy | Automates cleaning and analysis of messy data science notebooks to improve productivity and reproducibility. | 663
juliasilge/tidytext | Provides tools and data to convert text into tidy data formats for natural language processing tasks. | 1,180
geoscienceaustralia/dea-notebooks | Provides tools and workflows for analyzing geospatial data from Australian satellite imagery. | 448