awesome-public-datasets

Dataset repository

A curated collection of high-quality public datasets organized by topic.

A topic-centric list of HQ open datasets.

GitHub

61k stars
2k watching
10k forks
last commit: 8 days ago
Linked from 23 awesome lists

aaron-swartz, awesome-public-datasets, datasets, opendata

Related projects:

Repository | Description | Stars
awesomedata/apd-core | Provides metadata and core functionality for a curated collection of public datasets. | 358
datasciencemasters/data | A curated list of accessible data sources with clear licensing terms and moderate restrictions. | 506
gopherdata/resources | A collection of Go-based resources and tools for data science tasks. | 876
aitutorials/datasets | A comprehensive collection of datasets from various AI-related sources worldwide. | 46
hepdata/hepdata | A web application for managing and sharing high-energy physics data from experiments. | 41
meteoswiss/publication-opendata | Provides access to standardized meteorological and climatological data from MeteoSwiss. | 70
cidree/forestdata | A package providing easy access to forestry and land use datasets. | 13
alessandrogianfelici/danish_reviews_dataset | A dataset of Danish reviews scraped from the internet to train sentiment classification models. | 2
kwstat/agridat | A collection of agricultural datasets and analysis tools. | 118
osdg-ai/osdg-data | A dataset of human-labeled text excerpts validated against the Sustainable Development Goals. | 28
pizzadedados/datascience-pizza | A resource collection and community hub for data science knowledge and learning. | 2,361
openarabic/ocr_gs_data | A collection of double-checked gold standard data for training and testing OCR engines. | 13
iqss/dataverse | A platform for sharing and preserving research data among researchers and institutions worldwide. | 882
gogs/docs-api | A repository documenting Gogs API v1 usage and endpoints. | 136
endatabas/endb | A database with full history and immutable data storage using Common Lisp. | 275