goodreads

Dataset explorer

Provides code samples and notebooks to download, read, and analyze Goodreads datasets for research purposes.

code samples for the goodreads datasets

GitHub

251 stars
3 watching
59 forks
Language: Jupyter Notebook
last commit: over 1 year ago
Linked from 1 awesome list

book-reviewscomputational-social-sciencedatasetmachine-learningnatural-language-processingrecommendation-systemrecommender-systemresearchspoilers

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ymcui/cmrc2018 A collection of data for evaluating Chinese machine reading comprehension systems 415
pratyushmaini/llm_dataset_inference Detects whether a given text sequence is part of the training data used to train a large language model. 23
mdluo/goodreads-box An automated tool that updates a pinned GitHub Gist with information about currently reading books and progress on Goodreads. 26
mohamedadaly/labr A dataset of Arabic book reviews for natural language processing tasks 44
comet-ml/kangas A tool for exploring and visualizing large-scale multimedia data 1,041
cuttlefishh/python-for-data-analysis An introduction to data science using Python and Pandas with Jupyter notebooks 847
alessandrogianfelici/danish_reviews_dataset A dataset of Danish reviews scraped from the internet to train sentiment classification models 2
nborwankar/learndatascience A collection of data science learning materials in the form of IPython Notebooks covering various techniques such as regression, classification, and clustering. 2,958
jsoendermann/mongostyleguide An opinionated guide to data modeling with MongoDB. 89
jadianes/data-science-your-way An introduction to data science concepts and applications in R and Python using hands-on tutorials 594
gopherdata/resources A collection of Go-based resources and tools for data science tasks 876
gee-community/ee-tensorflow-notebooks Repository providing example notebooks for Deep Learning applications with TensorFlow and Earth Engine. 75
datamllab/xdeep Provides tools for interpreting deep neural networks 42
src-d/datasets Provides datasets and tools for analyzing source code in various aspects such as programming languages, commits, and more. 323
arm-doe/act A comprehensive toolkit for working with atmospheric time-series datasets. 146