 goodreads
 goodreads 
 Dataset explorer
 Provides code samples and notebooks to download, read, and analyze Goodreads datasets for research purposes.
code samples for the goodreads datasets
252 stars
 3 watching
 59 forks
 
Language: Jupyter Notebook 
last commit: over 2 years ago 
Linked from   1 awesome list  
  book-reviewscomputational-social-sciencedatasetmachine-learningnatural-language-processingrecommendation-systemrecommender-systemresearchspoilers 
 Related projects:
| Repository | Description | Stars | 
|---|---|---|
|  | A collection of data for evaluating Chinese machine reading comprehension systems | 419 | 
|  | Detects whether a given text sequence is part of the training data used to train a large language model. | 23 | 
|  | An automated tool that updates a pinned GitHub Gist with information about currently reading books and progress on Goodreads. | 26 | 
|  | A dataset of Arabic book reviews for natural language processing tasks | 44 | 
|  | A tool for exploring and visualizing large-scale multimedia data | 1,045 | 
|  | An introduction to data science using Python and Pandas with Jupyter notebooks | 851 | 
|  | A dataset of Danish reviews scraped from the internet to train sentiment classification models | 2 | 
|  | A collection of data science learning materials in the form of IPython Notebooks covering various techniques such as regression, classification, and clustering. | 2,964 | 
|  | An opinionated guide to data modeling with MongoDB. | 89 | 
|  | An introduction to data science concepts and applications in R and Python using hands-on tutorials | 597 | 
|  | A collection of Go-based resources and tools for data science tasks | 879 | 
|  | Repository providing example notebooks for Deep Learning applications with TensorFlow and Earth Engine. | 76 | 
|  | Provides tools for interpreting deep neural networks | 42 | 
|  | Provides datasets and tools for analyzing source code in various aspects such as programming languages, commits, and more. | 323 | 
|  | A comprehensive toolkit for working with atmospheric time-series datasets | 152 |