nsmc
Movie review dataset
A dataset of Korean movie reviews with labeled sentiment annotations.
Naver sentiment movie corpus
566 stars
23 watching
212 forks
Language: Python
last commit: over 7 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
alessandrogianfelici/danish_reviews_dataset | A dataset of Danish reviews scraped from the internet to train sentiment classification models | 2 |
nytud/husst | A dataset and benchmarking kit for evaluating language understanding in Hungarian | 1 |
ichait/moviemon | A command-line tool to display information about movies in a directory. | 208 |
haoopeng/cnn-yelp-challenge-2016-sentiment-classification | This repository trains a Convolutional Neural Network to classify customer reviews based on their sentiment. | 109 |
ymcui/cmrc2018 | A collection of data for evaluating Chinese machine reading comprehension systems | 415 |
helsinki-nlp/xed | A multilingual dataset for sentiment analysis and emotion detection from movie subtitles. | 56 |
shahroudy/nturgb-d | A large-scale dataset for human action recognition | 758 |
neuro-inc/ml-recipe-hier-attention | An implementation of a neural network architecture for sentiment classification using hierarchical attention mechanisms. | 2 |
mirfan899/urdu | A collection of Urdu language datasets for various NLP tasks and applications | 71 |
nosmokingbandit/watcher3 | Automated movie NZB & Torrent searcher and snatcher with post-processing capabilities | 279 |
jagerv3/sentiment_analysis_thai | Analyzes sentiment in Thai text using machine learning algorithms and natural language processing techniques. | 12 |
fwang91/imdb-face | A large-scale noise-controlled face recognition dataset designed to study the impact of data noise on recognition accuracy. | 431 |
kevincobain2000/sentiment_classifier | A Python library for sentiment analysis using word sense disambiguation and machine learning algorithms. | 172 |
hadley/data-movies | Extracts relevant data from IMDB movie files and stores it in a usable format | 203 |
mohamedadaly/labr | A dataset of Arabic book reviews for natural language processing tasks | 44 |