HuSST

Hungarian Sentiment Dataset

A dataset of annotated sentences for training and evaluating sentiment analysis models in the Hungarian language.

Hungarian version of the Stanford Sentiment Treebank

GitHub

1 stars
2 watching
0 forks
last commit: 6 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
nytud/huws A dataset of manually curated Hungarian sentences with ambiguous wordings that require world knowledge and reasoning for resolution. 1
nytud/hulu A collection of linguistic datasets and benchmarks for natural language understanding tasks 8
nytud/happ A dataset of Hungarian translations of human-language examples to test anaphora resolution algorithms 1
nytud/hucola A collection of 9,076 annotated sentences in Hungarian to evaluate linguistic acceptability and grammaticality 1
nytud/pws A collection of parallel corpora of Winograd schemata in multiple languages 0
nytud/panmorph Harmonized tagset and annotation scheme for Hungarian morphological analysers 4
alessandrogianfelici/danish_reviews_dataset A dataset of Danish reviews scraped from the internet to train sentiment classification models 2
novakat/nytk-nerkor-cars-ontonotespp A large annotated dataset of Hungarian text with over 30 entity types derived from various sources and formats. 1
nytud/huwnli A dataset and toolset for Hungarian anaphora resolution in natural language inference tasks 0
nytud/hucopa A dataset and annotation scheme for Hungarian causal reasoning tasks. 1
steffan267/sentiment-analysis-on-danish-social-media This project provides annotated data and guidelines for fine-grained sentiment analysis on Danish social media comments. 7
universaldependencies/ud_hungarian-szeged This repository provides a Hungarian language treebank dataset in the Universal Dependencies format. 5
pythainlp/wisesight-sentiment A large Thai social media text sentiment dataset with annotated labels 77
nytud/emtsv A text processing system designed to handle various tasks in Hungarian language processing using Python and TSV-based data exchange. 28
e9t/nsmc A dataset of Korean movie reviews with labeled sentiment annotations. 566