HuSST
Hungarian LUL
A dataset and benchmarking kit for evaluating language understanding in Hungarian
Hungarian version of the Stanford Sentiment Treebank
1 stars
2 watching
0 forks
last commit: 5 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
nytud/huws | A dataset of manually curated Hungarian sentences with ambiguous wordings that require world knowledge and reasoning for resolution. | 1 |
nytud/hulu | A collection of linguistic datasets and benchmarks for natural language understanding tasks | 9 |
nytud/happ | A dataset of Hungarian translations of human-language examples to test anaphora resolution algorithms | 1 |
nytud/hucola | A dataset of Hungarian sentences annotated for their grammatical acceptability. | 1 |
nytud/pws | A collection of parallel corpora of Winograd schemata in multiple languages | 0 |
nytud/panmorph | Harmonized tagset and annotation scheme for Hungarian morphological analysers | 4 |
alessandrogianfelici/danish_reviews_dataset | A dataset of Danish reviews scraped from the internet to train sentiment classification models | 2 |
novakat/nytk-nerkor-cars-ontonotespp | A large annotated dataset of Hungarian text with over 30 entity types derived from various sources and formats. | 1 |
nytud/huwnli | A dataset and toolset for Hungarian anaphora resolution in natural language inference tasks | 0 |
nytud/hucopa | A dataset of Hungarian translations of English 'cause-and-effect' questions with plausible alternative answers | 1 |
steffan267/sentiment-analysis-on-danish-social-media | This project provides annotated data and guidelines for fine-grained sentiment analysis on Danish social media comments. | 7 |
universaldependencies/ud_hungarian-szeged | A corpus of annotated Hungarian text data for machine learning and natural language processing tasks | 5 |
pythainlp/wisesight-sentiment | A large Thai social media text sentiment dataset with annotated labels | 77 |
nytud/emtsv | A text processing system designed to handle various tasks in Hungarian language processing using Python and TSV-based data exchange. | 27 |
e9t/nsmc | A dataset of Korean movie reviews with labeled sentiment annotations. | 566 |