HunEmPoli_corpus

Corpus

A manually annotated corpus for training and testing machine learning models for Aspect-Based Sentiment Analysis (ABSA) in Hungarian.

Corpus for Aspect-Based Sentiment Analysis (ABSA) of Hungarian parliamentary texts, annotated at the token level.

GitHub

0 stars
0 watching
1 fork
last commit: almost 2 years ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
nytud/hucola | A dataset of Hungarian sentences annotated for their grammatical acceptability. | 1
poltextlab/sentiment_hun | A tool for analyzing the sentiment of Hungarian news articles based on word embeddings and dictionaries. | 1
vadno/korkor_pilot | A large annotated corpus of Hungarian text with various linguistic annotations, split into development and test datasets for natural language processing tasks. | 2
nytud/hucopa | A dataset of Hungarian translations of English 'cause-and-effect' questions with plausible alternative answers | 1
elte-dh/regenykorpusz | A large corpus of Hungarian novels with annotated texts and metadata, developed by the Department of Digital Humanities at Eötvös Loránd University. | 4
famrashel/idn-tagged-corpus | A manually tagged Indonesian language corpus in tab-separated file format | 88
elte-dh/drama-corpus | A comprehensive annotated corpus of Hungarian drama texts, including structural annotations and grammatical features. | 1
huspacy/huspacy | An industrial-strength natural language processing library for Hungarian language text analysis | 155
hdasprachtechnologie/opinionspam | A collection of labeled text data used for training and evaluating machine learning models to detect opinion spam in text reviews. | 2
lowresourcelanguages/hltdi-morphology | Provides morphological analysis tools for various languages, including verb and noun generation, based on archived web pages. | 5
bertez/corpora | A collection of Galician language data in JSON format. | 2
elte-dh/poetry-corpus | A comprehensive poetry corpus with annotated text data in TEI XML format | 7
sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336
several27/fakenewscorpus | A large dataset of news articles with labeled categories to train fake news recognition algorithms | 387
nytud/panmorph | Harmonized tagset and annotation scheme for Hungarian morphological analysers | 4