HunEmPoli_corpus

Corpus

A manually annotated corpus for training and testing machine learning models for Aspect-Based Sentiment Analysis (ABSA) in Hungarian.

A corpus for Aspect-Based Sentiment Analysis (ABSA) of Hungarian parliamentary texts, annotated at the token level.

GitHub

0 stars
0 watching
1 fork
last commit: almost 2 years ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| nytud/hucola | A collection of 9,076 annotated sentences in Hungarian to evaluate linguistic acceptability and grammaticality. | 1 |
| poltextlab/sentiment_hun | A tool for analyzing the sentiment of Hungarian news articles based on word embeddings and dictionaries. | 1 |
| vadno/korkor_pilot | A large annotated corpus of Hungarian text with various linguistic annotations, split into development and test datasets for natural language processing tasks. | 2 |
| nytud/hucopa | A dataset and annotation scheme for Hungarian causal reasoning tasks. | 1 |
| elte-dh/regenykorpusz | A large corpus of Hungarian novels with annotated texts and metadata, developed by the Department of Digital Humanities at Eötvös Loránd University. | 4 |
| famrashel/idn-tagged-corpus | A manually tagged Indonesian language corpus in tab-separated file format. | 88 |
| elte-dh/drama-corpus | A comprehensive annotated corpus of Hungarian drama texts, including structural annotations and grammatical features. | 1 |
| huspacy/huspacy | An industrial-strength natural language processing library for Hungarian language text analysis. | 158 |
| hdasprachtechnologie/opinionspam | A collection of labeled text data used for training and evaluating machine learning models to detect opinion spam in text reviews. | 2 |
| lowresourcelanguages/hltdi-morphology | Provides morphological analysis tools for various languages, including verb and noun generation, based on archived web pages. | 5 |
| bertez/corpora | A collection of Galician language data in JSON format. | 2 |
| elte-dh/poetry-corpus | A large corpus of annotated Hungarian poems in XML format, with various annotations including grammatical features and sound patterns. | 7 |
| sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification with visualization tools. | 336 |
| several27/fakenewscorpus | A large dataset of news articles with labeled categories to train fake news recognition algorithms. | 385 |
| nytud/panmorph | Harmonized tagset and annotation scheme for Hungarian morphological analysers. | 4 |