poetry-corpus

Hungarian poem corpus

A large corpus of Hungarian poems in XML format, annotated with grammatical features, sound patterns, and other information.

Corpus of Hungarian poems in TEI XML with machine annotation

7 stars
5 watching
2 forks
last commit: about 1 month ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
elte-dh/drama-corpus | A comprehensive annotated corpus of Hungarian drama texts, including structural annotations and grammatical features. | 1
elte-dh/regenykorpusz | A large corpus of Hungarian novels with annotated texts and metadata, developed by the Department of Digital Humanities at Eötvös Loránd University. | 4
poltextlab/hunempoli_corpus | A manually annotated corpus for training and testing machine learning models for Aspect-Based Sentiment Analysis (ABSA) in Hungarian. | 0
pld-linux/apertium-dict-es-gl | A Spanish-Galician dictionary for the Apertium rule-based machine translation platform. | 1
famrashel/idn-tagged-corpus | A manually tagged Indonesian corpus in tab-separated file format. | 88
nytud/hucola | A collection of 9,076 annotated Hungarian sentences for evaluating linguistic acceptability and grammaticality. | 1
ukrainian-to-english-corpora/folktale_corpus | A collection of Ukrainian folktales translated into English for linguistic and literary research. | 0
pld-linux/apertium-dict-en-gl | An English-Galician translation dictionary for the Apertium platform. | 1
da-southampton/redgpt | A library providing a pre-trained language model for natural language inference tasks using a transformer architecture. | 61
eleutherai/polyglot | Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. | 476
chinese-poetry/chinese-poetry | A comprehensive JSON-based repository of Chinese poetry and related texts, intended to facilitate the development of applications that use these ancient texts. | 48,381
thomaspatzke/equel | A query language for Elasticsearch that simplifies data analysis and visualization. | 56
nytud/hucopa | A dataset and annotation scheme for Hungarian causal reasoning tasks. | 1
pld-linux/apertium-dict-pt-gl | A dictionary and language pair for machine translation between Portuguese and Galician. | 1
kagnes/prevlex | A comprehensive, manually verified lexical resource for Hungarian finite verbs with possessive conjugation. | 0