poetry-corpus
Hungarian poem corpus
A large corpus of annotated Hungarian poems in XML format, with various annotations including grammatical features and sound patterns.
Corpus of Hungarian poems in TEI XML with machine annotation
7 stars
5 watching
2 forks
last commit: about 1 month ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
elte-dh/drama-corpus | A comprehensive annotated corpus of Hungarian drama texts, including structural annotations and grammatical features. | 1 |
elte-dh/regenykorpusz | A large corpus of Hungarian novels with annotated texts and metadata, developed by the Department of Digital Humanities at Eötvös Loránd University. | 4 |
poltextlab/hunempoli_corpus | A manually annotated corpus for training and testing machine learning models of Aspect Based Sentiment Analysis (ABSA) in Hungarian language. | 0 |
pld-linux/apertium-dict-es-gl | A dictionary file for machine translation between two languages using a specific rule-based machine translation system | 1 |
famrashel/idn-tagged-corpus | A manually tagged Indonesian language corpus in tab-separated file format | 88 |
nytud/hucola | A collection of 9,076 annotated sentences in Hungarian to evaluate linguistic acceptability and grammaticality | 1 |
ukrainian-to-english-corpora/folktale_corpus | A collection of Ukrainian folktales translated into English for linguistic and literary research purposes. | 0 |
pld-linux/apertium-dict-en-gl | An English-Galician language translation dictionary for the Apertium platform. | 1 |
da-southampton/redgpt | A library providing a pre-trained language model for natural language inference tasks using a transformer architecture. | 61 |
eleutherai/polyglot | Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. | 476 |
chinese-poetry/chinese-poetry | A comprehensive JSON-based repository of Chinese poetry and related texts, aiming to facilitate development of applications using these ancient texts. | 48,381 |
thomaspatzke/equel | A query language for Elasticsearch that simplifies data analysis and visualization. | 56 |
nytud/hucopa | A dataset and annotation scheme for Hungarian causal reasoning tasks. | 1 |
pld-linux/apertium-dict-pt-gl | A dictionary and language pair for machine translation between Portuguese and Galician | 1 |
kagnes/prevlex | Provides a comprehensive, manually verified lexical resource for Hungarian finite verbs with possessive conjugation | 0 |