NYTK-NerKor
Hungarian NE corpus
A Hungarian named-entity-annotated corpus of 1 million tokens, with morphological annotation layers and various source files.
The home repository of the NerKor corpus, a gold-standard Hungarian named-entity-annotated corpus containing 1 million tokens.
15 stars
7 watching
6 forks
Language: Shell
last commit: over 1 year ago
Linked from 1 awesome list
Related projects:
| Repository | Description | Stars |
|---|---|---|
|  | A large annotated dataset of Hungarian text with over 30 entity types derived from various sources and formats. | 1 |
|  | A collection of 9,076 annotated sentences in Hungarian to evaluate linguistic acceptability and grammaticality | 1 |
|  | A large annotated corpus of Hungarian text with various linguistic annotations, split into development and test datasets for natural language processing tasks. | 2 |
|  | A text processing system designed to handle various tasks in Hungarian language processing using Python and TSV-based data exchange. | 28 |
|  | Harmonized tagset and annotation scheme for Hungarian morphological analysers | 4 |
|  | A collection of Hungarian NLP tools integrated as GATE processing resources | 8 |
|  | A Ukrainian NER corpus and annotation dataset for training and evaluating named entity recognition models. | 90 |
|  | A C++ tokenizer that tokenizes Hungarian text | 14 |
|  | Preprocessing and modeling scripts for Hungarian language modeling using Python and TensorFlow. | 3 |
|  | A dataset and annotation scheme for Hungarian causal reasoning tasks. | 1 |
|  | A dataset and toolset for Hungarian anaphora resolution in natural language inference tasks | 0 |
|  | A collection of linguistic datasets and benchmarks for natural language understanding tasks | 8 |
|  | An attempt to transcribe Cyrillic text into Hungarian script for a large dataset of WWII prisoner-of-war records | 23 |
|  | A large corpus of Hungarian novels with annotated texts and metadata, developed by the Department of Digital Humanities at Eötvös Loránd University. | 4 |
|  | Provides machine translation models and a demo site for Hungarian language translations | 5 |