panmorph
Hungarian tagset
Harmonized tagset and annotation scheme for Hungarian morphological analysers
Tagsets and description of Hungarian morphological analysers.
4 stars
12 watching
0 forks
last commit: almost 4 years ago
Related projects:
Repository | Description | Stars |
---|---|---|
nytud/hucola | A collection of 9,076 annotated sentences in Hungarian to evaluate linguistic acceptability and grammaticality | 1 |
nytud/emmorph | A Hungarian morphological analyser built on finite-state technology. | 14 |
nytud/emtsv | A text processing system designed to handle various tasks in Hungarian language processing using Python and TSV-based data exchange. | 28 |
nytud/nytk-nerkor | A Hungarian language named entity annotated corpus containing 1 million tokens with morphological annotation layers and various source files. | 15 |
recski/hunparse | An NLTK-based parser that provides morphological annotation for languages using KR-style annotations. | 4 |
nytud/hulu | A collection of linguistic datasets and benchmarks for natural language understanding tasks | 8 |
novakat/nytk-nerkor-cars-ontonotespp | A large annotated dataset of Hungarian text with over 30 entity types derived from various sources and formats. | 1 |
nytud/happ | A dataset of Hungarian translations of human-language examples to test anaphora resolution algorithms | 1 |
nytud/hadifogoly-adatbazis | An attempt to transcribe Cyrillic text into Hungarian script for a large dataset of WWII prisoner-of-war records | 23 |
nytud/husst | A dataset of annotated sentences for training and evaluating sentiment analysis models in the Hungarian language. | 1 |
nytud/huwnli | A dataset and toolset for Hungarian anaphora resolution in natural language inference tasks | 0 |
nytud/quntoken | A C++ tokenizer for Hungarian text | 14 |
nytud/huws | A dataset of manually curated Hungarian sentences with ambiguous wordings that require world knowledge and reasoning for resolution. | 1 |
nytud/emlam | Preprocessing and modeling scripts for Hungarian language modeling using Python and TensorFlow. | 3 |
nytud/pws | A collection of parallel corpora of Winograd schemata in multiple languages | 0 |