panmorph

Hungarian tagset

Harmonized tagset and annotation scheme for Hungarian morphological analysers

Tagsets and description of Hungarian morphological analysers.

GitHub

4 stars
12 watching
0 forks
last commit: over 3 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
nytud/hucola A dataset of Hungarian sentences annotated for their grammatical acceptability. 1
nytud/emmorph An online Hungarian humor analysis tool using morphology and finite-state grammar. 14
nytud/emtsv A text processing system designed to handle various tasks in Hungarian language processing using Python and TSV-based data exchange. 27
nytud/nytk-nerkor A Hungarian language named entity annotated corpus containing 1 million tokens with morphological annotation layers and various source files. 14
recski/hunparse An NLTK-based parser that provides morphological annotation for languages using KR-style annotations. 4
nytud/hulu A collection of linguistic datasets and benchmarks for natural language understanding tasks 9
novakat/nytk-nerkor-cars-ontonotespp A large annotated dataset of Hungarian text with over 30 entity types derived from various sources and formats. 1
nytud/happ A dataset of Hungarian translations of human-language examples to test anaphora resolution algorithms 1
nytud/hadifogoly-adatbazis An attempt to transcribe Cyrillic text into Hungarian script for a large dataset of WWII prisoner-of-war records 23
nytud/husst A dataset and benchmarking kit for evaluating language understanding in Hungarian 1
nytud/huwnli A dataset and toolset for Hungarian anaphora resolution in natural language inference tasks 0
nytud/quntoken A C++ tokenizer that tokenizes Hungarian text 14
nytud/huws A dataset of manually curated Hungarian sentences with ambiguous wordings that require world knowledge and reasoning for resolution. 1
nytud/emlam Preprocessing and modeling scripts for Hungarian language modeling using Python and TensorFlow. 3
nytud/pws A collection of parallel corpora of Winograd schemata in multiple languages 0