emtsv
Hungarian NLP system
A text processing system designed to handle various tasks in Hungarian language processing using Python and TSV-based data exchange.
e-magyar text processing system: modules communicate via TSV, with a REST API on top
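To make the TSV-based exchange concrete, here is a minimal sketch of one module in such a pipeline: it reads TSV with a header row, appends one column, and passes the result on. This is not emtsv's actual API. The function name `add_column`, the column names `form`, `lower`, and `length`, and the use of blank lines as sentence boundaries are all assumptions made for illustration.

```python
def add_column(tsv_text, source_col, target_col, transform):
    """Append one derived column to header-bearing TSV text.

    Hypothetical helper, not part of emtsv: it only illustrates the idea
    of modules that communicate by adding columns to a shared TSV stream.
    """
    lines = tsv_text.rstrip("\n").split("\n")
    header = lines[0].split("\t")
    src = header.index(source_col)  # raises ValueError if the column is missing
    out = ["\t".join(header + [target_col])]
    for line in lines[1:]:
        if not line:
            # Assumption: a blank line separates sentences, so keep it as-is.
            out.append("")
            continue
        fields = line.split("\t")
        out.append("\t".join(fields + [transform(fields[src])]))
    return "\n".join(out) + "\n"


# Two toy "modules" chained together; each one adds a column.
tsv_in = "form\nA\nkutya\n\nUgat\n"
step1 = add_column(tsv_in, "form", "lower", str.lower)
step2 = add_column(step1, "form", "length", lambda s: str(len(s)))
print(step2, end="")
```

Because every step only appends columns and leaves the existing ones untouched, steps written this way can be reordered or swapped as long as the columns they read are present in the header.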
28 stars
9 watching
11 forks
Language: Python
last commit: 12 months ago
Linked from 1 awesome list
hungarian, nlp, python, tsv
Related projects:
Repository | Description | Stars |
---|---|---|
nytud/emlam | Preprocessing and modeling scripts for Hungarian language modeling using Python and TensorFlow. | 3 |
nytud/machine-translation | Provides machine translation models and a demo site for Hungarian language translations | 5 |
nytud/hunlp-gate | A collection of Hungarian NLP tools integrated as GATE processing resources | 8 |
nytud/panmorph | Harmonized tagset and annotation scheme for Hungarian morphological analysers | 4 |
nytud/hadifogoly-adatbazis | An attempt to transcribe Cyrillic text into Hungarian script for a large dataset of WWII prisoner-of-war records | 23 |
nytud/nytk-nerkor | A Hungarian language named entity annotated corpus containing 1 million tokens with morphological annotation layers and various source files. | 15 |
nytud/hucola | A collection of 9,076 annotated sentences in Hungarian to evaluate linguistic acceptability and grammaticality | 1 |
tyson925/magyarlanc_spark | A Spark-based tool for processing Hungarian text data with Magyarlanc language processing features and optional integration with ElasticSearch. | 4 |
nytud/emmorph | A Hungarian morphological analyzer (emMorph, from the Humor family of analyzers) built on finite-state grammar. | 14 |
sedthh/lara-hungarian-nlp | A lightweight Python library for natural language processing in Hungarian | 29 |
nytud/huws | A dataset of manually curated Hungarian sentences with ambiguous wordings that require world knowledge and reasoning for resolution. | 1 |
nytud/quntoken | A C++ tokenizer that tokenizes Hungarian text | 14 |
huspacy/huspacy | An industrial-strength natural language processing library for Hungarian language text analysis | 158 |
davidnemeskey/embert | Provides pre-trained transformer-based models and tools for natural language processing tasks | 2 |
nytud/hulu | A collection of linguistic datasets and benchmarks for natural language understanding tasks | 8 |