emtsv

Hungarian NLP system

A text processing system designed to handle various tasks in Hungarian language processing using Python and TSV-based data exchange.

e-magyar text processing system -- inter-module communication via tsv + REST API

GitHub

28 stars
9 watching
11 forks
Language: Python
last commit: 12 months ago
Linked from 1 awesome list

hungariannlppythontsv

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
nytud/emlam Preprocessing and modeling scripts for Hungarian language modeling using Python and TensorFlow. 3
nytud/machine-translation Provides machine translation models and a demo site for Hungarian language translations 5
nytud/hunlp-gate A collection of Hungarian NLP tools integrated as GATE processing resources 8
nytud/panmorph Harmonized tagset and annotation scheme for Hungarian morphological analysers 4
nytud/hadifogoly-adatbazis An attempt to transcribe Cyrillic text into Hungarian script for a large dataset of WWII prisoner-of-war records 23
nytud/nytk-nerkor A Hungarian language named entity annotated corpus containing 1 million tokens with morphological annotation layers and various source files. 15
nytud/hucola A collection of 9,076 annotated sentences in Hungarian to evaluate linguistic acceptability and grammaticality 1
tyson925/magyarlanc_spark A Spark-based tool for processing Hungarian text data with Magyarlanc language processing features and optional integration with ElasticSearch. 4
nytud/emmorph An online Hungarian humor analysis tool using morphology and finite-state grammar. 14
sedthh/lara-hungarian-nlp A lightweight Python library for natural language processing in Hungarian 29
nytud/huws A dataset of manually curated Hungarian sentences with ambiguous wordings that require world knowledge and reasoning for resolution. 1
nytud/quntoken A C++ tokenizer that tokenizes Hungarian text 14
huspacy/huspacy An industrial-strength natural language processing library for Hungarian language text analysis 158
davidnemeskey/embert Provides pre-trained transformer-based models and tools for natural language processing tasks 2
nytud/hulu A collection of linguistic datasets and benchmarks for natural language understanding tasks 8