hunaccent

Diacritic restorer

A C++ library that uses machine learning to restore diacritics in Hungarian text

Accentize Hungarian text

15 stars
3 watching
1 fork
Language: C++
Last commit: 3 months ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
|---|---|---|
| aielte-research/diacritics_restoration | A Python-based framework for training and evaluating diacritics restoration models using lightweight 1D convolutional neural networks | 4 |
| attilanagy234/neural-punctuator | An implementation of BERT-based models for automatic punctuation restoration in English and Hungarian texts | 48 |
| cisocrgroup/pocoto | A Java-based tool for correcting errors in OCR'd historical documents | 40 |
| algolzw/daclip-uir | This project controls vision-language models to restore degraded images in various environments and conditions | 662 |
| huspacy/huspacy | An industrial-strength natural language processing library for Hungarian language text analysis | 155 |
| nytud/hucola | A dataset of Hungarian sentences annotated for their grammatical acceptability | 1 |
| jecisc/chanel | A tool for cleaning and improving Smalltalk code | 22 |
| kuhumcst/cstlemma | A lemmatiser tool for multiple languages using affix rules and supervised learning from full-form dictionaries | 35 |
| hamdikahloun/windows_ocr | An OCR library allowing developers to embed high-quality character recognition functionality in their products | 18 |
| alvenirai/punctfix | A Python library that adds punctuation and capitalization to text without punctuation | 22 |
| patois/xray | Tool for filtering and highlighting decompiler output based on regular expressions | 125 |
| cisocrgroup/resources | Resources and data for developing a language-aware OCR document error profiler and PoCoTo tools | 15 |
| yaacov/hebocr | A Hebrew character recognition library | 5 |
| hodefoting/kernagic | A tool for adjusting font spacing interactively | 75 |
| poltextlab/hunempoli_corpus | A manually annotated corpus for training and testing machine learning models of Aspect Based Sentiment Analysis (ABSA) in Hungarian language | 0 |