caighdean
Text normalizer
A tool for normalizing and translating Irish-Gaelic text into standardized forms of the language.
The translation engine behind Caighdeánaitheoir na Gaeilge (the Irish standardizer) and the Scottish Gaelic/Manx→Irish translators (Gàidhlig/Gaelg→Gaeilge)
18 stars
7 watching
4 forks
Language: Perl
Last commit: 2 months ago
Linked from 1 awesome list
Topics: gaeilge, gaelg, gaelg-gaeilge, gaelic, gaidhlig, irish, text-normalization, translation
Related projects:
Repository | Description | Stars |
---|---|---|
kscanne/gaidhlig | NLP resources for Scottish Gaelic language support | 3 |
kscanne/gaelg | Provides NLP resources and tools for the Manx Gaelic language to support machine translation and natural language processing tasks. | 3 |
kscanne/gramadoir | A grammar checking engine tailored for minority languages, specifically Irish. | 13 |
kscanne/gaelspell | A spellchecking tool for the Irish language, providing integration with popular UNIX-based spelling packages. | 17 |
kscanne/tesseract-gle-uncial | Provides training data and scripts to enhance OCR accuracy for Irish Gaelic fonts | 3 |
kscanne/orthotree | A tool for generating large phylogenetic language trees based on orthographic distance | 10 |
kscanne/hunspell-gd | Provides data and tools for building Scottish Gaelic spell checkers | 10 |
syntax-tree/nlcst-normalize | A utility to normalize words for comparison by standardizing punctuation and case. | 7 |
comphist/norma | A tool for normalizing spelling in non-standard language data by combining multiple techniques with training data and a target dictionary. | 20 |
dbuenzli/uunf | A library for normalizing Unicode text to a standardized format | 21 |
avito-tech/normalize | A library providing tools to normalize and compare fuzzy text inputs for better matching and association. | 46 |
jonschlinkert/normalize-pkg | Tools to normalize package.json data for better compatibility and readability | 18 |
johnalbin/normalize-scss | A collection of HTML element and attribute rulesets to normalize styles across all browsers. | 1,434 |
wojtekdz/gd-fcfg | A grammar representation of the Scottish Gaelic language using a feature-based context-free approach | 3 |
dimuska139/go-email-normalizer | A library for providing a standard form of email addresses | 64 |