gaelg

Gaelic NLP toolkit

Provides NLP resources and tools for the Manx Gaelic language to support machine translation and natural language processing tasks.

NLP resources for Manx Gaelic, mainly in support of the gv2ga MT engine

GitHub

3 stars
2 watching
1 forks
Language: Perl
last commit: 3 months ago
Linked from 1 awesome list

corpusdictionarygaelgmanxnlpuniversal-dependencies

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
kscanne/gaidhlig NLP resources for Scottish Gaelic language support 3
colinbatchelor/gdbank Tools and resources for natural language processing of Scottish Gaelic. 4
kscanne/caighdean A tool for normalizing and translating Irish-Gaelic text into standardized forms of the language. 18
pld-linux/aspell-gv A Manx Gaelic dictionary integrated into the aspell spell-checking software 1
kscanne/gaelspell A spellchecking tool for the Irish language, providing integration with popular UNIX-based spelling packages. 17
wojtekdz/gd-fcfg A grammar representation of Scottish Gaelic language using context-free feature-based approach 3
kscanne/hunspell-gd Provides data and tools for building Scottish Gaelic spell checkers 10
kscanne/chichewa A collection of NLP resources for a Bantu language, including a basic lexicon and script for morphological generation. 9
kscanne/orthotree A tool for generating large phylogenetic language trees based on orthographic distance 10
kscanne/gramadoir A grammar checking engine tailored for minority languages, specifically Irish. 13
kscanne/tesseract-gle-uncial Provides training data and scripts to enhance OCR accuracy for Irish Gaelic fonts 3
martavillegas/eurowordnetlemon Generates lexicons from multilingual wordnet repository 1
kscanne/aimsigh A revived version of an Irish search engine from the defunct aimsigh.com, preserving its archive and functionality. 1
universaldependencies/ud_galician-ctg This is a collection of annotated text data for the Galician language. 1
lex4all/lex4all Software tool to generate pronunciation lexicons for low-resource languages using speech recognition and machine learning algorithms. 21