SuGarLike
Language identifier
A tool that identifies languages in text by comparing them to a reference set of patterns.
Language Identification for Low Resource Languages (by Susanne, Guy and Liling)
1 stars
5 watching
0 forks
Language: Python
last commit: over 10 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
alvations/sugali | A system designed to identify the language of an arbitrary text string using machine learning and multiple data sources. | 2 |
alvations/seedling | A corpus and API for human language data | 11 |
twerkmeister/ilid | A deep learning-based system for identifying spoken language in audio files. | 90 |
cisnlp/glotlid | A language identification model that supports over 2000 languages and can be used for various NLP tasks. | 90 |
ajaech/twitter_langid | An open-source software framework for training and evaluating hierarchical neural networks for language identification tasks. | 15 |
minibikini/paasaa | Tools for detecting the language of unstructured text in Elixir applications | 115 |
vseloved/wiki-lang-detect | Uses Wikipedia data to identify the language of unstructured text | 31 |
sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 |
liby/recent-languages-box | A tool that analyzes recent GitHub commits and displays the languages used | 0 |
leosmigel/analyzingalpha | Analyzes and processes Alpha data to extract insights | 477 |
alvations/dltk | A collection of tools and utilities for the German language | 12 |
odddollar/leafscript | A lightweight programming language designed to be simple and efficient. | 28 |
get-woke/woke | Detects and suggests replacements for non-inclusive language in source code. | 457 |
hit-scir/elmoformanylangs | Provides pre-trained ELMo representations for multiple languages to improve NLP tasks. | 1,463 |
gururise/alpacadatacleaned | A cleaned and curated version of an Alpaca dataset used to train a large language model | 1,516 |