SuGarLike

Language identifier

A tool that identifies languages in text by comparing them to a reference set of patterns.

Language Identification for Low Resource Languages (by Susanne, Guy and Liling)

GitHub

1 stars
5 watching
0 forks
Language: Python
last commit: over 10 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
alvations/sugali A system designed to identify the language of an arbitrary text string using machine learning and multiple data sources. 2
alvations/seedling A corpus and API for human language data 11
twerkmeister/ilid A deep learning-based system for identifying spoken language in audio files. 90
cisnlp/glotlid A language identification model that supports over 2000 languages and can be used for various NLP tasks. 90
ajaech/twitter_langid An open-source software framework for training and evaluating hierarchical neural networks for language identification tasks. 15
minibikini/paasaa Tools for detecting the language of unstructured text in Elixir applications 115
vseloved/wiki-lang-detect Uses Wikipedia data to identify the language of unstructured text 31
sergioburdisso/pyss3 A Python package implementing an interpretable machine learning model for text classification with visualization tools 336
liby/recent-languages-box A tool that analyzes recent GitHub commits and displays the languages used 0
leosmigel/analyzingalpha Analyzes and processes Alpha data to extract insights 477
alvations/dltk A collection of tools and utilities for the German language 12
odddollar/leafscript A lightweight programming language designed to be simple and efficient. 28
get-woke/woke Detects and suggests replacements for non-inclusive language in source code. 457
hit-scir/elmoformanylangs Provides pre-trained ELMo representations for multiple languages to improve NLP tasks. 1,463
gururise/alpacadatacleaned A cleaned and curated version of an Alpaca dataset used to train a large language model 1,516