lingua
Language detector
An accurate language detection library for Java and the JVM suitable for both short and long text inputs.
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
707 stars
11 watching
64 forks
Language: Kotlin
last commit: 6 days ago
Linked from 1 awesome list
android-libraryjava-librarykotlin-librarylanguage-classificationlanguage-detectionlanguage-identificationlanguage-processinglanguage-recognitionnatural-languagenatural-language-processingnlpnlp-librarynlp-machine-learning
Related projects:
Repository | Description | Stars |
---|---|---|
pemistahl/lingua-go | A library that accurately detects the language of short to long text inputs without requiring external APIs or configuration. | 1,190 |
olivomarco/lc4j | An open-source Java library implementing text categorization and language detection using N-grams. | 5 |
hashwin/scylla | A Ruby-based language detection tool that uses N-Gram based text categorization to identify the language of given text. | 36 |
alvations/sugali | A system designed to identify the language of an arbitrary text string using machine learning and multiple data sources. | 2 |
rylans/getlang | A natural language detection package that identifies the source language of input text without an internet connection. | 171 |
unlyed/universal-language-detector | Detects and resolves the language used in user requests | 95 |
vseloved/wiki-lang-detect | Uses Wikipedia data to identify the language of unstructured text | 31 |
teknologi-umum/flourite | Automatically detects programming languages from given strings. | 38 |
abadojack/whatlanggo | A library for detecting and identifying languages in text | 643 |
greyblake/whatlang-rs | A Rust library for detecting the language of text, including script recognition and reliability estimation. | 970 |
minibikini/paasaa | Tools for detecting the language of unstructured text in Elixir applications | 115 |
hyphenliu/cnminlangwebcollect | Detects languages of Chinese minority websites and collects them into a dataset. | 1 |
endeveit/guesslanguage | A tool for detecting the language of input text | 58 |
get-woke/woke | Detects and suggests replacements for non-inclusive language in source code. | 454 |
detectlanguage/detectlanguage-go | A Go client for detecting the language of given text and interacting with the Detect Language API | 25 |