lingua

Language detector

An accurate language detection library for Java and the JVM suitable for both short and long text inputs.

The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

GitHub

707 stars
11 watching
64 forks
Language: Kotlin
last commit: 6 days ago
Linked from 1 awesome list

android-libraryjava-librarykotlin-librarylanguage-classificationlanguage-detectionlanguage-identificationlanguage-processinglanguage-recognitionnatural-languagenatural-language-processingnlpnlp-librarynlp-machine-learning

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
pemistahl/lingua-go A library that accurately detects the language of short to long text inputs without requiring external APIs or configuration. 1,190
olivomarco/lc4j An open-source Java library implementing text categorization and language detection using N-grams. 5
hashwin/scylla A Ruby-based language detection tool that uses N-Gram based text categorization to identify the language of given text. 36
alvations/sugali A system designed to identify the language of an arbitrary text string using machine learning and multiple data sources. 2
rylans/getlang A natural language detection package that identifies the source language of input text without an internet connection. 171
unlyed/universal-language-detector Detects and resolves the language used in user requests 95
vseloved/wiki-lang-detect Uses Wikipedia data to identify the language of unstructured text 31
teknologi-umum/flourite Automatically detects programming languages from given strings. 38
abadojack/whatlanggo A library for detecting and identifying languages in text 643
greyblake/whatlang-rs A Rust library for detecting the language of text, including script recognition and reliability estimation. 970
minibikini/paasaa Tools for detecting the language of unstructured text in Elixir applications 115
hyphenliu/cnminlangwebcollect Detects languages of Chinese minority websites and collects them into a dataset. 1
endeveit/guesslanguage A tool for detecting the language of input text 58
get-woke/woke Detects and suggests replacements for non-inclusive language in source code. 454
detectlanguage/detectlanguage-go A Go client for detecting the language of given text and interacting with the Detect Language API 25