lc4j

Language detector

An open-source Java library implementing text categorization and language detection using N-grams.

Language Categorization library for Java

GitHub

5 stars
1 watching
0 forks
Language: Java
last commit: about 4 years ago
javalanguage-categorizationlanguage-detectiontext-categorization

Related projects:

Repository Description Stars
pemistahl/lingua An accurate language detection library for Java and the JVM suitable for both short and long text inputs. 707
abadojack/whatlanggo A library for detecting and identifying languages in text 643
alvations/sugali A system designed to identify the language of an arbitrary text string using machine learning and multiple data sources. 2
pemistahl/lingua-go A library that accurately detects the language of short to long text inputs without requiring external APIs or configuration. 1,190
rylans/getlang A natural language detection package that identifies the source language of input text without an internet connection. 171
detectlanguage/detectlanguage-go A Go client for detecting the language of given text and interacting with the Detect Language API 25
hashwin/scylla A Ruby-based language detection tool that uses N-Gram based text categorization to identify the language of given text. 36
teknologi-umum/flourite Automatically detects programming languages from given strings. 38
unlyed/universal-language-detector Detects and resolves the language used in user requests 95
jtoy/cld A compact language detection library for Ruby based on Google Chrome's technology 210
detectlanguage/detectlanguage-ruby A Ruby client for detecting the language of given text 29
kwonoj/cld3-asm A WebAssembly-based JavaScript binding to Google's Compact Language Detector v3 58
endeveit/enca Provides minimal cgo bindings for the libenca language detection library 16
vseloved/wiki-lang-detect Uses Wikipedia data to identify the language of unstructured text 31
endeveit/guesslanguage A tool for detecting the language of input text 58