lingua-go

Language detector

A library that accurately detects the language of short to long text inputs without requiring external APIs or configuration.

The most accurate natural language detection library for Go, suitable for short text and mixed-language text

GitHub

1k stars
12 watching
66 forks
Language: Go
last commit: 6 days ago
Linked from 1 awesome list

gogolang-librarylanguage-classificationlanguage-detectionlanguage-identificationlanguage-modelinglanguage-processinglanguage-recognitionnatural-language-processingnlpnlp-machine-learningtext-processing

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
pemistahl/lingua An accurate language detection library for Java and the JVM suitable for both short and long text inputs. 707
detectlanguage/detectlanguage-go A Go client for detecting the language of given text and interacting with the Detect Language API 25
abadojack/whatlanggo A library for detecting and identifying languages in text 643
rylans/getlang A natural language detection package that identifies the source language of input text without an internet connection. 171
endeveit/guesslanguage A tool for detecting the language of input text 58
alvations/sugali A system designed to identify the language of an arbitrary text string using machine learning and multiple data sources. 2
get-woke/woke Detects and suggests replacements for non-inclusive language in source code. 454
endeveit/enca Provides minimal cgo bindings for the libenca language detection library 16
olivomarco/lc4j An open-source Java library implementing text categorization and language detection using N-grams. 5
vseloved/wiki-lang-detect Uses Wikipedia data to identify the language of unstructured text 31
hashwin/scylla A Ruby-based language detection tool that uses N-Gram based text categorization to identify the language of given text. 36
james-bowman/nlp This project provides a set of algorithms and implementations for natural language processing in Go. 450
greyblake/whatlang-rs A Rust library for detecting the language of text, including script recognition and reliability estimation. 970
unlyed/universal-language-detector Detects and resolves the language used in user requests 95
src-d/enry A tool that detects programming languages in files and returns their type and safety flags 460