WSI4URLang

Language sense induction

Develops techniques to induce word senses in under-resourced languages using computational methods.

Word Sense Induction (WSI) for Under-resourced Languages (URLang)

GitHub

0 stars
2 watching
2 forks
Language: Java
last commit: about 4 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
uhh-lt/sensegram Tools and techniques for analyzing word meanings from word embeddings 212
olivomarco/lc4j An open-source Java library implementing text categorization and language detection using N-grams. 5
pemistahl/lingua An accurate language detection library for Java and the JVM suitable for both short and long text inputs. 707
alvations/sugali A system designed to identify the language of an arbitrary text string using machine learning and multiple data sources. 2
unlyed/universal-language-detector Detects and resolves the language used in user requests 95
teknologi-umum/flourite Automatically detects programming languages from given strings. 38
vseloved/wiki-lang-detect Uses Wikipedia data to identify the language of unstructured text 31
galuhsahid/indonesian-word-embedding Demonstrates word embedding in Indonesian language using pre-trained Word2vec models 20
get-woke/woke Detects and suggests replacements for non-inclusive language in source code. 454
wyvernlang/wyvern A programming language designed to support adaptation and assurance in software systems 556
hit-scir/elmoformanylangs Provides pre-trained ELMo representations for multiple languages to improve NLP tasks. 1,463
kyubyong/wordvectors Provides pre-trained word vectors for multiple languages to facilitate NLP tasks 2,215
alvations/sugarlike A tool that identifies languages in text by comparing them to a reference set of patterns. 1
kwonoj/cld3-asm A WebAssembly-based JavaScript binding to Google's Compact Language Detector v3 58
minibikini/paasaa Tools for detecting the language of unstructured text in Elixir applications 115