wordvectors
Language vectors
Provides pre-trained word vectors for multiple languages to facilitate NLP tasks
Pre-trained word vectors of 30+ languages
2k stars
90 watching
391 forks
Language: Python
last commit: about 6 years ago
Linked from 2 awesome lists
fasttextlanguagevectorword2vec
Related projects:
Repository | Description | Stars |
---|---|---|
mfaruqui/non-distributional | Provides non-distributional word vector representations and tools to create them from linguistic lexicons | 62 |
tmikolov/word2vec | A tool for training word vectors using distributed neural network architectures | 1,527 |
inejc/paragraph-vectors | A PyTorch implementation of a model for generating dense vector representations of paragraphs from text data. | 412 |
explosion/sense2vec | A Python library that generates contextually-keyed word vectors from text data using a variation of the Word2Vec algorithm. | 1,625 |
mmihaltz/word2vec-googlenews-vectors | A repository hosting pre-trained word vector model (3 million 300-dimension English word vectors) from the Google News corpus. | 516 |
ytsvetko/qvec | Tools to evaluate word vectors based on their alignment with lexical resources and features. | 75 |
bmschmidt/wordvectors | An R package for building and exploring word embedding models | 282 |
nmrksic/counter-fitting | A tool for modifying word vectors to conform to linguistic constraints | 144 |
babylonhealth/fasttext_multilingual | A repository providing aligned multilingual word vectors for 78 languages using the SVD method. | 1,197 |
nlprinceton/alacarte | Tools and code for inducing custom semantic vector representations from text data | 104 |
proycon/python-frog | A Python binding to a C++ NLP tool for Dutch language processing tasks | 47 |
microsoft/unicoder | This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. | 88 |
chengyuegongr/frequency-agnostic | Improves word embeddings by using adversarial training to make them less dependent on word frequencies | 118 |
kagnes/prevlex | Provides a comprehensive, manually verified lexical resource for Hungarian finite verbs with possessive conjugation | 0 |
c4n/pythonlexto | A Python wrapper around the Thai word segmentator LexTo, allowing developers to easily integrate it into their applications. | 1 |