wordvectors
Language vectors
Provides pre-trained word vectors for multiple languages to facilitate NLP tasks
Pre-trained word vectors of 30+ languages
2k stars
90 watching
391 forks
Language: Python
Last commit: over 6 years ago
Linked from 2 awesome lists
fasttext, language, vector, word2vec
Related projects:
Repository | Description | Stars |
---|---|---|
mfaruqui/non-distributional | A repository providing pre-built non-distributional word vector representations for NLP tasks | 62 |
tmikolov/word2vec | A tool for training word vectors using distributed neural network architectures | 1,525 |
inejc/paragraph-vectors | A PyTorch implementation of a model for generating dense vector representations of paragraphs from text data. | 413 |
explosion/sense2vec | A Python library that generates contextually-keyed word vectors from text data using a variation of the Word2Vec algorithm. | 1,630 |
mmihaltz/word2vec-googlenews-vectors | A repository hosting a pre-trained word vector model (3 million 300-dimensional English word vectors) trained on the Google News corpus. | 519 |
ytsvetko/qvec | A tool to measure the quality of word vectors based on their correlation with manually crafted lexical resources | 75 |
bmschmidt/wordvectors | An R package for building and exploring word embedding models | 282 |
nmrksic/counter-fitting | A tool to update word vectors to align with linguistic constraints | 145 |
babylonhealth/fasttext_multilingual | A repository providing aligned multilingual word vectors for 78 languages using the SVD method. | 1,197 |
nlprinceton/alacarte | Tools and code for inducing custom semantic vector representations from text data | 104 |
proycon/python-frog | A Python binding to a C++ NLP tool for Dutch language processing tasks | 47 |
microsoft/unicoder | This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. | 89 |
chengyuegongr/frequency-agnostic | Improves word embeddings by training with adversarial objectives | 118 |
kagnes/prevlex | Provides a comprehensive, manually verified lexical resource for Hungarian finite verbs with possessive conjugation | 0 |
c4n/pythonlexto | A Python wrapper around the Thai word segmenter LexTo, allowing developers to easily integrate it into their applications. | 1 |