wordvectors

Language vectors

Provides pre-trained word vectors for multiple languages to facilitate NLP tasks

Pre-trained word vectors of 30+ languages

GitHub

2k stars
90 watching
391 forks
Language: Python
last commit: over 6 years ago
Linked from 2 awesome lists

fasttextlanguagevectorword2vec

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
mfaruqui/non-distributional A repository providing pre-built non-distributional word vector representations for NLP tasks 62
tmikolov/word2vec A tool for training word vectors using distributed neural network architectures 1,525
inejc/paragraph-vectors A PyTorch implementation of a model for generating dense vector representations of paragraphs from text data. 413
explosion/sense2vec A Python library that generates contextually-keyed word vectors from text data using a variation of the Word2Vec algorithm. 1,630
mmihaltz/word2vec-googlenews-vectors A repository hosting pre-trained word vector model (3 million 300-dimension English word vectors) from the Google News corpus. 519
ytsvetko/qvec A tool to measure the quality of word vectors based on their correlation with manually crafted lexical resources 75
bmschmidt/wordvectors An R package for building and exploring word embedding models 282
nmrksic/counter-fitting A tool to update word vectors to align with linguistic constraints 145
babylonhealth/fasttext_multilingual A repository providing aligned multilingual word vectors for 78 languages using the SVD method. 1,197
nlprinceton/alacarte Tools and code for inducing custom semantic vector representations from text data 104
proycon/python-frog A Python binding to a C++ NLP tool for Dutch language processing tasks 47
microsoft/unicoder This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. 89
chengyuegongr/frequency-agnostic Improves word embeddings by training with adversarial objectives 118
kagnes/prevlex Provides a comprehensive, manually verified lexical resource for Hungarian finite verbs with possessive conjugation 0
c4n/pythonlexto A Python wrapper around the Thai word segmentator LexTo, allowing developers to easily integrate it into their applications. 1