wordvectors

Language vectors

Provides pre-trained word vectors for multiple languages to facilitate NLP tasks

Pre-trained word vectors of 30+ languages

GitHub

2k stars
90 watching
391 forks
Language: Python
last commit: about 6 years ago
Linked from 2 awesome lists

fasttextlanguagevectorword2vec

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
mfaruqui/non-distributional Provides non-distributional word vector representations and tools to create them from linguistic lexicons 62
tmikolov/word2vec A tool for training word vectors using distributed neural network architectures 1,527
inejc/paragraph-vectors A PyTorch implementation of a model for generating dense vector representations of paragraphs from text data. 412
explosion/sense2vec A Python library that generates contextually-keyed word vectors from text data using a variation of the Word2Vec algorithm. 1,625
mmihaltz/word2vec-googlenews-vectors A repository hosting pre-trained word vector model (3 million 300-dimension English word vectors) from the Google News corpus. 516
ytsvetko/qvec Tools to evaluate word vectors based on their alignment with lexical resources and features. 75
bmschmidt/wordvectors An R package for building and exploring word embedding models 282
nmrksic/counter-fitting A tool for modifying word vectors to conform to linguistic constraints 144
babylonhealth/fasttext_multilingual A repository providing aligned multilingual word vectors for 78 languages using the SVD method. 1,197
nlprinceton/alacarte Tools and code for inducing custom semantic vector representations from text data 104
proycon/python-frog A Python binding to a C++ NLP tool for Dutch language processing tasks 47
microsoft/unicoder This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. 88
chengyuegongr/frequency-agnostic Improves word embeddings by using adversarial training to make them less dependent on word frequencies 118
kagnes/prevlex Provides a comprehensive, manually verified lexical resource for Hungarian finite verbs with possessive conjugation 0
c4n/pythonlexto A Python wrapper around the Thai word segmentator LexTo, allowing developers to easily integrate it into their applications. 1