SO_word2vec

Word Embedding Model

This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks.

A word2vec model trained over Stack Overflow

GitHub

40 stars
1 watching
11 forks
last commit: over 6 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
tca19/dict2vec A framework to learn word embeddings using lexical dictionaries 115
satwikkottur/visualword2vec Learning word embeddings from abstract images to improve language understanding 19
alexandres/lexvec An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks 803
auspicious3000/contentvec An implementation of a self-supervised speech representation model using PyTorch and disentangled speaker embeddings 468
cod3licious/conec A library for training and evaluating a type of word embedding model that extends the original Word2Vec algorithm 20
juliatext/word2vec.jl A Julia interface to the Word2Vec word embedding algorithm 61
wikipedia2vec/wikipedia2vec A tool for learning vector representations of words and entities from Wikipedia text data. 940
botcenter/spanish-sent2vec This project trains a machine learning model to generate sentence embeddings from Spanish text data using the sent2vec algorithm. 4
mmihaltz/word2vec-googlenews-vectors A repository hosting pre-trained word vector model (3 million 300-dimension English word vectors) from the Google News corpus. 516
refefer/word2vec-scala A Scala implementation of the word2vec model representation. 11
danieldk/go2vec A package for reading and analyzing word embeddings from the word2vec format in Go. 56
dalinvip/cw2vec A software framework for learning Chinese word embeddings with stroke n-gram information 274
largelymfs/topical_word_embeddings A codebase implementing topical word embeddings using various NLP techniques as demonstrated in a paper accepted by AAAI'15. 315
zhezhaoa/ngram2vec A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics 846
eyaler/word2vec-slim Slims down a large pre-trained word2vec model to reduce size and improve loading time 212