conec

Word Embedding Model

A library for training and evaluating a type of word embedding model that extends the original Word2Vec algorithm

Context Encoders (ConEc) are a simple but powerful extension of the word2vec model for learning word embeddings.

GitHub

20 stars
5 watching
5 forks
Language: Python
last commit: over 4 years ago
Topics: machine-learning, natural-language-processing, word-embeddings

Related projects:

Repository | Description | Stars
tca19/dict2vec | A framework to learn word embeddings using lexical dictionaries | 115
vefstathiou/so_word2vec | A word embedding model trained on Stack Overflow posts for use in natural language processing tasks | 40
malllabiisc/wordgcn | A deep learning model that generates word embeddings by predicting words based on their dependency context | 290
alexandres/lexvec | An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks | 803
auspicious3000/contentvec | An implementation of a self-supervised speech representation model using PyTorch and disentangled speaker embeddings | 467
satwikkottur/visualword2vec | Learning word embeddings from abstract images to improve language understanding | 19
hslcy/vcwe | Code and corpora for creating word embeddings by considering the visual characteristics of words | 15
dalinvip/cw2vec | A software framework for learning Chinese word embeddings with stroke n-gram information | 274
jwieting/acl2017 | A codebase for training and using models of sentence embeddings | 33
deepcs233/visual-cot | Develops a multi-modal language model with a comprehensive dataset and benchmark for chain-of-thought reasoning | 134
rguthrie3/morphologicalpriorsforwordembeddings | A project implementing a method to incorporate morphological information into word embeddings using a neural network model | 52
wikipedia2vec/wikipedia2vec | A tool for learning vector representations of words and entities from Wikipedia text data | 940
zhezhaoa/ngram2vec | A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics | 846
dccuchile/spanish-word-embeddings | A collection of precomputed word embeddings for the Spanish language, derived from different corpora and computational methods | 356
danieldk/go2vec | A package for reading and analyzing word embeddings from the word2vec format in Go | 56