cw2vec

Chinese Word Embeddings Framework

A software framework for learning Chinese word embeddings with stroke n-gram information

cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information

GitHub

274 stars
11 watching
66 forks
Language: C++
last commit: over 1 year ago
cw2vecembeddingsfasttextstroke-informationword2vec

Related projects:

Repository Description Stars
tca19/dict2vec A framework to learn word embeddings using lexical dictionaries 115
satwikkottur/visualword2vec Learning word embeddings from abstract images to improve language understanding 19
vefstathiou/so_word2vec This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks. 40
danieldk/go2vec A package for reading and analyzing word embeddings from the word2vec format in Go. 56
hassygo/charngram2vec A repository providing a re-implementation of character n-gram embeddings for pre-training in natural language processing tasks 23
cod3licious/conec A library for training and evaluating a type of word embedding model that extends the original Word2Vec algorithm 20
malllabiisc/wordgcn A deep learning model that generates word embeddings by predicting words based on their dependency context 290
zhezhaoa/ngram2vec A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics 846
hkust-knowcomp/jwe This is a software project that trains and evaluates word embeddings for Chinese words, characters, and fine-grained subcharacter components. 99
hslcy/vcwe This project provides code and corpora for creating word embeddings by considering the visual characteristics of words. 15
dccuchile/spanish-word-embeddings A collection of precomputed word embeddings for the Spanish language, derived from different corpora and computational methods. 356
vzhong/embeddings Provides fast and efficient word embeddings for natural language processing. 223
yoonkim/word2vec_torch Torch implementation of Word2Vec algorithm for creating vector representations of words from text data 61
juliatext/word2vec.jl A Julia interface to the Word2Vec word embedding algorithm 61
ray1007/gwe A software implementation of a word embedding method using character glyphs, enhancing traditional Chinese language processing 30