charNgram2vec

N-gram embeddings

A repository providing a re-implementation of character n-gram embeddings for pre-training in natural language processing tasks

Pre-training character n-gram embeddings

GitHub

23 stars
6 watching
4 forks
Language: C++
last commit: about 1 year ago
embeddings

Related projects:

Repository Description Stars
jwieting/charagram A tool for training and using character n-gram based word and sentence embeddings in natural language processing. 125
zhezhaoa/ngram2vec A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics 846
giuseppemarra/char-word-embeddings This repository provides an unsupervised approach to learning character-aware word and context embeddings. 0
pwoolcoc/ngrams Generates n-grams from sequences of tokens 27
reddavis/n-gram Generates sequences of characters from a given text, useful for data analysis and modeling 37
alexandres/lexvec An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks 803
dalinvip/cw2vec A software framework for learning Chinese word embeddings with stroke n-gram information 274
danieldk/go2vec A package for reading and analyzing word embeddings from the word2vec format in Go. 56
ynqa/wego An open-source Go library for learning and manipulating vector representations of words 474
vzhong/embeddings Provides fast and efficient word embeddings for natural language processing. 223
malllabiisc/wordgcn A deep learning model that generates word embeddings by predicting words based on their dependency context 290
seomoz/word2gauss This implementation provides a way to represent words as multivariate Gaussian distributions, allowing scalable word embeddings. 190
ray1007/gwe A software implementation of a word embedding method using character glyphs, enhancing traditional Chinese language processing 30
atgreen/cl-embeddings A Common Lisp library for generating word embeddings using neural network models. 8
benedekrozemberczki/role2vec An implementation of a deep learning-based method for creating vector representations of nodes in a graph 166