char-word-embeddings

Character embedding generator

This repository provides an unsupervised approach to learning character-aware word and context embeddings.

This repository contains usable code accompanying the paper by G. Marra, A. Zugarini, S. Melacci, and M. Maggini, “An unsupervised character-aware neural approach to word and context representation learning”.

GitHub

0 stars
2 watching
1 fork
Language: Python
Last commit: about 6 years ago

Related projects:

| Repository | Description | Stars |
|---|---|---|
| jwieting/charagram | A tool for training and using character n-gram based word and sentence embeddings in natural language processing. | 125 |
| hassygo/charngram2vec | A re-implementation of character n-gram embeddings for pre-training in natural language processing tasks. | 23 |
| rguthrie3/morphologicalpriorsforwordembeddings | An implementation of a method that incorporates morphological information into word embeddings using a neural network model. | 52 |
| botcenter/spanishwordembeddings | Generates Spanish word embeddings using fastText on large corpora. | 9 |
| jwieting/paragram-word | Trains word embeddings from a paraphrase database to represent semantic relationships between words. | 30 |
| yuvalpinter/mimick | A system for approximating word embeddings using character-level neural networks. | 153 |
| harsh19/spine | Transforms existing word embeddings into more interpretable ones by applying a novel extension of the k-sparse autoencoder with stricter sparsity constraints. | 52 |
| malllabiisc/wordgcn | A deep learning model that generates word embeddings by predicting words based on their dependency context. | 290 |
| nlprinceton/text_embedding | A utility class for generating and evaluating document representations using word embeddings. | 54 |
| dccuchile/spanish-word-embeddings | A collection of precomputed word embeddings for the Spanish language, derived from different corpora and computational methods. | 356 |
| juliatext/embeddings.jl | Provides access to pre-trained word embeddings for NLP tasks. | 81 |
| hslcy/vcwe | Code and corpora for creating word embeddings that take the visual characteristics of words into account. | 15 |
| zhezhaoa/ngram2vec | A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics. | 846 |
| bigredt/vico | Multi-sense word embeddings learned from visual co-occurrences. | 25 |
| ray1007/gwe | An implementation of a word embedding method that uses character glyphs to enhance traditional Chinese language processing. | 30 |