char-word-embeddings
Character embedding generator
This repository provides an unsupervised approach to learning character-aware word and context embeddings.
This repository contains a usable code from the paper G. Marra, A. Zugarini, S. Melacci, and M. Maggini, “An unsupervised character-aware neural approach to word and context representation learning”.
0 stars
2 watching
1 forks
Language: Python
last commit: over 6 years ago Related projects:
Repository | Description | Stars |
---|---|---|
| A tool for training and using character n-gram based word and sentence embeddings in natural language processing. | 125 |
| A repository providing a re-implementation of character n-gram embeddings for pre-training in natural language processing tasks | 23 |
| A project implementing a method to incorporate morphological information into word embeddings using a neural network model | 52 |
| This project generates Spanish word embeddings using fastText on large corpora. | 9 |
| Trains word embeddings from a paraphrase database to represent semantic relationships between words. | 30 |
| A system for approximating word embeddings using character-level neural networks | 153 |
| Transforms existing word embeddings into more interpretable ones by applying a novel extension of k-sparse autoencoder with stricter sparsity constraints | 52 |
| A deep learning model that generates word embeddings by predicting words based on their dependency context | 291 |
| A utility class for generating and evaluating document representations using word embeddings. | 54 |
| A collection of precomputed word embeddings for the Spanish language, derived from different corpora and computational methods. | 354 |
| Provides access to pre-trained word embeddings for NLP tasks. | 81 |
| This project provides code and corpora for creating word embeddings by considering the visual characteristics of words. | 15 |
| A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics | 848 |
| Multi-sense word embeddings learned from visual cooccurrences | 25 |
| A software implementation of a word embedding method using character glyphs, enhancing traditional Chinese language processing | 30 |