charagram
N-gram embedder
A tool for training and using character n-gram based word and sentence embeddings in natural language processing.
Code to train and use models from "Charagram: Embedding Words and Sentences via Character n-grams".
125 stars
13 watching
41 forks
Language: Python
last commit: over 9 years ago Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A repository providing a re-implementation of character n-gram embeddings for pre-training in natural language processing tasks | 23 |
| | This repository provides an unsupervised approach to learning character-aware word and context embeddings. | 0 |
| | Trains word embeddings from a paraphrase database to represent semantic relationships between words. | 30 |
| | A codebase for training and using models of sentence embeddings. | 33 |
| | A collection of pre-trained models and code for training paraphrastic sentence embeddings from large machine translation datasets. | 102 |
| | A Python implementation of a neural network model for learning word embeddings from text data | 6 |
| | Generates sequences of characters from a given text, useful for data analysis and modeling | 37 |
| | A software implementation of a word embedding method using character glyphs, enhancing traditional Chinese language processing | 30 |
| | A PyTorch implementation of the skip-gram model for learning word embeddings. | 188 |
| | A deep learning model that generates word embeddings by predicting words based on their dependency context | 291 |
| | Quantifies 100 years of gender and ethnic stereotypes through word embeddings. | 161 |
| | Provides fast and efficient word embeddings for natural language processing. | 223 |
| | A utility class for generating and evaluating document representations using word embeddings. | 54 |
| | Provides methods for evaluating word embeddings on various benchmarks | 437 |
| | A method to generate sentence embeddings from pre-trained language models | 178 |