charagram

N-gram embedder

A tool for training and using character n-gram based word and sentence embeddings in natural language processing.

Code to train and use models from "Charagram: Embedding Words and Sentences via Character n-grams".
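
As a rough illustration of the idea behind character n-gram embeddings, the sketch below represents a word or sentence as a nonlinearity applied to a bias plus the sum of the vectors of its character n-grams. It is not the repository's code. The names `char_ngrams` and `NgramEmbedder`, the space padding, the n-gram sizes, the tanh nonlinearity, and the randomly initialized vectors are all assumptions made for the example. A trained model would instead learn the vectors and use a fixed n-gram vocabulary.

```python
import math
import random


def char_ngrams(text, n_values=(2, 3, 4)):
    """Return all character n-grams of a space-padded string (hypothetical helper)."""
    padded = " " + text + " "  # padding marks word boundaries; an assumption
    grams = []
    for n in n_values:
        for i in range(len(padded) - n + 1):
            grams.append(padded[i:i + n])
    return grams


class NgramEmbedder:
    """Toy embedder: tanh(bias + sum of n-gram vectors). Vectors are random, not trained."""

    def __init__(self, dim=8, seed=0):
        self.dim = dim
        self.rng = random.Random(seed)
        self.vectors = {}          # n-gram -> list of floats, created lazily
        self.bias = [0.0] * dim    # a trained model would learn this

    def _vector(self, gram):
        # Lazily assign a random vector to each new n-gram (illustration only).
        if gram not in self.vectors:
            self.vectors[gram] = [
                self.rng.uniform(-0.1, 0.1) for _ in range(self.dim)
            ]
        return self.vectors[gram]

    def embed(self, text):
        """Embed a word or a whole sentence from its character n-grams."""
        total = list(self.bias)
        for gram in char_ngrams(text):
            vec = self._vector(gram)
            for k in range(self.dim):
                total[k] += vec[k]
        return [math.tanh(v) for v in total]


embedder = NgramEmbedder(dim=8, seed=0)
word_vec = embedder.embed("embedding")
sentence_vec = embedder.embed("character n-grams embed sentences too")
```

Because the same function handles any string, words and sentences share one representation space, and misspelled or unseen words still receive an embedding from the n-grams they share with known text.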

GitHub

125 stars
13 watching
41 forks
Language: Python
Last commit: over 8 years ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| hassygo/charngram2vec | A repository providing a re-implementation of character n-gram embeddings for pre-training in natural language processing tasks. | 23 |
| giuseppemarra/char-word-embeddings | This repository provides an unsupervised approach to learning character-aware word and context embeddings. | 0 |
| jwieting/paragram-word | Trains word embeddings from a paraphrase database to represent semantic relationships between words. | 30 |
| jwieting/acl2017 | A codebase for training and using models of sentence embeddings. | 33 |
| jwieting/para-nmt-50m | A collection of pre-trained models and code for training paraphrastic sentence embeddings from large machine translation datasets. | 102 |
| zhangxiann/skip-gram | A Python implementation of a neural network model for learning word embeddings from text data. | 6 |
| reddavis/n-gram | Generates sequences of characters from a given text, useful for data analysis and modeling. | 37 |
| ray1007/gwe | A software implementation of a word embedding method using character glyphs, enhancing traditional Chinese language processing. | 30 |
| fanglanting/skip-gram-pytorch | A PyTorch implementation of the skip-gram model for learning word embeddings. | 188 |
| malllabiisc/wordgcn | A deep learning model that generates word embeddings by predicting words based on their dependency context. | 290 |
| nikhgarg/embeddingdynamicstereotypes | Quantifies 100 years of gender and ethnic stereotypes through word embeddings. | 159 |
| vzhong/embeddings | Provides fast and efficient word embeddings for natural language processing. | 223 |
| nlprinceton/text_embedding | A utility class for generating and evaluating document representations using word embeddings. | 54 |
| kudkudak/word-embeddings-benchmarks | Provides methods for evaluating word embeddings on various benchmarks. | 437 |
| binwang28/sbert-wk-sentence-embedding | A method to generate sentence embeddings from pre-trained language models. | 177 |