Fast_Sentence_Embeddings

Sentence embedder

A Python library for efficiently computing sentence embeddings from large datasets

Compute Sentence Embeddings Fast!
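
As a rough illustration of what computing a sentence embedding can involve, the sketch below pools word vectors into a single sentence vector, once by averaging and once by element-wise max pooling. It is plain Python and does not use this library's API. The toy vectors and the function names (known_vectors, average_embedding, max_pool_embedding) are assumptions invented for the example.

```python
from typing import Dict, List, Sequence

# Toy 3-dimensional word vectors, invented for illustration only.
WORD_VECTORS: Dict[str, List[float]] = {
    "fast": [1.0, 0.0, 2.0],
    "sentence": [0.0, 2.0, 4.0],
    "embeddings": [2.0, 4.0, 0.0],
}


def known_vectors(tokens: Sequence[str],
                  vectors: Dict[str, List[float]]) -> List[List[float]]:
    """Return the vectors of the tokens found in the vocabulary."""
    return [vectors[t] for t in tokens if t in vectors]


def average_embedding(tokens: Sequence[str],
                      vectors: Dict[str, List[float]],
                      dim: int) -> List[float]:
    """Mean of the word vectors; a zero vector if no token is known."""
    rows = known_vectors(tokens, vectors)
    if not rows:
        return [0.0] * dim
    return [sum(col) / len(rows) for col in zip(*rows)]


def max_pool_embedding(tokens: Sequence[str],
                       vectors: Dict[str, List[float]],
                       dim: int) -> List[float]:
    """Element-wise maximum of the word vectors; zeros if no token is known."""
    rows = known_vectors(tokens, vectors)
    if not rows:
        return [0.0] * dim
    return [max(col) for col in zip(*rows)]


sentence = "fast sentence embeddings".split()
avg = average_embedding(sentence, WORD_VECTORS, dim=3)
mx = max_pool_embedding(sentence, WORD_VECTORS, dim=3)
print(avg, mx)
```

In practice the word vectors would come from a pretrained model such as word2vec or fastText, and a library like this one would handle batching and speed. The sketch only shows the pooling step.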

GitHub

618 stars
12 watching
83 forks
Language: Jupyter Notebook
Last commit: over 1 year ago
Linked from 1 awesome list

cython, document-similarity, embeddings, fasttext, fse, gensim, gensim-model, maxpooling, sentence-embeddings, sentence-representation, sentence-similarity, sif, swem, usif, word2vec-model, wordembedding

Related projects:

Repository | Description | Stars
iarroyof/sentence_embedding | A method to convert word embeddings into sentence representations by applying entropy weights calculated from TFIDF transform | 9
princetonml/sif | A Python implementation of a sentence embedding algorithm using the Smooth Inverse Frequency weighting scheme | 1,083
binwang28/sbert-wk-sentence-embedding | A method to generate sentence embeddings from pre-trained language models | 177
fursovia/geometric_embedding | An implementation of a non-parameterized approach for building sentence representations | 19
wangyuxinwhy/uniem | Develops unified sentence embedding models for NLP tasks | 833
kostyaev/sentence2vec | This is a tool for creating deep sentence embeddings using Sequence-to-Sequence learning | 22
lajanugen/s2v | An implementation of a neural network model for learning efficient sentence representations from text data | 205
jwieting/iclr2016 | Code for training universal paraphrastic sentence embeddings and models on semantic similarity tasks | 193
xiaoqijiao/coling2018 | Provides training and testing code for a CNN-based sentence embedding model | 2
epfml/sent2vec | An unsupervised technique to generate numerical representations of sentences and words for use in machine learning tasks | 1,193
vzhong/embeddings | Provides fast and efficient word embeddings for natural language processing | 223
botcenter/spanishwordembeddings | This project generates Spanish word embeddings using fastText on large corpora | 9
jwieting/acl2017 | A codebase for training and using models of sentence embeddings | 33
losyer/compact_reconstruction | This project enables efficient reconstruction of word embeddings by leveraging subword representations | 9
seomoz/word2gauss | This implementation provides a way to represent words as multivariate Gaussian distributions, allowing scalable word embeddings | 190