sent2vec

Sentence embedder

An unsupervised technique for generating numerical representations (embeddings) of sentences and words for use in machine learning tasks.

General purpose unsupervised sentence representations

GitHub

1k stars
40 watching
256 forks
Language: C++
Last commit: over 2 years ago

Related projects:

Repository | Description | Stars
kostyaev/sentence2vec | A tool for creating deep sentence embeddings using Sequence-to-Sequence learning. | 22
lajanugen/s2v | An implementation of a neural network model for learning efficient sentence representations from text data. | 205
botcenter/spanish-sent2vec | Trains a machine learning model to generate sentence embeddings from Spanish text data using the sent2vec algorithm. | 4
oborchers/fast_sentence_embeddings | A Python library for efficiently computing sentence embeddings from large datasets. | 618
vefstathiou/so_word2vec | A word embedding model trained on Stack Overflow posts for use in natural language processing tasks. | 40
binwang28/sbert-wk-sentence-embedding | A method to generate sentence embeddings from pre-trained language models. | 177
voidism/diffcse | An unsupervised contrastive learning framework for learning sentence embeddings sensitive to differences between original and edited sentences. | 291
iarroyof/sentence_embedding | A method to convert word embeddings into sentence representations by applying entropy weights calculated from a TF-IDF transform. | 9
auspicious3000/contentvec | An implementation of a self-supervised speech representation model using PyTorch and disentangled speaker embeddings. | 468
wikipedia2vec/wikipedia2vec | A tool for learning vector representations of words and entities from Wikipedia text data. | 940
tca19/dict2vec | A framework to learn word embeddings using lexical dictionaries. | 115
fursovia/geometric_embedding | An implementation of a non-parameterized approach for building sentence representations. | 19
princetonml/sif | A Python implementation of a sentence embedding algorithm using the Smooth Inverse Frequency weighting scheme. | 1,083
bohanli/bert-flow | A TensorFlow implementation of sentence embedding from pre-trained language models. | 529
facebookresearch/senteval | A tool for evaluating the quality of sentence embeddings as features in various downstream tasks. | 2,087