sent2vec
Sentence embedder
An unsupervised technique to generate numerical representations of sentences and words for use in machine learning tasks
General purpose unsupervised sentence representations
1k stars
40 watching
256 forks
Language: C++
last commit: over 2 years ago Related projects:
Repository | Description | Stars |
---|---|---|
kostyaev/sentence2vec | This is a tool for creating deep sentence embeddings using Sequence-to-Sequence learning. | 22 |
lajanugen/s2v | An implementation of a neural network model for learning efficient sentence representations from text data. | 205 |
botcenter/spanish-sent2vec | This project trains a machine learning model to generate sentence embeddings from Spanish text data using the sent2vec algorithm. | 4 |
oborchers/fast_sentence_embeddings | A Python library for efficiently computing sentence embeddings from large datasets | 618 |
vefstathiou/so_word2vec | This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks. | 40 |
binwang28/sbert-wk-sentence-embedding | A method to generate sentence embeddings from pre-trained language models | 177 |
voidism/diffcse | An unsupervised contrastive learning framework for learning sentence embeddings sensitive to differences between original and edited sentences. | 291 |
iarroyof/sentence_embedding | A method to convert word embeddings into sentence representations by applying entropy weights calculated from TFIDF transform. | 9 |
auspicious3000/contentvec | An implementation of a self-supervised speech representation model using PyTorch and disentangled speaker embeddings | 468 |
wikipedia2vec/wikipedia2vec | A tool for learning vector representations of words and entities from Wikipedia text data. | 940 |
tca19/dict2vec | A framework to learn word embeddings using lexical dictionaries | 115 |
fursovia/geometric_embedding | An implementation of a non-parameterized approach for building sentence representations | 19 |
princetonml/sif | A Python implementation of a sentence embedding algorithm using the Smooth Inverse Frequency weighting scheme | 1,083 |
bohanli/bert-flow | A TensorFlow implementation of sentence embedding from pre-trained language models | 529 |
facebookresearch/senteval | Tool for evaluating the quality of sentence embeddings as features in various downstream tasks. | 2,087 |