SIF

Sentence Embedder

A Python implementation of a sentence embedding algorithm using the Smooth Inverse Frequency weighting scheme

sentence embedding by Smooth Inverse Frequency weighting scheme

GitHub

1k stars
34 watching
306 forks
Language: Python
last commit: over 5 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
iarroyof/sentence_embedding A method to convert word embeddings into sentence representations by applying entropy weights calculated from TFIDF transform. 9
oborchers/fast_sentence_embeddings A Python library for efficiently computing sentence embeddings from large datasets 618
binwang28/sbert-wk-sentence-embedding A method to generate sentence embeddings from pre-trained language models 177
lajanugen/s2v An implementation of a neural network model for learning efficient sentence representations from text data. 205
fursovia/geometric_embedding An implementation of a non-parameterized approach for building sentence representations 19
dfki-interactive-machine-learning/arasif Provides sentence embeddings for Arabic languages using pre-trained word embeddings and Smooth Inverse Frequency algorithm 5
xiaoqijiao/coling2018 Provides training and testing code for a CNN-based sentence embedding model 2
wangyuxinwhy/uniem Develops unified sentence embedding models for NLP tasks 833
jwieting/acl2017 A codebase for training and using models of sentence embeddings. 33
jwieting/iclr2016 Code for training universal paraphrastic sentence embeddings and models on semantic similarity tasks 193
largelymfs/topical_word_embeddings A codebase implementing topical word embeddings using various NLP techniques as demonstrated in a paper accepted by AAAI'15. 315
epfml/sent2vec An unsupervised technique to generate numerical representations of sentences and words for use in machine learning tasks 1,193
fh295/sentencerepresentation A software framework for learning sentence representations using unsupervised machine learning algorithms 124
voidism/diffcse An unsupervised contrastive learning framework for learning sentence embeddings sensitive to differences between original and edited sentences. 291
bohanli/bert-flow A TensorFlow implementation of sentence embedding from pre-trained language models 529