SentenceRepresentation

Sentence Encoder

A software framework for learning sentence representations using unsupervised machine learning algorithms

GitHub

124 stars
8 watching
22 forks
Language: Python
last commit: over 7 years ago

Related projects:

Repository Description Stars
zminghua/sentencoding A software package providing tools to encode and process text data using a specific neural network architecture. 16
zhegan27/convsent Trains an autoencoder to learn generic sentence representations using convolutional neural networks 34
voidism/diffcse An unsupervised contrastive learning framework for learning sentence embeddings sensitive to differences between original and edited sentences. 291
iarroyof/sentence_embedding A method to convert word embeddings into sentence representations by applying entropy weights calculated from TFIDF transform. 9
ryankiros/skip-thoughts Provides an implementation of Skip-Thought Vectors for encoding and analyzing sentence pairs 2,047
princetonml/sif A Python implementation of a sentence embedding algorithm using the Smooth Inverse Frequency weighting scheme 1,083
fursovia/geometric_embedding An implementation of a non-parameterized approach for building sentence representations 19
jwieting/acl2017 A codebase for training and using models of sentence embeddings. 33
johngiorgi/declutr A tool for training and evaluating sentence embeddings using deep contrastive learning 379
darienhuss/shotgunyara Tools and utilities for generating encoded versions of input data 9
chardet/chardet A character encoding detection tool for determining the encoding of text files. 2,181
binwang28/sbert-wk-sentence-embedding A method to generate sentence embeddings from pre-trained language models 177
sija/base62.cr A library that encodes numbers using a compact set of characters (Base62) and provides decoding functionality. 10
xiaoqijiao/coling2018 Provides training and testing code for a CNN-based sentence embedding model 2
oborchers/fast_sentence_embeddings A Python library for efficiently computing sentence embeddings from large datasets 618