SimCSE

Sentence embedder

An open source framework for learning sentence embeddings using contrastive learning.

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

GitHub

3k stars
29 watching
518 forks
Language: Python
last commit: about 1 month ago
Linked from 1 awesome list

nlpsentence-embeddings

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
xiaoqijiao/coling2018 Provides training and testing code for a CNN-based sentence embedding model 2
voidism/diffcse An unsupervised contrastive learning framework for learning sentence embeddings sensitive to differences between original and edited sentences. 291
jwieting/iclr2016 Code for training universal paraphrastic sentence embeddings and models on semantic similarity tasks 193
bigscience-workshop/promptsource A toolkit for creating and using natural language prompts to enable large language models to generalize to new tasks. 2,700
princetonml/sif A Python implementation of a sentence embedding algorithm using the Smooth Inverse Frequency weighting scheme 1,083
huggingface/text-embeddings-inference A blazing fast inference solution for text embeddings models. 2,838
alexandres/lexvec An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks 803
oxford-cs-deepnlp-2017/lectures An open-source repository containing lecture slides and course materials for an advanced natural language processing course. 15,683
hit-scir/elmoformanylangs Provides pre-trained ELMo representations for multiple languages to improve NLP tasks. 1,463
wangyuxinwhy/uniem Develops unified sentence embedding models for NLP tasks 833
jwieting/paragram-word Trains word embeddings from a paraphrase database to represent semantic relationships between words. 30
dair-ai/ml-papers-explained An explanation of key concepts and advancements in the field of Machine Learning 7,315
brightmart/text_classification An NLP project offering various text classification models and techniques for deep learning exploration 7,861
explosion/spacy Industrial-strength NLP library for Python and Cython 30,230
harsh19/spine Transforms existing word embeddings into more interpretable ones by applying a novel extension of k-sparse autoencoder with stricter sparsity constraints 52