SimCSE
Sentence embedder
An open source framework for learning sentence embeddings using contrastive learning.
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
3k stars
29 watching
518 forks
Language: Python
last commit: about 1 month ago
Linked from 1 awesome list
nlpsentence-embeddings
Related projects:
Repository | Description | Stars |
---|---|---|
xiaoqijiao/coling2018 | Provides training and testing code for a CNN-based sentence embedding model | 2 |
voidism/diffcse | An unsupervised contrastive learning framework for learning sentence embeddings sensitive to differences between original and edited sentences. | 291 |
jwieting/iclr2016 | Code for training universal paraphrastic sentence embeddings and models on semantic similarity tasks | 193 |
bigscience-workshop/promptsource | A toolkit for creating and using natural language prompts to enable large language models to generalize to new tasks. | 2,700 |
princetonml/sif | A Python implementation of a sentence embedding algorithm using the Smooth Inverse Frequency weighting scheme | 1,083 |
huggingface/text-embeddings-inference | A blazing fast inference solution for text embeddings models. | 2,838 |
alexandres/lexvec | An implementation of a word embedding model that uses character n-grams and achieves state-of-the-art results in multiple NLP tasks | 803 |
oxford-cs-deepnlp-2017/lectures | An open-source repository containing lecture slides and course materials for an advanced natural language processing course. | 15,683 |
hit-scir/elmoformanylangs | Provides pre-trained ELMo representations for multiple languages to improve NLP tasks. | 1,463 |
wangyuxinwhy/uniem | Develops unified sentence embedding models for NLP tasks | 833 |
jwieting/paragram-word | Trains word embeddings from a paraphrase database to represent semantic relationships between words. | 30 |
dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,315 |
brightmart/text_classification | An NLP project offering various text classification models and techniques for deep learning exploration | 7,861 |
explosion/spacy | Industrial-strength NLP library for Python and Cython | 30,230 |
harsh19/spine | Transforms existing word embeddings into more interpretable ones by applying a novel extension of k-sparse autoencoder with stricter sparsity constraints | 52 |