iclr2016

Embedding models

Code for training universal paraphrastic sentence embeddings and models on semantic similarity tasks

Python code for training all models in the ICLR paper, "Towards Universal Paraphrastic Sentence Embeddings". These models achieve strong performance on semantic similarity tasks without any training or tuning on the training data for those tasks. They also can produce features that are at least as discriminative as skip-thought vectors for semantic similarity tasks at a minimum. Moreover, this code can achieve state-of-the-art results on entailment and sentiment tasks.

GitHub

193 stars

11 watching

53 forks

Language: Python

last commit: over 9 years ago

Related projects:

Repository	Description	Stars
jwieting/paragram-word	Trains word embeddings from a paraphrase database to represent semantic relationships between words.	30
jwieting/acl2017	A codebase for training and using models of sentence embeddings.	33
jwieting/para-nmt-50m	A collection of pre-trained models and code for training paraphrastic sentence embeddings from large machine translation datasets.	102
xiaoqijiao/coling2018	Provides training and testing code for a CNN-based sentence embedding model	2
tbepler/protein-sequence-embedding-iclr2019	Developing models to learn and represent protein sequences based on their structure	259
johngiorgi/declutr	A tool for training and evaluating sentence embeddings using deep contrastive learning	380
oborchers/fast_sentence_embeddings	A Python library for efficiently computing sentence embeddings from large datasets	618
wangyuxinwhy/uniem	Develops unified sentence embedding models for NLP tasks	840
iarroyof/sentence_embedding	A method to convert word embeddings into sentence representations by applying entropy weights calculated from TFIDF transform.	9
lajanugen/s2v	An implementation of a neural network model for learning efficient sentence representations from text data.	205
jwieting/charagram	A tool for training and using character n-gram based word and sentence embeddings in natural language processing.	125
voidism/diffcse	An unsupervised contrastive learning framework for learning sentence embeddings sensitive to differences between original and edited sentences.	292
princetonml/sif	A Python implementation of a sentence embedding algorithm using the Smooth Inverse Frequency weighting scheme	1,084
javeywang/pyramid-attention-networks-pytorch	An implementation of a deep learning model using PyTorch for semantic segmentation tasks.	237
antoine77340/mixture-of-embedding-experts	An open-source implementation of the Mixture-of-Embeddings-Experts model in Pytorch for video-text retrieval tasks.	118