iclr2016

Embedding models

Code for training universal paraphrastic sentence embeddings and models on semantic similarity tasks

Python code for training all models in the ICLR paper, "Towards Universal Paraphrastic Sentence Embeddings". These models achieve strong performance on semantic similarity tasks without any training or tuning on the training data for those tasks. They also can produce features that are at least as discriminative as skip-thought vectors for semantic similarity tasks at a minimum. Moreover, this code can achieve state-of-the-art results on entailment and sentiment tasks.

GitHub

193 stars
11 watching
53 forks
Language: Python
last commit: almost 9 years ago

Related projects:

Repository Description Stars
jwieting/paragram-word Trains word embeddings from a paraphrase database to represent semantic relationships between words. 30
jwieting/acl2017 A codebase for training and using models of sentence embeddings. 33
jwieting/para-nmt-50m A collection of pre-trained models and code for training paraphrastic sentence embeddings from large machine translation datasets. 102
xiaoqijiao/coling2018 Provides training and testing code for a CNN-based sentence embedding model 2
tbepler/protein-sequence-embedding-iclr2019 A framework for learning protein sequence and structure embeddings using deep learning models. 258
johngiorgi/declutr A tool for training and evaluating sentence embeddings using deep contrastive learning 379
oborchers/fast_sentence_embeddings A Python library for efficiently computing sentence embeddings from large datasets 618
wangyuxinwhy/uniem Develops unified sentence embedding models for NLP tasks 833
iarroyof/sentence_embedding A method to convert word embeddings into sentence representations by applying entropy weights calculated from TFIDF transform. 9
lajanugen/s2v An implementation of a neural network model for learning efficient sentence representations from text data. 205
jwieting/charagram A tool for training and using character n-gram based word and sentence embeddings in natural language processing. 125
voidism/diffcse An unsupervised contrastive learning framework for learning sentence embeddings sensitive to differences between original and edited sentences. 291
princetonml/sif A Python implementation of a sentence embedding algorithm using the Smooth Inverse Frequency weighting scheme 1,083
javeywang/pyramid-attention-networks-pytorch An implementation of a deep learning model using PyTorch for semantic segmentation tasks. 235
antoine77340/mixture-of-embedding-experts An open-source implementation of the Mixture-of-Embeddings-Experts model in Pytorch for video-text retrieval tasks. 118