Attention transfer

Improves the performance of a student convolutional neural network by training it to mimic the attention maps of a stronger teacher network.

Improving Convolutional Networks via Attention Transfer (ICLR 2017)
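To make the description concrete, here is a minimal, framework-free sketch of an activation-based attention transfer loss in the spirit of the paper above. It is an illustration under stated assumptions, not the repository's code: the function names `attention_map` and `attention_transfer_loss` are hypothetical, activations are plain nested lists shaped channels x height x width, and the attention map is taken to be the channel-wise mean of squared activations, L2-normalized.

```python
import math


def attention_map(activation):
    """Collapse a C x H x W activation into a flat, L2-normalized spatial map.

    Assumption: attention is the mean over channels of squared activations.
    """
    channels = len(activation)
    height = len(activation[0])
    width = len(activation[0][0])
    flat = [
        sum(activation[c][i][j] ** 2 for c in range(channels)) / channels
        for i in range(height)
        for j in range(width)
    ]
    norm = math.sqrt(sum(v * v for v in flat))
    # Leave an all-zero map untouched to avoid dividing by zero.
    return [v / norm for v in flat] if norm > 0 else flat


def attention_transfer_loss(student_act, teacher_act):
    """Mean squared difference between student and teacher attention maps."""
    s = attention_map(student_act)
    t = attention_map(teacher_act)
    return sum((a - b) ** 2 for a, b in zip(s, t)) / len(s)


# Hypothetical activations: the teacher has 2 channels, the student has 1.
teacher = [[[1.0, 2.0], [3.0, 4.0]], [[0.5, 0.0], [1.0, 2.0]]]
student = [[[1.0, 0.0], [0.0, 1.0]]]
print(attention_transfer_loss(student, teacher))
```

Because the channel dimension is averaged away, the student and teacher only need matching spatial size, not matching width. In training, a term like this would typically be added to the student's ordinary classification loss with a weighting factor.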

GitHub

1k stars

51 watching

276 forks

Language: Jupyter Notebook

last commit: about 7 years ago

Tags: attention, deep-learning, knowledge-distillation, pytorch

arxiv.org/abs/1612.03928

Related projects:

Repository	Description	Stars
google-research/big_transfer	Pre-trained models and code for fine-tuning image recognition tasks using deep learning frameworks	1,516
lancopku/iais	A method for calibrating attention distributions in multimodal models to improve contextualized representations of image-text pairs	30
szagoruyko/binary-wide-resnet	An implementation of a 1-bit weight neural network architecture using PyTorch	124
szagoruyko/wide-residual-networks	An experimental study on residual networks to improve depth and width trade-offs in neural networks	1,299
koichiro11/residual-attention-network	An image classification neural network implementation using attention mechanisms and residual learning	94
pistony/residualattentionnetwork	A Gluon implementation of Residual Attention Network for image classification tasks	108
benedekrozemberczki/sgcn	An implementation of a deep learning algorithm for graph data	270
benedekrozemberczki/appnp	A PyTorch implementation of a graph neural network model that learns personalized node representations	367
ahirner/pytorch-retraining	An experiment and benchmarking framework for evaluating the effectiveness of transfer learning in PyTorch-based deep learning models	170
peteanderson80/bottom-up-attention	Trains a bottom-up attention model using Faster R-CNN and Visual Genome annotations for image captioning and VQA tasks	1,438
jhu-lcsr/good_robot	Research on repurposing reinforcement learning for transfer between tasks in robotics and multi-step visual tasks with simulation-to-real transfer	108
javeywang/pyramid-attention-networks-pytorch	A PyTorch implementation of a deep learning model for semantic segmentation tasks	237
benedekrozemberczki/gam	An implementation of a graph classification model using structural attention and PyTorch	270
hszhao/psanet	A deep learning framework for semantic segmentation with spatial attention mechanisms	218
emedvedev/attention-ocr	A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture	1,079