attention-transfer

Attention transfer

Improves performance of convolutional neural networks by transferring knowledge from teacher models to student models using attention mechanisms.

Improving Convolutional Networks via Attention Transfer (ICLR 2017)

GitHub

1k stars
51 watching
276 forks
Language: Jupyter Notebook
last commit: over 6 years ago
attentiondeep-learningknowledge-distillationpytorch

Related projects:

Repository Description Stars
google-research/big_transfer Pre-trained models and code for fine-tuning image recognition tasks using deep learning frameworks 1,513
lancopku/iais This project proposes a novel method for calibrating attention distributions in multimodal models to improve contextualized representations of image-text pairs. 30
szagoruyko/binary-wide-resnet An implementation of a 1-bit weight neural network architecture using PyTorch 124
szagoruyko/wide-residual-networks An experimental study on residual networks to improve depth and width trade-offs in neural networks 1,298
koichiro11/residual-attention-network An image classification neural network implementation using attention mechanisms and residual learning 94
pistony/residualattentionnetwork A Gluon implementation of Residual Attention Network for image classification tasks 107
benedekrozemberczki/sgcn An implementation of a deep learning algorithm for graph data 268
benedekrozemberczki/appnp A PyTorch implementation of a graph neural network model that learns personalized node representations 363
ahirner/pytorch-retraining An experiment and benchmarking framework for evaluating the effectiveness of transfer learning in PyTorch-based deep learning models 170
peteanderson80/bottom-up-attention Trains a bottom-up attention model using Faster R-CNN and Visual Genome annotations for image captioning and VQA tasks 1,433
jhu-lcsr/good_robot Research on repurposing reinforcement learning for transfer between tasks in robotics and multi-step visual tasks with simulation-to-real transfer 107
javeywang/pyramid-attention-networks-pytorch An implementation of a deep learning model using PyTorch for semantic segmentation tasks. 235
benedekrozemberczki/gam An implementation of a graph classification model using structural attention and PyTorch 268
hszhao/psanet A deep learning framework for semantic segmentation with spatial attention mechanisms 216
emedvedev/attention-ocr A TensorFlow model for recognizing text in images using visual attention and a sequence-to-sequence architecture. 1,077