nested-transformer

Vision Transformer

An implementation of a transformer-based vision model that aggregates local transformers on image blocks to improve accuracy and efficiency.
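
As a rough illustration of the "image blocks" idea in the description above, the sketch below splits a small 2D grid into non-overlapping square blocks, each flattened into a short token sequence that a local transformer could process independently. The function name `blockify` and the `block_size` parameter are hypothetical and chosen for this example; this is not the repository's code, and the attention and aggregation steps are left out.

```python
def blockify(image, block_size):
    """Split a 2D grid (list of rows) into non-overlapping square blocks.

    Hypothetical helper for illustration only. Returns the blocks in
    row-major order; each block is a flat list of block_size * block_size
    values, i.e. one short sequence for a local transformer.
    """
    height, width = len(image), len(image[0])
    if height % block_size or width % block_size:
        raise ValueError("image sides must be divisible by block_size")
    blocks = []
    for top in range(0, height, block_size):
        for left in range(0, width, block_size):
            blocks.append([
                image[r][c]
                for r in range(top, top + block_size)
                for c in range(left, left + block_size)
            ])
    return blocks


# A 4x4 "image" whose pixel at (row, col) holds 4 * row + col.
image = [[4 * r + c for c in range(4)] for r in range(4)]
blocks = blockify(image, block_size=2)
print(blocks)
# [[0, 1, 4, 5], [2, 3, 6, 7], [8, 9, 12, 13], [10, 11, 14, 15]]
```

Because each block is processed on its own, attention cost depends on the block length rather than the full image size; information would then have to be shared across blocks by a separate aggregation step, which this sketch does not attempt to show.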

Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf

GitHub

193 stars
8 watching
26 forks
Language: Jupyter Notebook
last commit: 4 months ago
Linked from 1 awesome list

imagenet, transformer, vision

Related projects:

Repository | Description | Stars
jeonsworld/vit-pytorch | A PyTorch implementation of the Vision Transformer model for image recognition tasks | 1,940
lahoud/3d-vision-transformers | Compiles and shares 3D computer vision papers using transformer models | 406
dirtyharrylyl/transformer-in-vision | A collection of resources and papers related to Transformer-based computer vision models and techniques | 1,319
microsoft/focal-transformer | A vision transformer architecture that uses a novel attention mechanism to capture local-global interactions in images | 545
google-research/vision_transformer | Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax | 10,502
yitu-opensource/t2t-vit | A deep learning framework for training vision transformers from scratch on image data | 1,148
zhoudaquan/dvit_repo | An implementation of Deep Vision Transformer models with modifications to improve performance by preventing attention collapse | 136
whai362/pvt | An implementation of Pyramid Vision Transformers for image classification, object detection, and semantic segmentation tasks | 1,728
lucidrains/reformer-pytorch | An implementation of Reformer, an efficient Transformer model for natural language processing tasks | 2,120
gordonhu608/mqt-llava | A vision-language model that uses a query transformer to encode images as visual tokens and allows flexible choice of the number of visual tokens | 97
gamrix/cs231n_proj | This project focuses on manipulating 3D views using deep learning techniques | 6
zsdonghao/spatial-transformer-nets | An implementation of Spatial Transformer Networks in TensorFlow for learning to apply transformations to images via classification tasks | 36
josejuanqm/ody | An image generator that applies various transformations and styles to images | 45
ibrahimsobh/transformers | An implementation of deep neural network architectures, including Transformers, in Python | 212
matlab-deep-learning/transformer-models | An implementation of deep learning transformer models in MATLAB | 206