nested-transformer
Vision Transformer
An implementation of a transformer-based vision model that aggregates local transformers on image blocks to improve accuracy and efficiency.
Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf
193 stars
8 watching
26 forks
Language: Jupyter Notebook
last commit: 4 months ago
Linked from 1 awesome list
imagenettransformervision
Related projects:
Repository | Description | Stars |
---|---|---|
jeonsworld/vit-pytorch | A PyTorch implementation of the Vision Transformer model for image recognition tasks. | 1,940 |
lahoud/3d-vision-transformers | Compiles and shares 3D computer vision papers using transformer models | 406 |
dirtyharrylyl/transformer-in-vision | A collection of resources and papers related to Transformer-based computer vision models and techniques. | 1,319 |
microsoft/focal-transformer | A vision transformer architecture that uses a novel attention mechanism to capture local-global interactions in images | 545 |
google-research/vision_transformer | Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax | 10,502 |
yitu-opensource/t2t-vit | A deep learning framework for training vision transformers from scratch on image data. | 1,148 |
zhoudaquan/dvit_repo | An implementation of Deep Vision Transformer models with modifications to improve performance by preventing attention collapse | 136 |
whai362/pvt | An implementation of Pyramid Vision Transformers for image classification, object detection, and semantic segmentation tasks | 1,728 |
lucidrains/reformer-pytorch | An implementation of Reformer, an efficient Transformer model for natural language processing tasks. | 2,120 |
gordonhu608/mqt-llava | A vision-language model that uses a query transformer to encode images as visual tokens and allows flexible choice of the number of visual tokens. | 97 |
gamrix/cs231n_proj | This project focuses on manipulating 3D views using deep learning techniques. | 6 |
zsdonghao/spatial-transformer-nets | An implementation of Spatial Transformer Networks in TensorFlow for learning to apply transformations to images via classification tasks. | 36 |
josejuanqm/ody | An image generator that applies various transformations and styles to images | 45 |
ibrahimsobh/transformers | An implementation of deep neural network architectures, including Transformers, in Python. | 212 |
matlab-deep-learning/transformer-models | An implementation of deep learning transformer models in MATLAB | 206 |