Transformer-in-Vision

CV Transformers

A collection of resources and papers related to Transformer-based computer vision models and techniques.

Recent Transformer-based CV and related works.

GitHub

1k stars
87 watching
144 forks
last commit: about 1 year ago
Linked from 1 awesome list

computer-vision, deep-learning, multi-modal, paper, self-attention, transformer, vision-transformers, visual-language

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| lahoud/3d-vision-transformers | Compiles and shares 3D computer vision papers using transformer models | 406 |
| google-research/nested-transformer | An implementation of a transformer-based vision model that aggregates local transformers on image blocks to improve accuracy and efficiency | 193 |
| microsoft/cvt | An implementation of a new neural network architecture that combines the strengths of convolutional and transformer designs to improve performance on image classification tasks | 555 |
| donnyyou/pytorchcv | A PyTorch-based framework for building and training deep learning models in computer vision | 47 |
| jeonsworld/vit-pytorch | A PyTorch implementation of the Vision Transformer model for image recognition tasks | 1,940 |
| gamrix/cs231n_proj | This project focuses on manipulating 3D views using deep learning techniques | 6 |
| tongjilibo/bert4torch | An implementation of transformer models in PyTorch for natural language processing tasks | 1,241 |
| jhcho99/coformer | An implementation of a deep learning model for grounding situation recognition in images | 43 |
| ibrahimsobh/transformers | An implementation of deep neural network architectures, including Transformers, in Python | 212 |
| swintransformer/video-swin-transformer | An implementation of the Video Swin Transformer architecture for video recognition tasks | 1,444 |
| atiyo/deep_image_prior | Reconstructs images using untrained neural networks to manipulate and transform existing images | 215 |
| swz30/restormer | Proposes an efficient neural architecture model for high-resolution image restoration tasks | 1,805 |
| nebgnahz/cv-rs | Rust wrapper around OpenCV 3.x | 204 |
| kaiyangzhou/dassl.pytorch | A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision | 1,217 |
| whai362/pvt | An implementation of Pyramid Vision Transformers for image classification, object detection, and semantic segmentation tasks | 1,728 |