Transformer-in-Vision
CV Transformers
A collection of resources and papers related to Transformer-based computer vision models and techniques.
Recent Transformer-based CV and related works.
1k stars
87 watching
144 forks
last commit: over 1 year ago
Linked from 1 awesome list
computer-visiondeep-learningmulti-modalpaperself-attentiontransformervision-transformersvisual-language
Related projects:
Repository | Description | Stars |
---|---|---|
lahoud/3d-vision-transformers | Compiles and shares 3D computer vision papers using transformer models | 406 |
google-research/nested-transformer | An implementation of a transformer-based vision model that aggregates local transformers on image blocks to improve accuracy and efficiency. | 193 |
microsoft/cvt | An implementation of a new neural network architecture that combines the strengths of convolutional and transformer designs to improve performance on image classification tasks. | 555 |
donnyyou/pytorchcv | A PyTorch-based framework for building and training deep learning models in computer vision. | 47 |
jeonsworld/vit-pytorch | A PyTorch implementation of the Vision Transformer model for image recognition tasks. | 1,940 |
gamrix/cs231n_proj | This project focuses on manipulating 3D views using deep learning techniques. | 6 |
tongjilibo/bert4torch | An implementation of transformer models in PyTorch for natural language processing tasks | 1,241 |
jhcho99/coformer | An implementation of a deep learning model for grounding situation recognition in images | 43 |
ibrahimsobh/transformers | An implementation of deep neural network architectures, including Transformers, in Python. | 212 |
swintransformer/video-swin-transformer | An implementation of the Video Swin Transformer architecture for video recognition tasks | 1,452 |
atiyo/deep_image_prior | Reconstructs images using untrained neural networks to manipulate and transform existing images | 215 |
swz30/restormer | Proposes an efficient neural architecture model for high-resolution image restoration tasks | 1,805 |
nebgnahz/cv-rs | Rust wrapper around OpenCV 3.x | 204 |
kaiyangzhou/dassl.pytorch | A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,217 |
whai362/pvt | An implementation of Pyramid Vision Transformers for image classification, object detection, and semantic segmentation tasks | 1,728 |