PVT
PVT
An implementation of Pyramid Vision Transformers for image classification, object detection, and semantic segmentation tasks
Official implementation of PVT series
2k stars
23 watching
246 forks
Language: Python
last commit: about 2 years ago
Linked from 1 awesome list
backbonedetectionpvtpvtv2segmentationtransformer
Related projects:
Repository | Description | Stars |
---|---|---|
huawei-noah/pretrained-ipt | This project develops a pre-trained transformer model for image processing tasks such as denoising, super-resolution, and deraining. | 449 |
yitu-opensource/t2t-vit | A deep learning framework for training vision transformers from scratch on image data. | 1,151 |
jeonsworld/vit-pytorch | A PyTorch implementation of the Vision Transformer model for image recognition tasks. | 1,950 |
microsoft/cvt | An implementation of a new neural network architecture that combines the strengths of convolutional and transformer designs to improve performance on image classification tasks. | 556 |
gordonhu608/mqt-llava | A vision-language model that uses a query transformer to encode images as visual tokens and allows flexible choice of the number of visual tokens. | 100 |
pixart-alpha/pixart-sigma | Develops a PyTorch model for 4K text-to-image generation using diffusion transformer | 1,692 |
peiyunh/wysiwyg | Exploiting visibility for 3D object detection | 115 |
zhoudaquan/dvit_repo | An implementation of Deep Vision Transformer models with modifications to improve performance by preventing attention collapse | 136 |
wmvanvliet/pytorch_hmax | An implementation of the HMAX model for vision processing in PyTorch | 57 |
4uiiurz1/pytorch-nested-unet | An implementation of a deep learning model for image segmentation using PyTorch | 860 |
google-research/nested-transformer | An implementation of a transformer-based vision model that aggregates local transformers on image blocks to improve accuracy and efficiency. | 193 |
pvs-hd-tea/pytypes | Infer static type information from Python code by tracing execution paths | 11 |
javeywang/pyramid-attention-networks-pytorch | An implementation of a deep learning model using PyTorch for semantic segmentation tasks. | 236 |
ruiminshen/yolo2-pytorch | A PyTorch implementation of the YOLO (You Only Look Once) v2 object detection algorithm with flexible configuration and parallel training capabilities. | 440 |
davidtvs/pytorch-enet | A PyTorch implementation of a real-time semantic segmentation model using ENet architecture | 390 |