PVT

An implementation of Pyramid Vision Transformers for image classification, object detection, and semantic segmentation tasks

Official implementation of the PVT series

GitHub

2k stars

23 watching

247 forks

Language: Python

Last commit: almost 3 years ago

Linked from 1 awesome list

backbone, detection, pvt, pvtv2, segmentation, transformer

Backlinks from these awesome lists:

weiaicunzai/awesome-image-classification

Related projects:

Repository	Description	Stars
huawei-noah/pretrained-ipt	This project develops a pre-trained transformer model for image processing tasks such as denoising, super-resolution, and deraining.	451
yitu-opensource/t2t-vit	A deep learning framework for training vision transformers from scratch on image data.	1,162
jeonsworld/vit-pytorch	A PyTorch implementation of the Vision Transformer model for image recognition tasks.	1,959
microsoft/cvt	An implementation of a new neural network architecture that combines the strengths of convolutional and transformer designs to improve performance on image classification tasks.	559
gordonhu608/mqt-llava	A vision-language model that uses a query transformer to encode images as visual tokens and allows flexible choice of the number of visual tokens.	101
pixart-alpha/pixart-sigma	Develops a PyTorch model for 4K text-to-image generation using a diffusion transformer	1,711
peiyunh/wysiwyg	Exploiting visibility for 3D object detection	115
zhoudaquan/dvit_repo	An implementation of Deep Vision Transformer models with modifications to improve performance by preventing attention collapse	137
wmvanvliet/pytorch_hmax	An implementation of the HMAX model for vision processing in PyTorch	57
4uiiurz1/pytorch-nested-unet	An implementation of a deep learning model for image segmentation using PyTorch	868
google-research/nested-transformer	An implementation of a transformer-based vision model that aggregates local transformers on image blocks to improve accuracy and efficiency.	195
pvs-hd-tea/pytypes	Infer static type information from Python code by tracing execution paths	11
javeywang/pyramid-attention-networks-pytorch	An implementation of a deep learning model using PyTorch for semantic segmentation tasks.	237
ruiminshen/yolo2-pytorch	A PyTorch implementation of the YOLO (You Only Look Once) v2 object detection algorithm with flexible configuration and parallel training capabilities.	440
davidtvs/pytorch-enet	A PyTorch implementation of a real-time semantic segmentation model using ENet architecture	392