vqa.pytorch
VQA model
A PyTorch implementation of visual question answering with multimodal representation learning
Visual Question Answering in PyTorch
716 stars
33 watching
178 forks
Language: Python
Last commit: almost 5 years ago
Topics: clevr, coco, deep-learning, pytorch, resnet, skipthoughts, torch, vgenome, vqa
Related projects:
Repository | Description | Stars |
---|---|---|
jayleicn/tvqa | A PyTorch implementation of a video question answering system based on the TVQA dataset | 172 |
markdtw/vqa-winner-cvprw-2017 | Implementations and tools for training and fine-tuning a visual question answering model based on the 2017 CVPR workshop winner's approach. | 164 |
hengyuan-hu/bottom-up-attention-vqa | An implementation of a VQA system using bottom-up attention, aiming to improve the efficiency and speed of visual question answering tasks. | 754 |
noagarcia/roll-videoqa | A PyTorch-based model for answering questions about videos based on unseen scenes and storylines | 19 |
milvlg/prophet | An implementation of a two-stage framework designed to prompt large language models with answer heuristics for knowledge-based visual question answering tasks. | 267 |
gt-vision-lab/vqa_lstm_cnn | A Visual Question Answering model using a deeper LSTM and normalized CNN architecture. | 376 |
hitvoice/drqa | A PyTorch implementation of reading comprehension over Wikipedia for answering open-domain questions, using the SQuAD dataset | 401 |
jayleicn/clipbert | An efficient framework for end-to-end learning on image-text and video-text tasks | 704 |
pasqal-io/pyqtorch | A PyTorch-based simulator for quantum machine learning | 45 |
hyeonwoonoh/vqa-transfer-externaldata | Tools and scripts for training and evaluating a visual question answering model using transfer learning from an external data source. | 20 |
kaiyangzhou/dassl.pytorch | A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,217 |
vlgiitr/dmn-plus | A PyTorch implementation of an improved question answering architecture with dynamic memory networks and attention mechanisms | 64 |
xxradon/igcv3-pytorch | Reimplements MobileNet-V2 and IGCV3 using PyTorch for efficient deep learning. | 19 |
prabhuomkar/pytorch-cpp | A C++ implementation of PyTorch tutorials | 1,965 |
prlz77/resnext.pytorch | Reproduces ResNeXt (listed as ResNet-V3) with PyTorch for computer vision tasks | 508 |