vqa.pytorch

VQA model

A PyTorch implementation of visual question answering with multimodal representation learning

Visual Question Answering in PyTorch

GitHub

718 stars

33 watching

177 forks

Language: Python

Last commit: over 5 years ago

clevr, coco, deep-learning, pytorch, resnet, skipthoughts, torch, vgenome, vqa

Related projects:

Repository	Description	Stars
jayleicn/tvqa	PyTorch implementation of video question answering system based on TVQA dataset	172
markdtw/vqa-winner-cvprw-2017	Implementations and tools for training and fine-tuning a visual question answering model based on the 2017 CVPR workshop winner's approach.	164
hengyuan-hu/bottom-up-attention-vqa	An implementation of a VQA system using bottom-up attention, aiming to improve the efficiency and speed of visual question answering tasks.	755
noagarcia/roll-videoqa	A PyTorch-based model for answering questions about videos based on unseen scenes and storylines	19
milvlg/prophet	An implementation of a two-stage framework designed to prompt large language models with answer heuristics for knowledge-based visual question answering tasks.	270
gt-vision-lab/vqa_lstm_cnn	A Visual Question Answering model using a deeper LSTM and normalized CNN architecture.	377
hitvoice/drqa	A PyTorch implementation of a reading comprehension model that reads Wikipedia to answer open-domain questions, trained on the SQuAD dataset	401
jayleicn/clipbert	An efficient framework for end-to-end learning on image-text and video-text tasks	709
pasqal-io/pyqtorch	A PyTorch-based simulator for quantum machine learning	45
hyeonwoonoh/vqa-transfer-externaldata	Tools and scripts for training and evaluating a visual question answering model using transfer learning from an external data source.	20
kaiyangzhou/dassl.pytorch	A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision.	1,236
vlgiitr/dmn-plus	A PyTorch implementation of an improved question answering architecture with dynamic memory networks and attention mechanisms	64
xxradon/igcv3-pytorch	Reimplements MobileNet-V2 and IGCV3 using PyTorch for efficient deep learning.	19
prabhuomkar/pytorch-cpp	A C++ implementation of PyTorch tutorials	1,978
prlz77/resnext.pytorch	Reproduces ResNeXt with PyTorch for computer vision tasks	511