vqa-winner-cvprw-2017
VQA Model Trainer
Implementations and tools for training and fine-tuning a visual question answering model based on the 2017 CVPR workshop winner's approach.
Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17
164 stars
11 watching
38 forks
Language: Python
last commit: about 6 years ago pytorchvisual-question-answering
Related projects:
Repository | Description | Stars |
---|---|---|
| A PyTorch implementation of visual question answering with multimodal representation learning | 718 |
| Tools and scripts for training and evaluating a visual question answering model using transfer learning from an external data source. | 20 |
| PyTorch implementation of video question answering system based on TVQA dataset | 172 |
| An implementation of a two-stage framework designed to prompt large language models with answer heuristics for knowledge-based visual question answering tasks. | 270 |
| Implementing reading comprehension from Wikipedia questions to answer open-domain queries using PyTorch and SQuAD dataset | 401 |
| An implementation of a VQA system using bottom-up attention, aiming to improve the efficiency and speed of visual question answering tasks. | 755 |
| An efficient framework for end-to-end learning on image-text and video-text tasks | 709 |
| A PyTorch-based simulator for quantum machine learning | 45 |
| PyTorch implementation of guided visual search mechanism for multimodal LLMs | 541 |
| A Visual Question Answering model using a deeper LSTM and normalized CNN architecture. | 377 |
| A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,236 |
| An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching | 294 |
| A PyTorch-based framework for training large language models in parallel on multiple devices | 679 |
| A PyTorch implementation of visual-semantic embedding methods for image-caption retrieval | 492 |
| A PyTorch implementation of an improved question answering architecture with dynamic memory networks and attention mechanisms | 64 |