TVQA
VQA system
PyTorch implementation of video question answering system based on TVQA dataset
[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering
172 stars
9 watching
32 forks
Language: Python
last commit: about 3 years ago datasetpytorchtvqavideoqa
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A PyTorch implementation of visual question answering with multimodal representation learning | 718 |
| | An efficient framework for end-to-end learning on image-text and video-text tasks | 709 |
| | Implementations and tools for training and fine-tuning a visual question answering model based on the 2017 CVPR workshop winner's approach. | 164 |
| | A PyTorch-based model for answering questions about videos based on unseen scenes and storylines | 19 |
| | A PyTorch implementation of an improved question answering architecture with dynamic memory networks and attention mechanisms | 64 |
| | Implementing reading comprehension from Wikipedia questions to answer open-domain queries using PyTorch and SQuAD dataset | 401 |
| | Tools and scripts for training and evaluating a visual question answering model using transfer learning from an external data source. | 20 |
| | A PyTorch-based simulator for quantum machine learning | 45 |
| | PyTorch implementation of video captioning, combining deep learning and computer vision techniques. | 402 |
| | An implementation of a VQA system using bottom-up attention, aiming to improve the efficiency and speed of visual question answering tasks. | 755 |
| | A PyTorch implementation of a real-time semantic segmentation model using ENet architecture | 392 |
| | A Visual Question Answering model using a deeper LSTM and normalized CNN architecture. | 377 |
| | A PyTorch implementation of Neural Module Networks for Visual Question Answering | 108 |
| | PyTorch implementation of PNASNet-5 architecture | 317 |
| | Trains PyTorch models using a genetic algorithm | 96 |