TVQA
VQA system
PyTorch implementation of video question answering system based on TVQA dataset
[EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering
172 stars
9 watching
32 forks
Language: Python
last commit: over 2 years ago datasetpytorchtvqavideoqa
Related projects:
Repository | Description | Stars |
---|---|---|
| A PyTorch implementation of visual question answering with multimodal representation learning | 718 |
| An efficient framework for end-to-end learning on image-text and video-text tasks | 709 |
| Implementations and tools for training and fine-tuning a visual question answering model based on the 2017 CVPR workshop winner's approach. | 164 |
| A PyTorch-based model for answering questions about videos based on unseen scenes and storylines | 19 |
| A PyTorch implementation of an improved question answering architecture with dynamic memory networks and attention mechanisms | 64 |
| Implementing reading comprehension from Wikipedia questions to answer open-domain queries using PyTorch and SQuAD dataset | 401 |
| Tools and scripts for training and evaluating a visual question answering model using transfer learning from an external data source. | 20 |
| A PyTorch-based simulator for quantum machine learning | 45 |
| PyTorch implementation of video captioning, combining deep learning and computer vision techniques. | 402 |
| An implementation of a VQA system using bottom-up attention, aiming to improve the efficiency and speed of visual question answering tasks. | 755 |
| A PyTorch implementation of a real-time semantic segmentation model using ENet architecture | 392 |
| A Visual Question Answering model using a deeper LSTM and normalized CNN architecture. | 377 |
| A PyTorch implementation of Neural Module Networks for Visual Question Answering | 108 |
| PyTorch implementation of PNASNet-5 architecture | 317 |
| Trains PyTorch models using a genetic algorithm | 96 |