UNK-VQA
VQA challenge data
A VQA dataset with unanswerable questions designed to test the limits of large models' knowledge and reasoning abilities.
A VQA dataset that includes unanswerable questions [TPAMI 2024].
3 stars
1 watching
0 forks
last commit: 4 months ago Related projects:
Repository | Description | Stars |
---|---|---|
hyeonwoonoh/vqa-transfer-externaldata | Tools and scripts for training and evaluating a visual question answering model using transfer learning from an external data source. | 20 |
hengyuan-hu/bottom-up-attention-vqa | An implementation of a VQA system using bottom-up attention, aiming to improve the efficiency and speed of visual question answering tasks. | 755 |
gt-vision-lab/vqa_lstm_cnn | A Visual Question Answering model using a deeper LSTM and normalized CNN architecture. | 377 |
findalexli/scigraphqa | A dataset and benchmarking framework for evaluating the performance of large language models on multi-turn question answering tasks for scientific graphs. | 38 |
akirafukui/vqa-mcb | A software framework for training and deploying multimodal visual question answering models using compact bilinear pooling. | 222 |
xiaoman-zhang/pmc-vqa | A medical visual question-answering dataset and toolkit for training models to understand medical images and instructions. | 180 |
maluuba/newsqa | Compiles and provides structured access to Maluuba's NewsQA dataset for natural language question answering research. | 253 |
jayleicn/tvqa | PyTorch implementation of video question answering system based on TVQA dataset | 172 |
kha7iq/kha7iq | An infinite loop simulator | 26 |
cadene/vqa.pytorch | A PyTorch implementation of visual question answering with multimodal representation learning | 718 |
ysu1989/graphquestions | A characteristic-rich dataset for factoid question answering with explicit specification of question characteristics and logical forms. | 92 |
google-deepmind/narrativeqa | A dataset collection providing text documents with corresponding summaries and questions. | 463 |
masaiahhan/correlationqa | An investigation into the relationship between misleading images and hallucinations in large language models | 8 |
jnhwkim/nips-mrn-vqa | This project presents a neural network model designed to answer visual questions by combining question and image features in a residual learning framework. | 39 |
lucasvazq/lucasvazq | Personal showcase of a developer's experience and expertise in building web platforms with a focus on accessibility, performance, and robust code. | 30 |