PMC-VQA
Medical image understanding toolkit
A medical visual question-answering dataset and toolkit for training models to understand medical images and instructions.
PMC-VQA is a large-scale medical visual question-answering dataset, which contains 227k VQA pairs of 149k images that cover various modalities or diseases.
180 stars
3 watching
11 forks
Language: Python
last commit: 3 months ago Related projects:
Repository | Description | Stars |
---|---|---|
| A PyTorch implementation of visual question answering with multimodal representation learning | 718 |
| This project provides code for training image question answering models using stacked attention networks and convolutional neural networks. | 108 |
| An implementation of a two-stage framework designed to prompt large language models with answer heuristics for knowledge-based visual question answering tasks. | 270 |
| A Visual Question Answering model using a deeper LSTM and normalized CNN architecture. | 377 |
| A software framework for training and deploying multimodal visual question answering models using compact bilinear pooling. | 222 |
| An implementation of a VQA system using bottom-up attention, aiming to improve the efficiency and speed of visual question answering tasks. | 755 |
| This project presents a neural network model designed to answer visual questions by combining question and image features in a residual learning framework. | 39 |
| PyTorch implementation of video question answering system based on TVQA dataset | 172 |
| A multimodal LLM designed to handle text-rich visual questions | 270 |
| An open-source toolbox for action understanding from video data using PyTorch. | 1,863 |
| A comprehensive computer vision toolkit with tools and algorithms for video and image analytics in multiple environments. | 291 |
| A VQA dataset with unanswerable questions designed to test the limits of large models' knowledge and reasoning abilities. | 3 |
| Provides standardized implementations of algorithms for non-contact physiological measurement using image analysis techniques. | 193 |
| A toolkit for interactive visualization of high-resolution bioimaging data. | 290 |
| Tools and scripts for training and evaluating a visual question answering model using transfer learning from an external data source. | 20 |