TAG
Text VQA Generator
A Python-based system for generating visual question-answer pairs using text-aware approaches to improve Text-VQA performance.
21 stars
1 watching
0 forks
Language: Python
last commit: almost 3 years ago

Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Tools and scripts for training and evaluating a visual question answering model using transfer learning from an external data source. | 20 |
| | An implementation of a VQA system using bottom-up attention, aiming to improve the efficiency and speed of visual question answering tasks. | 755 |
| | Implementations and tools for training and fine-tuning a visual question answering model based on the 2017 CVPR workshop winner's approach. | 164 |
| | An implementation of a two-stage framework designed to prompt large language models with answer heuristics for knowledge-based visual question answering tasks. | 270 |
| | Autonomously generates high-quality image-text instruction fine-tuning datasets. | 91 |
| | This project provides code and corpora for creating word embeddings by considering the visual characteristics of words. | 15 |
| | A PyTorch implementation of visual question answering with multimodal representation learning. | 718 |
| | A framework for generating and executing Python code to solve visual inference tasks using large language models. | 1,666 |
| | Automates the creation of uniform reports from various input sources in web application security assessments. | 66 |
| | A tool for rendering LaTeX documents to HTML5, enabling interactive content on the web. | 61 |
| | A text-to-image tool using CLIP and FFT/DWT parameters to generate detailed images from user-provided text prompts. | 778 |
| | A pretrained Chinese text generation model trained on large-scale data. | 558 |
| | Generates data for CARLA's visual navigation system using raw camera images and instructions. | 8 |
| | A PyTorch implementation of reading comprehension over Wikipedia for answering open-domain questions, using the SQuAD dataset. | 401 |
| | A PyTorch implementation of a video question answering system based on the TVQA dataset. | 172 |