Insight-V

Visual Reasoning Engine

Exploring long-chain visual reasoning with large language models to improve image understanding

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

GitHub

113 stars
9 watching
4 forks
Language: Python
last commit: about 2 months ago