Insight-V
Visual Reasoning Engine
Exploring long-chain visual reasoning with large language models to improve image understanding
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
113 stars
9 watching
4 forks
Language: Python
last commit: about 2 months ago