vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

GitHub

507 stars
11 watching
33 forks
Language: Python
last commit: 9 months ago