vstar
PyTorch implementation of "V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs"
507 stars
11 watching
33 forks
Language: Python
last commit: 9 months ago