LinVT

Video model adapter

An approach to adapt large language models trained on images to understand videos

LinVT: Empower Your Image-level Large Language Model to Understand Videos

GitHub

13 stars
3 watching
0 forks
Language: Python
last commit: about 1 month ago