LongLLaVA
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture
151 stars
14 watching
9 forks
Language: Python
last commit: 11 days ago