BLIVA
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
263 stars
12 watching
27 forks
Language: Python
last commit: 6 months ago blip2blivachatbotinstruction-tuningllamallmloramultimodalvisual-language-learning