BLIVA

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

GitHub

263 stars
12 watching
27 forks
Language: Python
last commit: 6 months ago
blip2blivachatbotinstruction-tuningllamallmloramultimodalvisual-language-learning