LLaVAR

Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"

GitHub

254 stars
5 watching
12 forks
Language: Python
last commit: 4 months ago
chatbotchatgptgpt-4instruction-tuningllavamultimodalocrvision-and-language