prismer
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
1k stars
15 watching
75 forks
Language: Python
last commit: 9 months ago image-captioninglanguage-modelmulti-modal-learningmulti-task-learningvision-and-languagevision-language-modelvqa