prismer

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

GitHub

1k stars
15 watching
75 forks
Language: Python
last commit: 9 months ago
image-captioninglanguage-modelmulti-modal-learningmulti-task-learningvision-and-languagevision-language-modelvqa