awesome-vlm-architectures
VLMs
Documentation for famous Vision Language Models and their architectures
Famous Vision Language Models and Their Architectures
500 stars
12 watching
25 forks
Language: Markdown
last commit: 6 months ago
Topics: awesome, awesome-list, blip, clip, cogvlm, image-encoder, internlm, kosmos, llava, multimodal, qwen-vl, text-encoder, vision-language-model, vlm
👁️🗨️ Awesome VLM Architectures
Visit my other repo to try Vision Language Models on ComfyUI | 430 | 4 months ago | 📙 |
👁️🗨️ Awesome VLM Architectures / Important References
Guide to Vision-Language Models (VLMs) by Görkem Polat
VLM Primer by Aman Chadha
Generalized Visual Language Models by Lilian Weng