awesome-vlm-architectures
VLMs
Documentation for famous Vision Language Models and their architectures
Famous Vision Language Models and Their Architectures
500 stars
12 watching
25 forks
Language: Markdown
last commit: about 1 year ago awesomeawesome-listblipclipcogvlmimage-encoderinternlmkosmosllavamultimodalqwen-vltext-encodervision-language-modelvlm
👁️🗨️Awesome VLM Architectures | |||
| Visit my other repo to try Vision Language Models on ComfyUI | 430 | about 1 year ago | 📙 |
👁️🗨️Awesome VLM Architectures / Important References | |||
| Guide to Vision-Language Models (VLMs) by Görkem Polat | |||
| VLM Primer by Aman Chadha | |||
| Generalized Visual Language Models by Lilian Weng | |||