moondream

Image processor

A vision language model for image understanding tasks such as captioning and object detection, optimized for efficient deployment on resource-constrained hardware.

tiny vision language model

GitHub

6k stars
57 watching
502 forks
Language: Jupyter Notebook
last commit: about 2 months ago