LLaVA-Interactive-Demo
Image processor
An all-in-one demo for interactive image processing and generation
LLaVA-Interactive-Demo
353 stars
16 watching
27 forks
Language: Python
last commit: 6 months ago lmmmultimodal
Related projects:
Repository | Description | Stars |
---|---|---|
dvlab-research/llama-vid | An image-based language model that uses large language models to generate visual and text features from videos | 748 |
wisconsinaivision/vip-llava | A system designed to enable large multimodal models to understand arbitrary visual prompts | 302 |
llava-vl/llava-plus-codebase | A platform for training and deploying large language and vision models that can use tools to perform tasks | 717 |
ailab-cvc/seed | An implementation of a multimodal language model with capabilities for comprehension and generation | 585 |
mlpc-ucsd/bliva | A multimodal LLM designed to handle text-rich visual questions | 270 |
alibaba/conv-llava | This project presents an optimization technique for large-scale image models to reduce computational requirements while maintaining performance. | 106 |
vita-mllm/vita | A large multimodal language model designed to process and analyze video, image, text, and audio inputs in real-time. | 1,005 |
nvlabs/eagle | Develops high-resolution multimodal LLMs by combining vision encoders and various input resolutions | 549 |
freedomintelligence/longllava | A system for scaling large language models to process and understand visual information from multiple images efficiently. | 183 |
dvlab-research/llmga | An implementation of a multimodal generation assistant using large language models and various image editing techniques. | 463 |
snunez1/llama.cl | A Common Lisp port of a Large Language Model (LLM) implementation | 36 |
luispedro/mahotas | A library of fast computer vision algorithms implemented in C++ for speed, operating over numpy arrays. | 855 |
airaria/visual-chinese-llama-alpaca | Develops a multimodal Chinese language model with visual capabilities | 429 |
libvips/lua-vips | A Lua binding for a fast image processing library with low memory needs. | 129 |
libav/libav | A collection of libraries and tools for processing multimedia content | 1,086 |