LLaVA-Interactive-Demo

Image processor

An all-in-one demo for interactive image processing and generation

GitHub

351 stars
16 watching
26 forks
Language: Python
Last commit: 4 months ago
lmm, multimodal

Related projects:

| Repository | Description | Stars |
|---|---|---|
| dvlab-research/llama-vid | An image-based language model that uses large language models to generate visual and text features from videos | 733 |
| wisconsinaivision/vip-llava | A system designed to enable large multimodal models to understand arbitrary visual prompts | 294 |
| llava-vl/llava-plus-codebase | A platform for training and deploying large language and vision models that can use tools to perform tasks | 704 |
| ailab-cvc/seed | An implementation of a multimodal language model with capabilities for comprehension and generation | 576 |
| mlpc-ucsd/bliva | A multimodal LLM designed to handle text-rich visual questions | 269 |
| alibaba/conv-llava | This project presents an optimization technique for large-scale image models to reduce computational requirements while maintaining performance | 104 |
| vita-mllm/vita | A large multimodal language model designed to process and analyze video, image, text, and audio inputs in real-time | 961 |
| nvlabs/eagle | Develops high-resolution multimodal LLMs by combining vision encoders and various input resolutions | 539 |
| freedomintelligence/longllava | A system for scaling large language models to process and understand visual information from multiple images efficiently | 179 |
| dvlab-research/llmga | An implementation of a multimodal generation assistant using large language models and various image editing techniques | 461 |
| snunez1/llama.cl | A Common Lisp port of a Large Language Model (LLM) implementation | 35 |
| luispedro/mahotas | A library of fast computer vision algorithms implemented in C++ for speed, operating over numpy arrays | 844 |
| airaria/visual-chinese-llama-alpaca | Develops a multimodal Chinese language model with visual capabilities | 424 |
| libvips/lua-vips | A Lua binding for a fast image processing library with low memory needs | 127 |
| libav/libav | A collection of libraries and tools for processing multimedia content | 1,082 |