LLaVA-Interactive-Demo

Image processor

An all-in-one demo for interactive image processing and generation

LLaVA-Interactive-Demo

GitHub

353 stars
16 watching
27 forks
Language: Python
last commit: 6 months ago
lmmmultimodal

Related projects:

Repository Description Stars
dvlab-research/llama-vid An image-based language model that uses large language models to generate visual and text features from videos 748
wisconsinaivision/vip-llava A system designed to enable large multimodal models to understand arbitrary visual prompts 302
llava-vl/llava-plus-codebase A platform for training and deploying large language and vision models that can use tools to perform tasks 717
ailab-cvc/seed An implementation of a multimodal language model with capabilities for comprehension and generation 585
mlpc-ucsd/bliva A multimodal LLM designed to handle text-rich visual questions 270
alibaba/conv-llava This project presents an optimization technique for large-scale image models to reduce computational requirements while maintaining performance. 106
vita-mllm/vita A large multimodal language model designed to process and analyze video, image, text, and audio inputs in real-time. 1,005
nvlabs/eagle Develops high-resolution multimodal LLMs by combining vision encoders and various input resolutions 549
freedomintelligence/longllava A system for scaling large language models to process and understand visual information from multiple images efficiently. 183
dvlab-research/llmga An implementation of a multimodal generation assistant using large language models and various image editing techniques. 463
snunez1/llama.cl A Common Lisp port of a Large Language Model (LLM) implementation 36
luispedro/mahotas A library of fast computer vision algorithms implemented in C++ for speed, operating over numpy arrays. 855
airaria/visual-chinese-llama-alpaca Develops a multimodal Chinese language model with visual capabilities 429
libvips/lua-vips A Lua binding for a fast image processing library with low memory needs. 129
libav/libav A collection of libraries and tools for processing multimedia content 1,086