LISA

Image segmentation tool

A system that uses large language models to generate segmentation masks for images based on complex queries and world knowledge.

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

GitHub

2k stars
11 watching
131 forks
Language: Python
last commit: 5 months ago
Topics: large-language-model, llm, multi-modal, segmentation

Related projects:

Repository | Description | Stars
dvlab-research/llama-vid | An image-based language model that uses large language models to generate visual and text features from videos | 733
opengvlab/visionllm | A large language model designed to process and generate visual information | 915
thelegendali/deeplab-context | An implementation of a deep learning system for semantic image segmentation using a combination of convolutional neural networks and conditional random fields | 239
balcilar/drlse-image-segmentation | A method for image segmentation using level sets and a distance regularized term to avoid the need for re-initialization | 88
nvlabs/prismer | A deep learning framework for training multi-modal models with vision and language capabilities | 1,298
dvlab-research/llmga | An implementation of a multimodal generation assistant using large language models and various image editing techniques | 461
dvlab-research/prompt-highlighter | An interactive control system for text generation in multi-modal language models | 132
zhengpeng7/birefnet | An implementation of a deep learning-based image segmentation model for high-resolution images | 1,319
esa-philab/iris | A tool for manually segmenting images from satellite data with AI assistance | 140
abbypa/nnproject_deepmask | A deep learning implementation of an object segmentation algorithm | 187
yfzhang114/llava-align | Debiasing techniques to minimize hallucinations in large visual language models | 71
kreshuklab/plant-seg | A tool for cell instance aware segmentation in densely packed 3D volumetric images | 97
evolvinglmms-lab/longva | A model for long context transfer from language to vision using a deep learning framework | 334
labforcomputationalvision/matlabpyrtools | Tools for multi-scale image processing and analysis | 178
freedomintelligence/longllava | A system for scaling large language models to process and understand visual information from multiple images efficiently | 179