LISA
Image segmentation tool
A system that uses large language models to generate segmentation masks for images based on complex queries and world knowledge.
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
2k stars
11 watching
131 forks
Language: Python
Last commit: 5 months ago
Topics: large-language-model, llm, multi-modal, segmentation
Related projects:
Repository | Description | Stars |
---|---|---|
dvlab-research/llama-vid | A multimodal model that uses large language models to generate visual and text features from images and videos | 733 |
opengvlab/visionllm | A large language model designed to process and generate visual information | 915 |
thelegendali/deeplab-context | An implementation of a deep learning system for semantic image segmentation that combines convolutional neural networks with conditional random fields | 239 |
balcilar/drlse-image-segmentation | A method for image segmentation using level sets and a distance regularized term to avoid the need for re-initialization | 88 |
nvlabs/prismer | A deep learning framework for training multi-modal models with vision and language capabilities | 1,298 |
dvlab-research/llmga | An implementation of a multimodal generation assistant using large language models and various image editing techniques | 461 |
dvlab-research/prompt-highlighter | An interactive control system for text generation in multi-modal language models | 132 |
zhengpeng7/birefnet | An implementation of a deep learning-based image segmentation model for high-resolution images | 1,319 |
esa-philab/iris | A tool for manually segmenting images from satellite data with AI assistance | 140 |
abbypa/nnproject_deepmask | A deep learning implementation of an object segmentation algorithm | 187 |
yfzhang114/llava-align | Debiasing techniques to minimize hallucinations in large visual language models | 71 |
kreshuklab/plant-seg | A tool for cell instance-aware segmentation in densely packed 3D volumetric images | 97 |
evolvinglmms-lab/longva | A model for long-context transfer from language to vision | 334 |
labforcomputationalvision/matlabpyrtools | Tools for multi-scale image processing and analysis | 178 |
freedomintelligence/longllava | A system for scaling large language models to efficiently process and understand visual information from multiple images | 179 |