LISA
Image segmentation tool
A system that uses large language models to generate segmentation masks for images based on complex queries and world knowledge.
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
2k stars
11 watching
131 forks
Language: Python
last commit: 8 months ago large-language-modelllmmulti-modalsegmentation
Related projects:
Repository | Description | Stars |
---|---|---|
| An image-based language model that uses large language models to generate visual and text features from videos | 748 |
| A large language model designed to process and generate visual information | 956 |
| An implementation of a deep learning system for semantic image segmentation using a combination of convolutional neural networks and conditional random fields. | 239 |
| A method for image segmentation using level sets and a distance regularized term to avoid the need for re-initialization | 89 |
| A deep learning framework for training multi-modal models with vision and language capabilities. | 1,299 |
| An implementation of a multimodal generation assistant using large language models and various image editing techniques. | 463 |
| An interactive control system for text generation in multi-modal language models | 135 |
| An open-source implementation of an image segmentation model that combines background removal and object detection capabilities. | 1,484 |
| A tool for manually segmenting images from satellite data with AI assistance | 141 |
| A deep learning implementation of an object segmentation algorithm. | 187 |
| Debiasing techniques to minimize hallucinations in large visual language models | 75 |
| A tool for cell instance aware segmentation in densely packed 3D volumetric images | 101 |
| An open-source project that enables the transfer of language understanding to vision capabilities through long context processing. | 347 |
| Tools for multi-scale image processing and analysis | 180 |
| A system for scaling large language models to process and understand visual information from multiple images efficiently. | 183 |