SoM

Image marking tool

Enables visual grounding in large language models by overlaying spatial and speakable marks on images

Set-of-Mark Prompting for GPT-4V and LMMs
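
As a rough illustration of "spatial and speakable marks", the sketch below assigns a numeric label to each image region and computes where that label would be placed. It is not taken from the SoM repository. The `Region` class, the `assign_marks` and `describe_marks` helpers, and the use of plain bounding boxes instead of segmentation masks are all assumptions made for this example, and drawing the labels onto the image is left out.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Region:
    """Hypothetical axis-aligned region in pixel coordinates."""
    left: int
    top: int
    right: int
    bottom: int

    @property
    def center(self) -> Tuple[int, int]:
        # Integer midpoint of the box, used as the label anchor.
        return ((self.left + self.right) // 2, (self.top + self.bottom) // 2)


def assign_marks(regions: List[Region]) -> List[Tuple[int, Tuple[int, int]]]:
    """Pair each region with a numeric mark (1, 2, ...) in input order."""
    return [(label, region.center) for label, region in enumerate(regions, start=1)]


def describe_marks(marks: List[Tuple[int, Tuple[int, int]]]) -> str:
    """List the marks as text, so a prompt can refer to regions by number."""
    return "\n".join(f"[{label}] at (x={x}, y={y})" for label, (x, y) in marks)


# Two made-up regions standing in for the output of a segmentation step.
regions = [Region(10, 10, 50, 50), Region(100, 20, 300, 220)]
marks = assign_marks(regions)
print(describe_marks(marks))
```

Numbers are used here because they are easy to refer to in a text prompt ("what is object 2?"); any short, unambiguous label would serve the same purpose.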

GitHub

1k stars
23 watching
98 forks
Language: Python
Last commit: 4 months ago

Related projects:

Repository | Description | Stars
vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,786
opengvlab/visionllm | A large language model designed to process and generate visual information | 956
dvlab-research/lisa | A system that uses large language models to generate segmentation masks for images based on complex queries and world knowledge | 1,923
elliottd/groundedtranslation | Trains multilingual image description models using neural sequence models and extracts hidden features from trained models | 46
lzx1413/labelimgplus | An image annotation tool supporting various modes and formats, including CLS, DET, SEG, and PASCAL VOC | 211
tenkoh/goldmark-img64 | Automatically embeds image files as Base64-encoded data into rendered HTML from Markdown source | 0
jshilong/gpt4roi | Training and deploying large language models on computer vision tasks using region-of-interest inputs | 517
thomas-neitmann/mdthemes | Enables text rendering in popular data visualization packages using Markdown | 80
mnsignalprocessing/barelyml | A markup language that combines elements from Markdown and DokuWiki to display text with formatting and structure | 16
digitalglobe/mltools | Tools for building machine learning solutions on satellite imagery | 81
sweppner/labeld | A tool for annotating images with tags and categories | 134
vim-scripts/svg.vim | A Vim plugin for syntax highlighting of Scalable Vector Graphics (SVG) files | 9
lunacookies/vim-sh | Improves Vim's highlighting of shell scripts by providing enhanced syntax support | 7
mattonem/smalltalkenv | A LaTeX environment and code highlighting solution for showcasing Smalltalk code in documents | 8
sopyer/scintillagl | A C++ port of the Scintilla text editing control to OpenGL, enabling 3D rendering and interaction | 32