SoM

Image marking tool

Enables visual grounding in large multimodal models such as GPT-4V by overlaying spatial, speakable marks on images.

Set-of-Mark Prompting for GPT-4V and LMMs
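
To make the idea of "speakable marks" concrete, here is a minimal sketch in Python. It is not the repository's code: it assumes region bounding boxes are already available (in practice they would come from a segmentation model), and the names `Mark`, `assign_marks`, and `describe_marks` are hypothetical. It only computes where each numeric label would be placed and produces a text listing that could accompany the marked image in a prompt.

```python
from dataclasses import dataclass


@dataclass
class Mark:
    label: str  # speakable identifier, e.g. "1"
    x: int      # pixel column where the label would be drawn
    y: int      # pixel row where the label would be drawn


def assign_marks(boxes):
    """Give each (left, top, right, bottom) box a numeric label at its centre.

    Boxes are numbered by decreasing area. That ordering is an arbitrary
    choice for this sketch, not something taken from the SoM project.
    """
    def area(box):
        left, top, right, bottom = box
        return (right - left) * (bottom - top)

    ordered = sorted(boxes, key=area, reverse=True)
    marks = []
    for index, (left, top, right, bottom) in enumerate(ordered, start=1):
        centre_x = (left + right) // 2
        centre_y = (top + bottom) // 2
        marks.append(Mark(str(index), centre_x, centre_y))
    return marks


def describe_marks(marks):
    """Return one line per mark, suitable for inclusion in a text prompt."""
    return "\n".join(f"Mark {m.label} at ({m.x}, {m.y})" for m in marks)


if __name__ == "__main__":
    # Hypothetical regions; real ones would come from a segmentation model.
    regions = [(10, 10, 50, 50), (0, 0, 200, 100)]
    print(describe_marks(assign_marks(regions)))
```

Drawing the labels onto the image itself would need an imaging library and is left out so that the sketch runs with the standard library alone.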

GitHub

1k stars
22 watching
95 forks
Language: Python
Last commit: 3 months ago

Related projects:

| Repository | Description | Stars |
|---|---|---|
| vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,782 |
| opengvlab/visionllm | A large language model designed to process and generate visual information | 915 |
| dvlab-research/lisa | A system that uses large language models to generate segmentation masks for images based on complex queries and world knowledge | 1,861 |
| elliottd/groundedtranslation | Trains multilingual image description models using neural sequence models and extracts hidden features from trained models | 46 |
| lzx1413/labelimgplus | An image annotation tool supporting various modes and formats, including CLS, DET, SEG, and PASCAL VOC | 211 |
| tenkoh/goldmark-img64 | An extension to goldmark that embeds local images into HTML as base64-encoded data | 0 |
| jshilong/gpt4roi | Training and deploying large language models on computer vision tasks using region-of-interest inputs | 506 |
| thomas-neitmann/mdthemes | Enables text rendering in popular data visualization packages using markdown | 80 |
| mnsignalprocessing/barelyml | A markup language that combines elements from Markdown and DokuWiki to display text with formatting and structure | 15 |
| digitalglobe/mltools | Tools for building machine learning solutions on satellite imagery | 82 |
| sweppner/labeld | A tool for annotating images with tags and categories | 134 |
| vim-scripts/svg.vim | A Vim plugin for syntax highlighting of Scalable Vector Graphics (SVG) files | 9 |
| lunacookies/vim-sh | Improves Vim's highlighting of shell scripts by providing enhanced syntax support | 7 |
| mattonem/smalltalkenv | A LaTeX environment and code highlighting solution for showcasing Smalltalk code in documents | 8 |
| sopyer/scintillagl | A C++ port of the Scintilla text editing control to OpenGL, enabling 3D rendering and interaction | 32 |