ContextDET

Object detection

An approach to detecting objects in images using multimodal large language models and contextual information

Contextual Object Detection with Multimodal Large Language Models

GitHub

208 stars
14 watching
5 forks
Language: Python
last commit: 2 months ago
Topics: large-language-model, object-detection

Related projects:

Repository | Description | Stars
yutingzhang/fgs-obj | Demonstrates object detection using fine-grained search and structured prediction techniques | 50
yknzhu/segdeepm | A tool for fine-tuning deep neural networks to improve object detection and segmentation capabilities by incorporating contextual information | 27
lartpang/ovcamo | Develops an object segmentation algorithm to detect camouflaged objects in images with varying backgrounds and contexts | 20
jingzhang617/cod-rank-localize-and-segment | Develops a system to detect, segment, and rank camouflaged objects in images | 74
damo-nlp-sg/vcd | An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs | 222
taokong/ron | A deep learning framework for object detection tasks using a novel neural network architecture | 355
jxr326/swinmcnet | A Python-based object detection framework utilizing transformers and computer vision techniques to detect salient objects in RGB-thermal images | 16
szq0214/dsod | A deep learning-based object detection system from scratch | 706
yuwenxiong/py-r-fcn | An implementation of R-FCN, an object detection algorithm using region-based fully convolutional networks | 1,048
xingyizhou/extremenet | An object detection framework that uses extreme and center points to detect objects in images | 1,032
bharath272/sds_eccv2014 | A Matlab implementation of simultaneous object detection and segmentation using deep learning techniques | 96
avanetten/simrdwn | An object detection system using rapid satellite imagery and deep learning algorithms | 216
ibm/max-object-detector | A tool for detecting objects in images using deep learning models | 290
naver-ai/vidt | An object detection model that extends transformer-based technology to also support instance segmentation | 307
chanyn/hkrm | Develops a deep learning model for large-scale object detection that leverages hybrid knowledge and routing mechanisms | 105