vrd

Visual relation detector

This repository provides a framework for visual relationship detection using deep learning models and pre-processing tools.

Two models for visual relationship detection.


94 stars
4 watching
29 forks
Language: Python
Last commit: about 6 years ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| doubledaibo/drnet_cvpr2017 | This project presents a deep neural network architecture designed to detect visual relationships in images | 202 |
| prof-lu-cewu/visual-relationship-detection | Detects relationships and predicts predicates in images using language priors | 214 |
| msracver/relation-networks-for-object-detection | An implementation of an object detection framework that incorporates relation networks and deformable convolutions | 1,096 |
| nexusapoorvacus/deepvariationstructuredrl | An implementation of reinforcement learning for visual relationship and attribute detection using PyTorch | 63 |
| damo-nlp-sg/vcd | An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs | 222 |
| yuwenxiong/py-r-fcn | An implementation of R-FCN, an object detection algorithm using region-based fully convolutional networks | 1,048 |
| jiwei0921/dmra | A Python implementation of a depth-induced multi-scale recurrent attention network for RGB-D saliency detection | 105 |
| lz118/deep-correlation-network | Develops a deep neural network model for detecting salient objects in RGBT images using correlation information from other colors | 13 |
| stanfordvl/referringrelationships | This repository provides code and tools for training and evaluating models of referring relationships in computer vision | 260 |
| kunpengli1994/vsrn | An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching | 294 |
| naver-ai/vidt | An object detection model that extends transformer-based technology to also support instance segmentation | 307 |
| mrwu-mac/r-bench | A repository providing preprocessed data and tools for evaluating and analyzing relationship hallucinations in large vision-language models | 20 |
| google-research/visu3d | An abstraction layer between various deep learning frameworks and your program | 149 |
| jxingzhao/contrastprior | A toolbox for detecting salient objects in RGBD images using a combination of contrast and pyramid integration techniques | 51 |
| xu-tianyang/ladcf | An open-source implementation of a visual object tracking algorithm using adaptive correlation filters and spatial feature selection | 92 |