vrd

Visual relation detector

This repository provides a framework for visual relationship detection using deep learning models and pre-processing tools.

two models for visual relationship detection

GitHub

94 stars

4 watching

29 forks

Language: Python

last commit: almost 7 years ago

Related projects:

Repository	Description	Stars
doubledaibo/drnet_cvpr2017	This project presents a deep neural network architecture designed to detect visual relationships in images.	202
prof-lu-cewu/visual-relationship-detection	Detects relationships and predicts predicates in images using language priors	214
msracver/relation-networks-for-object-detection	An implementation of an object detection framework that incorporates relation networks and deformable convolutions	1,096
nexusapoorvacus/deepvariationstructuredrl	An implementation of reinforcement learning for visual relationship and attribute detection using PyTorch.	63
damo-nlp-sg/vcd	An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs	222
yuwenxiong/py-r-fcn	An implementation of R-FCN, an object detection algorithm using region-based fully convolutional networks.	1,048
jiwei0921/dmra	A Python implementation of a depth-induced multi-scale recurrent attention network for RGB-D saliency detection	105
lz118/deep-correlation-network	Develops a deep neural network model for detecting salient objects in RGBT images using correlation information from other colors.	13
stanfordvl/referringrelationships	This repository provides code and tools for training and evaluating models of referring relationships in computer vision	260
kunpengli1994/vsrn	An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching	294
naver-ai/vidt	An object detection model that extends transformer-based technology to also support instance segmentation	307
mrwu-mac/r-bench	A repository providing preprocessed data and tools for evaluating and analyzing relationship hallucinations in large vision-language models.	20
google-research/visu3d	An abstraction layer between various deep learning frameworks and your program.	149
jxingzhao/contrastprior	A toolbox for detecting salient objects in RGBD images using a combination of contrast and pyramid integration techniques	51
xu-tianyang/ladcf	An open-source implementation of a visual object tracking algorithm using adaptive correlation filters and spatial feature selection	92