vrd
Visual relation detector
This repository provides a framework for visual relationship detection using deep learning models and pre-processing tools.
two models for visual relationship detection
94 stars
4 watching
29 forks
Language: Python
last commit: about 6 years ago Related projects:
Repository | Description | Stars |
---|---|---|
doubledaibo/drnet_cvpr2017 | This project presents a deep neural network architecture designed to detect visual relationships in images. | 202 |
prof-lu-cewu/visual-relationship-detection | Detects relationships and predicts predicates in images using language priors | 214 |
msracver/relation-networks-for-object-detection | An implementation of an object detection framework that incorporates relation networks and deformable convolutions | 1,096 |
nexusapoorvacus/deepvariationstructuredrl | An implementation of reinforcement learning for visual relationship and attribute detection using PyTorch. | 63 |
damo-nlp-sg/vcd | An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs | 222 |
yuwenxiong/py-r-fcn | An implementation of R-FCN, an object detection algorithm using region-based fully convolutional networks. | 1,048 |
jiwei0921/dmra | A Python implementation of a depth-induced multi-scale recurrent attention network for RGB-D saliency detection | 105 |
lz118/deep-correlation-network | Develops a deep neural network model for detecting salient objects in RGBT images using correlation information from other colors. | 13 |
stanfordvl/referringrelationships | This repository provides code and tools for training and evaluating models of referring relationships in computer vision | 260 |
kunpengli1994/vsrn | An open-source PyTorch implementation of a visual semantic reasoning model for image-text matching | 294 |
naver-ai/vidt | An object detection model that extends transformer-based technology to also support instance segmentation | 307 |
mrwu-mac/r-bench | A repository providing preprocessed data and tools for evaluating and analyzing relationship hallucinations in large vision-language models. | 20 |
google-research/visu3d | An abstraction layer between various deep learning frameworks and your program. | 149 |
jxingzhao/contrastprior | A toolbox for detecting salient objects in RGBD images using a combination of contrast and pyramid integration techniques | 51 |
xu-tianyang/ladcf | An open-source implementation of a visual object tracking algorithm using adaptive correlation filters and spatial feature selection | 92 |