visual-concepts

Visual concept detection framework

This codebase provides a framework for detecting visual concepts in images by leveraging image captions and pre-trained models.

Code for detecting visual concepts in images.

GitHub

151 stars
13 watching
57 forks
Language: Python
last commit: over 6 years ago

Related projects:

Repository Description Stars
lxtgh/omg-seg Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. 1,300
ukoethe/vigra A computer vision library providing flexible algorithms and generic data structures for image analysis. 412
cmsflash/beauty-net Provides a basic framework for training deep learning models on image classification tasks using PyTorch 187
satwikkottur/visualword2vec Learning word embeddings from abstract images to improve language understanding 19
felixgwu/img_classification_pk_pytorch A PyTorch project for comparing image classification models and facilitating quick experiment setup 365
yangxuntu/vrd This repository provides a framework for visual relationship detection using deep learning models and pre-processing tools. 94
damo-nlp-sg/vcd An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs 209
lartpang/ovcamo Develops an object segmentation algorithm to detect camouflaged objects in images with varying backgrounds and contexts. 20
facebookresearch/pycls A flexible PyTorch image classification framework for rapid research exploration and model evaluation. 2,138
jaiverma/matlab A collection of MATLAB code examples for various digital image processing tasks. 36
prof-lu-cewu/visual-relationship-detection Detects relationships and predicts predicates in images using language priors 214
mcgill-nlp/imagecode Provides code and data for an image retrieval benchmark that tests contextual understanding of text descriptions with images. 39
sergioburdisso/pyss3 A Python package implementing an interpretable machine learning model for text classification with visualization tools 336
nvlabs/stylegan3-detector An initiative to improve AI forensics algorithms for detecting synthetic media, focusing on the challenges presented by new image generator models. 129
vmarkovtsev/codeneuron Recurrent neural network designed to detect code blocks in text. 13