visual-concepts
Visual concept detection framework
This codebase provides a framework for detecting visual concepts in images by leveraging image captions and pre-trained models.
Code for detecting visual concepts in images.
151 stars
13 watching
57 forks
Language: Python
Last commit: over 6 years ago
Related projects:
Repository | Description | Stars |
---|---|---|
lxtgh/omg-seg | Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. | 1,300 |
ukoethe/vigra | A computer vision library providing flexible algorithms and generic data structures for image analysis. | 412 |
cmsflash/beauty-net | Provides a basic framework for training deep learning models on image classification tasks using PyTorch. | 187 |
satwikkottur/visualword2vec | Learns word embeddings from abstract images to improve language understanding. | 19 |
felixgwu/img_classification_pk_pytorch | A PyTorch project for comparing image classification models and facilitating quick experiment setup. | 365 |
yangxuntu/vrd | This repository provides a framework for visual relationship detection using deep learning models and pre-processing tools. | 94 |
damo-nlp-sg/vcd | An approach to reducing object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs. | 209 |
lartpang/ovcamo | Develops an object segmentation algorithm to detect camouflaged objects in images with varying backgrounds and contexts. | 20 |
facebookresearch/pycls | A flexible PyTorch image classification framework for rapid research exploration and model evaluation. | 2,138 |
jaiverma/matlab | A collection of MATLAB code examples for various digital image processing tasks. | 36 |
prof-lu-cewu/visual-relationship-detection | Detects relationships and predicts predicates in images using language priors. | 214 |
mcgill-nlp/imagecode | Provides code and data for an image retrieval benchmark that tests contextual understanding of text descriptions with images. | 39 |
sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification, with visualization tools. | 336 |
nvlabs/stylegan3-detector | An initiative to improve AI forensics algorithms for detecting synthetic media, focusing on the challenges presented by new image generator models. | 129 |
vmarkovtsev/codeneuron | Recurrent neural network designed to detect code blocks in text. | 13 |