visual-concepts
Visual concept detection framework
This codebase provides a framework for detecting visual concepts in images by leveraging image captions and pre-trained models.
Code for detecting visual concepts in images.
150 stars
13 watching
57 forks
Language: Python
last commit: almost 8 years ago Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. | 1,336 |
| | A computer vision library providing flexible algorithms and generic data structures for image analysis. | 412 |
| | Provides a basic framework for training deep learning models on image classification tasks using PyTorch | 187 |
| | Learning word embeddings from abstract images to improve language understanding | 19 |
| | A PyTorch project for comparing image classification models and facilitating quick experiment setup | 366 |
| | This repository provides a framework for visual relationship detection using deep learning models and pre-processing tools. | 94 |
| | An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs | 222 |
| | Develops an object segmentation algorithm to detect camouflaged objects in images with varying backgrounds and contexts. | 20 |
| | A flexible PyTorch image classification framework for rapid research exploration and model evaluation. | 2,141 |
| | A collection of MATLAB code examples for various digital image processing tasks. | 37 |
| | Detects relationships and predicts predicates in images using language priors | 214 |
| | Provides code and data for an image retrieval benchmark that tests contextual understanding of text descriptions with images. | 39 |
| | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 |
| | Developing and testing AI algorithms to detect synthetic images generated by new media synthesis models like StyleGAN3. | 129 |
| | Recurrent neural network designed to detect code blocks in text. | 12 |