DeepFashion-MultiModal
Human image dataset
A large-scale human image dataset with rich annotations for various applications such as image generation, pose estimation, and attribute recognition.
A large-scale high-quality human dataset with rich multi-modal annotations
531 stars
9 watching
34 forks
last commit: about 2 years ago
Linked from 1 awesome list
siggraph2022
Related projects:
Repository | Description | Stars |
---|---|---|
| A collection of detailed pixel-wise annotations for fashion images used in human parsing research. | 213 |
| A large dataset of human matting images and corresponding results for training person segmentation models. | 615 |
| An approach to detecting objects in images using multimodal large language models and contextual information | 208 |
| An end-to-end image captioning system that uses large multi-modal models and provides tools for training, inference, and demo usage. | 1,849 |
| A large-scale noise-controlled face recognition dataset designed to study the impact of data noise on recognition accuracy. | 433 |
| Provides a toolbox for loading, visualizing, and evaluating a dataset of images with human annotations, including depth layers and age group classification. | 140 |
| A system that generates high-fidelity 3D models of clothed humans from images, combining explicit and implicit representations. | 1,115 |
| A computer vision library for detecting and tracking human presence in images and videos using convolutional neural networks. | 1,800 |
| Provides a dataset and tools for evaluating computer vision tasks in precision agriculture | 136 |
| A large-scale benchmark and dataset for whole-body pose estimation in images | 770 |
| A collection of standard datasets used to evaluate the performance of visual features in computer vision | 8 |
| Provides images of human faces with annotated age, gender, pose, and other attributes for testing age transformation algorithms. | 262 |
| An open-source implementation of an image segmentation model that combines background removal and object detection capabilities. | 1,484 |
| A large-scale face image dataset for training and evaluating algorithms in face parsing, recognition, generation, and editing. | 2,136 |
| Generates synthetic images and associated data for training deep learning models | 574 |