coyo-dataset
Image Text Dataset
A large-scale image-text pair dataset designed to support training of foundation models in computer vision and natural language processing.
COYO-700M: Large-scale Image-Text Pair Dataset
1k stars
15 watching
36 forks
Language: Python
Last commit: almost 2 years ago
Related projects:
Repository | Description | Stars |
---|---|---|
nightrome/cocostuff | Provides annotated image data and tools for semantic segmentation tasks | 837 |
jin-s13/coco-wholebody | A large-scale benchmark and dataset for whole-body pose estimation in images | 762 |
pku-yuangroup/open-sora-dataset | A large video dataset collected from various open-source websites for use in computer vision and multimedia applications. | 93 |
yumingj/deepfashion-multimodal | A large-scale human image dataset with rich annotations for various applications such as image generation, pose estimation, and attribute recognition. | 523 |
maluuba/geneva_datasets | Scripts to generate datasets for an image generation task using Generative Adversarial Networks and deep learning techniques | 37 |
ibm/max-image-caption-generator | An image caption generation system utilizing machine learning models and deep neural networks. | 84 |
kastnerkyle/kaggle-dogs-vs-cats | A Python implementation of a machine learning solution for classifying images as dogs or cats from the Kaggle competition. | 66 |
yeephycho/nasnet-tensorflow | A toolkit for training and deploying a state-of-the-art image classification architecture on TensorFlow | 136 |
sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336 |
ryankiros/visual-semantic-embedding | A Python implementation of an image-sentence embedding method using LSTM networks. | 426 |
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
felixgwu/img_classification_pk_pytorch | A PyTorch project for comparing image classification models and facilitating quick experiment setup | 365 |
ibm/max-resnet-50 | An image classification model using the ResNet-50 architecture, trained on the ImageNet dataset. | 14 |
embodiedgpt/embodiedgpt_pytorch | A PyTorch-based toolkit for creating customized multimedia datasets and handling heterogeneous data for training AI models. | 340 |
byungkwanlee/collavo | Develops a PyTorch implementation of an enhanced vision language model | 93 |