coyo-dataset

Image Text Dataset

A large-scale image-text pair dataset designed to support training of foundation models in computer vision and natural language processing.

COYO-700M: Large-scale Image-Text Pair Dataset

GitHub

1k stars
15 watching
36 forks
Language: Python
last commit: almost 2 years ago

Related projects:

Repository | Description | Stars
--- | --- | ---
nightrome/cocostuff | Provides annotated image data and tools for semantic segmentation tasks | 837
jin-s13/coco-wholebody | A large-scale benchmark and dataset for whole-body pose estimation in images | 762
pku-yuangroup/open-sora-dataset | A large video dataset collected from various open-source websites for use in computer vision and multimedia applications | 93
yumingj/deepfashion-multimodal | A large-scale human image dataset with rich annotations for various applications such as image generation, pose estimation, and attribute recognition | 523
maluuba/geneva_datasets | Scripts to generate datasets for an image generation task using Generative Adversarial Networks and deep learning techniques | 37
ibm/max-image-caption-generator | An image caption generation system utilizing machine learning models and deep neural networks | 84
kastnerkyle/kaggle-dogs-vs-cats | A Python implementation of a machine learning solution for classifying images as dogs or cats from the Kaggle competition | 66
yeephycho/nasnet-tensorflow | A toolkit for training and deploying a state-of-the-art image classification architecture on TensorFlow | 136
sergioburdisso/pyss3 | A Python package implementing an interpretable machine learning model for text classification with visualization tools | 336
ryankiros/visual-semantic-embedding | A Python implementation of an image-sentence embedding method using LSTM networks | 426
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models | 919
felixgwu/img_classification_pk_pytorch | A PyTorch project for comparing image classification models and facilitating quick experiment setup | 365
ibm/max-resnet-50 | An image classification model using the ResNet-50 architecture, trained on the ImageNet dataset | 14
embodiedgpt/embodiedgpt_pytorch | A PyTorch-based toolkit for creating customized multimedia datasets and handling heterogeneous data for training AI models | 340
byungkwanlee/collavo | Develops a PyTorch implementation of an enhanced vision language model | 93