laion-datasets
AI dataset collection
A collection of large datasets for training and testing AI models, aimed at improving image-text matching.
Descriptions of and pointers to the LAION datasets
239 stars
6 watching
9 forks
Language: HTML
Last commit: over 2 years ago
Related projects:
Repository | Description | Stars |
---|---|---|
| Provides pre-trained face detection and analysis models using large-scale image-text data | 281 |
| A comprehensive collection of datasets from various AI-related sources worldwide. | 46 |
| Evaluates and compares the performance of various CLIP-like models on different tasks and datasets. | 632 |
| A library for learning audio embeddings from text and audio data using contrastive language-audio pretraining | 1,457 |
| Predicts aesthetic quality of images using CLIP model embeddings | 491 |
| Provides a collection of system log datasets for AI-driven analytics research. | 1,883 |
| A large dataset of human matting images and corresponding results for training person segmentation models. | 615 |
| A collection of Urdu language datasets for various NLP tasks and applications | 71 |
| A comprehensive resource for learning and exploring Artificial Intelligence (AI) concepts and applications | 1,667 |
| A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
| Detects whether a given text sequence is part of the training data used to train a large language model. | 23 |
| A command-line interface to generate textual datasets with Large Language Models | 293 |
| A collection of detailed pixel-wise annotations for fashion images used in human parsing research. | 213 |
| A collection of language resources extracted from publicly available sources. | 7 |
| An open-source toolkit for building and evaluating large language models | 267 |