DALI
Audio Dataset
A large dataset of synchronized audio, lyrics, and vocal notes created using machine learning
DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.
351 stars
11 watching
34 forks
Language: Python
last commit: over 4 years ago
Linked from 1 awesome list
datasetdeep-learningismirmusic-information-retrievalsinging-voiceteacher-student
Related projects:
Repository | Description | Stars |
---|---|---|
kyubyong/css10 | A collection of speech datasets for 10 languages to support text-to-speech tasks | 467 |
dylanmeeus/goaudio | An audio processing library that provides tools for creating and manipulating waveforms | 353 |
nvidia/dataset_synthesizer | Generates synthetic images and associated data for training deep learning models | 574 |
ynop/audiomate | A Python library for handling audio datasets, providing tools for accessing, manipulating, and preparing data for machine learning tasks. | 133 |
dbd-research-group/birdset | A collection of audio classification datasets for bird sound recognition, including data preparation tools and model training support. | 29 |
google-research/cad-estate | A large dataset of 3D object and room layout annotations on RGB videos, designed to test automatic scene understanding methods. | 106 |
switchablenorms/celebamask-hq | A large-scale face image dataset for training and evaluating algorithms in face parsing, recognition, generation, and editing. | 2,136 |
gopherdata/resources | A collection of Go-based resources and tools for data science tasks | 879 |
rosejn/torch-datasets | A collection of pre-processed machine learning datasets for use with the Torch7 deep learning framework. | 37 |
philipperemy/timit | A collection of acoustic and phonetic speech data designed for training and evaluating automatic speech recognition systems | 297 |
nimrodpar/labeled-elfs | Provides labeled ELF binaries for research and testing purposes. | 87 |
charleswyt/audio_steganalysis_ml | An implementation of an audio steganalysis system using statistical machine learning and handcrafted features. | 35 |
soerenab/audiomnist | This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. | 351 |
fwang91/imdb-face | A large-scale noise-controlled face recognition dataset designed to study the impact of data noise on recognition accuracy. | 433 |
gorkemalgan/deep_learning_with_noisy_labels_literature | A collection of papers and repos on deep learning with noisy labels. | 235 |