DALI

Audio Dataset

A large dataset of synchronized audio, lyrics, and vocal notes created using machine learning

DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.

GitHub

349 stars
11 watching
34 forks
Language: Python
last commit: over 4 years ago
Linked from 1 awesome list

Tags: dataset, deep-learning, ismir, music-information-retrieval, singing-voice, teacher-student

Related projects:

Repository | Description | Stars
kyubyong/css10 | A collection of speech datasets for 10 languages to support text-to-speech tasks | 465
dylanmeeus/goaudio | An audio processing library that provides tools for creating and manipulating waveforms | 351
nvidia/dataset_synthesizer | Generates synthetic images and associated data for training deep learning models | 573
ynop/audiomate | A Python library for handling audio datasets, providing tools for accessing, manipulating, and preparing data for machine learning tasks | 131
dbd-research-group/birdset | A comprehensive benchmark dataset collection for audio classification in avian bioacoustics, aiming to advance bird sound classification by providing diverse real-world evaluation use cases | 25
google-research/cad-estate | A large dataset of 3D object and room layout annotations on RGB videos, designed to test automatic scene understanding methods | 105
switchablenorms/celebamask-hq | A large-scale face image dataset for training and evaluating algorithms in face parsing, recognition, generation, and editing | 2,123
gopherdata/resources | A collection of Go-based resources and tools for data science tasks | 876
rosejn/torch-datasets | A collection of pre-processed machine learning datasets for use with the Torch7 deep learning framework | 37
philipperemy/timit | A collection of acoustic and phonetic speech data designed for training and evaluating automatic speech recognition systems | 294
nimrodpar/labeled-elfs | Provides labeled ELF binaries for research and testing purposes | 86
charleswyt/audio_steganalysis_ml | An audio steganalysis tool utilizing statistical machine learning and handcrafted features to detect hidden messages in audio files | 35
soerenab/audiomnist | This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques | 347
fwang91/imdb-face | A large-scale noise-controlled face recognition dataset designed to study the impact of data noise on recognition accuracy | 431
gorkemalgan/deep_learning_with_noisy_labels_literature | A collection of papers and repos on deep learning with noisy labels | 235