vad

VAD system

An audio processing system that uses Deep Neural Networks and feature fusion to detect voice activity in speech recordings.

Voice Activity Detection system (Matlab-based implementation)

GitHub

105 stars
11 watching
48 forks
Language: Matlab
last commit: over 7 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
zhenghuatan/rvad An unsupervised method for detecting speech activity in noisy audio signals 130
yashdv/speech-recognition A Matlab code to recognize individuals based on their unique vocal characteristics. 40
wiseman/py-webrtcvad A Python interface to the WebRTC Voice Activity Detector 2,088
matlab-deep-learning/wav2vec-2.0 Enables speech-to-text transcription using a pre-trained neural network model in MATLAB. 7
vocalpy/vak A Python framework for training and applying neural networks to acoustic communication research 78
damo-nlp-sg/vcd An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs 222
cvde/roomreverb A software tool for adding algorithmic reverb to audio recordings 111
gsoh/ved A large-scale dataset capturing vehicle energy consumption and usage patterns in real-world driving scenarios. 94
cvondrick/vatic Tools for efficiently scaling up video annotation using crowdsourced marketplaces. 609
vsitzmann/siren An implementation of a neural network architecture for implicit function representation learning using periodic activation functions. 1,776
vehicle-lang/vehicle A toolkit for enforcing logical specifications on neural networks 82
ahunnargikar/vagrant-mesos A Vagrant setup to create a Mesos/Docker/Marathon/Aurora/ Jenkins development environment for testing and development. 122
marvinteichmann/multinet An autonomous driving system that performs real-time road segmentation, car detection, and street classification using deep learning models. 549
vadymmarkov/beethoven A Swift library providing an interface to pitch detection in audio signals. 828
ksw0306/clarinet An implementation of a neural network-based vocoder using parallel-wavenet architecture and autoregressive flow 290