vad

VAD system

An audio processing system that uses Deep Neural Networks and feature fusion to detect voice activity in speech recordings.

Voice Activity Detection system (Matlab-based implementation)

GitHub

105 stars
11 watching
48 forks
Language: Matlab
last commit: over 7 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
zhenghuatan/rvad An unsupervised method for detecting speech activity in noisy audio signals 128
yashdv/speech-recognition A Matlab code to recognize individuals based on their unique vocal characteristics. 40
wiseman/py-webrtcvad A Python interface to the WebRTC Voice Activity Detector 2,066
matlab-deep-learning/wav2vec-2.0 Enables speech-to-text transcription using a pre-trained neural network model in MATLAB. 8
vocalpy/vak A Python framework for training and applying neural networks to acoustic communication research 78
damo-nlp-sg/vcd An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs 209
cvde/roomreverb An audio plugin that simulates the effect of reverb in recordings 110
gsoh/ved A large-scale dataset for fuel and energy use of vehicles in real-world conditions. 91
cvondrick/vatic Tools for efficiently scaling up video annotation using crowdsourced marketplaces. 607
vsitzmann/siren An implementation of a neural network architecture for implicit function representation learning using periodic activation functions. 1,756
vehicle-lang/vehicle A toolkit for enforcing logical specifications on neural networks 80
ahunnargikar/vagrant-mesos A Vagrant setup to create a Mesos/Docker/Marathon/Aurora/ Jenkins development environment for testing and development. 122
marvinteichmann/multinet An autonomous driving system that performs real-time road segmentation, car detection, and street classification using deep learning models. 548
vadymmarkov/beethoven A Swift library providing an interface to pitch detection in audio signals. 827
ksw0306/clarinet An implementation of a neural network-based vocoder using parallel-wavenet architecture and autoregressive flow 289