vad
VAD system
An audio processing system that uses Deep Neural Networks and feature fusion to detect voice activity in speech recordings.
Voice Activity Detection system (Matlab-based implementation)
105 stars
11 watching
48 forks
Language: Matlab
last commit: over 7 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
zhenghuatan/rvad | An unsupervised method for detecting speech activity in noisy audio signals | 128 |
yashdv/speech-recognition | A Matlab code to recognize individuals based on their unique vocal characteristics. | 40 |
wiseman/py-webrtcvad | A Python interface to the WebRTC Voice Activity Detector | 2,066 |
matlab-deep-learning/wav2vec-2.0 | Enables speech-to-text transcription using a pre-trained neural network model in MATLAB. | 8 |
vocalpy/vak | A Python framework for training and applying neural networks to acoustic communication research | 78 |
damo-nlp-sg/vcd | An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs | 209 |
cvde/roomreverb | An audio plugin that simulates the effect of reverb in recordings | 110 |
gsoh/ved | A large-scale dataset for fuel and energy use of vehicles in real-world conditions. | 91 |
cvondrick/vatic | Tools for efficiently scaling up video annotation using crowdsourced marketplaces. | 607 |
vsitzmann/siren | An implementation of a neural network architecture for implicit function representation learning using periodic activation functions. | 1,756 |
vehicle-lang/vehicle | A toolkit for enforcing logical specifications on neural networks | 80 |
ahunnargikar/vagrant-mesos | A Vagrant setup to create a Mesos/Docker/Marathon/Aurora/ Jenkins development environment for testing and development. | 122 |
marvinteichmann/multinet | An autonomous driving system that performs real-time road segmentation, car detection, and street classification using deep learning models. | 548 |
vadymmarkov/beethoven | A Swift library providing an interface to pitch detection in audio signals. | 827 |
ksw0306/clarinet | An implementation of a neural network-based vocoder using parallel-wavenet architecture and autoregressive flow | 289 |