vad

VAD system

An audio processing system that uses Deep Neural Networks and feature fusion to detect voice activity in speech recordings.

Voice Activity Detection system (Matlab-based implementation)

GitHub

105 stars

11 watching

48 forks

Language: Matlab

last commit: about 8 years ago

Linked from 1 awesome list

Backlinks from these awesome lists:

uhub/awesome-matlab

Related projects:

Repository	Description	Stars
zhenghuatan/rvad	An unsupervised method for detecting speech activity in noisy audio signals	130
yashdv/speech-recognition	A Matlab code to recognize individuals based on their unique vocal characteristics.	40
wiseman/py-webrtcvad	A Python interface to the WebRTC Voice Activity Detector	2,088
matlab-deep-learning/wav2vec-2.0	Enables speech-to-text transcription using a pre-trained neural network model in MATLAB.	7
vocalpy/vak	A Python framework for training and applying neural networks to acoustic communication research	78
damo-nlp-sg/vcd	An approach to reduce object hallucinations in large vision-language models by contrasting output distributions derived from original and distorted visual inputs	222
cvde/roomreverb	A software tool for adding algorithmic reverb to audio recordings	111
gsoh/ved	A large-scale dataset capturing vehicle energy consumption and usage patterns in real-world driving scenarios.	94
cvondrick/vatic	Tools for efficiently scaling up video annotation using crowdsourced marketplaces.	609
vsitzmann/siren	An implementation of a neural network architecture for implicit function representation learning using periodic activation functions.	1,776
vehicle-lang/vehicle	A toolkit for enforcing logical specifications on neural networks	82
ahunnargikar/vagrant-mesos	A Vagrant setup to create a Mesos/Docker/Marathon/Aurora/ Jenkins development environment for testing and development.	122
marvinteichmann/multinet	An autonomous driving system that performs real-time road segmentation, car detection, and street classification using deep learning models.	549
vadymmarkov/beethoven	A Swift library providing an interface to pitch detection in audio signals.	828
ksw0306/clarinet	An implementation of a neural network-based vocoder using parallel-wavenet architecture and autoregressive flow	290