wav2vec-2.0

Speech Transcription Engine

Enables speech-to-text transcription using a pre-trained neural network model in MATLAB.

This repo provides the pretrained baseline 960 hours wav2vec 2.0 model in MATLAB.

GitHub

8 stars

6 watching

4 forks

last commit: over 2 years ago

audiodeep-learningmatlabmatlab-deep-learningpretrained-modelsspeech-to-text

Screenshot of matlab-deep-learning/wav2vec-2.0 website

www.mathworks.com/products/deep-learning.html

Related projects:

Repository	Description	Stars
matlab-deep-learning/deepspeech	Enables speech-to-text transcription using a pre-trained Deep Speech model in MATLAB.	7
matlab-deep-learning/pretrained-deeplabv3plus	Provides pre-trained and customizable semantic segmentation model in MATLAB	23
matlab-deep-learning/pretrained-yolo-v4	Pretrained deep learning object detection model for image analysis in MATLAB	47
veenveenveen/speechsignalprocessingcourse	This project provides a collection of MATLAB source code examples for learning speech signal processing techniques	66
balavenkatesh3322/audio-pretrained-model	A collection of pre-trained audio and speech models for various applications	182
matlab-deep-learning/transformer-models	An implementation of deep learning transformer models in MATLAB	206
vefstathiou/so_word2vec	This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks.	40
trekhleb/machine-learning-octave	A repository providing MatLab/Octave examples and explanations of popular machine learning algorithms	852
apache/tvm-vta	A comprehensive hardware design stack for accelerating deep learning models	254
picovoice/rhino	A deep learning-based speech-to-intent engine for on-device voice interaction	629
peak1995/speech-enhancement-dsp	This repository provides MATLAB implementations of traditional speech enhancement techniques including spectral subtraction, Wiener filtering, and Kalman filtering.	82
auspicious3000/contentvec	An implementation of a self-supervised speech representation model using PyTorch and disentangled speaker embeddings	467
visenger/handson-ml	Teaches Machine Learning fundamentals in Python using Scikit-Learn and TensorFlow	6
matlab-deep-learning/pretrained-salsanext	Provides pre-trained deep learning model for semantic segmentation of 3D point clouds using SalsaNext architecture	14
huckiyang/voice2series-reprogramming	An approach to reprogramming acoustic models for time series classification using differential mel-spectrograms and adversarial training	69