wav2vec-2.0

Speech Transcription Engine

Enables speech-to-text transcription using a pre-trained neural network model in MATLAB.

This repo provides the pretrained baseline 960-hour wav2vec 2.0 model in MATLAB.

8 stars
6 watching
4 forks
last commit: over 2 years ago
audio, deep-learning, matlab, matlab-deep-learning, pretrained-models, speech-to-text

Related projects:

Repository | Description | Stars
matlab-deep-learning/deepspeech | Enables speech-to-text transcription using a pre-trained Deep Speech model in MATLAB. | 7
matlab-deep-learning/pretrained-deeplabv3plus | Provides a pre-trained and customizable semantic segmentation model in MATLAB. | 23
matlab-deep-learning/pretrained-yolo-v4 | Pretrained deep learning object detection model for image analysis in MATLAB. | 47
veenveenveen/speechsignalprocessingcourse | A collection of MATLAB source code examples for learning speech signal processing techniques. | 66
balavenkatesh3322/audio-pretrained-model | A collection of pre-trained audio and speech models for various applications. | 182
matlab-deep-learning/transformer-models | An implementation of deep learning transformer models in MATLAB. | 206
vefstathiou/so_word2vec | A word embedding model trained on Stack Overflow posts for use in natural language processing tasks. | 40
trekhleb/machine-learning-octave | A repository providing MATLAB/Octave examples and explanations of popular machine learning algorithms. | 852
apache/tvm-vta | A comprehensive hardware design stack for accelerating deep learning models. | 254
picovoice/rhino | A deep learning-based speech-to-intent engine for on-device voice interaction. | 629
peak1995/speech-enhancement-dsp | MATLAB implementations of traditional speech enhancement techniques, including spectral subtraction, Wiener filtering, and Kalman filtering. | 82
auspicious3000/contentvec | An implementation of a self-supervised speech representation model using PyTorch and disentangled speaker embeddings. | 467
visenger/handson-ml | Teaches machine learning fundamentals in Python using Scikit-Learn and TensorFlow. | 6
matlab-deep-learning/pretrained-salsanext | Provides a pre-trained deep learning model for semantic segmentation of 3D point clouds using the SalsaNext architecture. | 14
huckiyang/voice2series-reprogramming | An approach to reprogramming acoustic models for time series classification using differential mel-spectrograms and adversarial training. | 69