3D-convolutional-speaker-recognition

Speaker Verifier

Develops deep learning models using 3D convolutional neural networks for speaker verification tasks

speaker Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

GitHub

782 stars
58 watching
274 forks
Language: Python
last commit: over 4 years ago
Linked from 1 awesome list

3dconvolutional-neural-networksdeep-learningspeaker-recognition

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
astorfi/lip-reading-deeplearning Deep learning-based system for recognizing speech from lip movements using 3D convolutional neural networks. 1,833
soerenab/audiomnist This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. 347
keunwoochoi/auralisation Reconstructs audio features learned by convolutional neural networks into audible sounds 42
vipl-audio-visual-speech-understanding/lipreading-densenet3d A software implementation of a deep learning model designed to understand lip movements in videos 117
tinghuiz/sfmlearner An unsupervised learning framework for depth and ego-motion estimation from monocular videos 1,967
denti/alexnet3d An implementation of a 3D convolutional neural network based on the AlexNet architecture for image recognition in 3D data. 43
charlesq34/frustum-pointnets A deep learning framework for 3D object detection from RGB-D data 1,592
vita-epfl/monoloco A library for 3D vision tasks using 2D keypoints 428
astorfi/speechpy Provides tools and libraries for extracting speech features from audio data. 880
matlab-deep-learning/wav2vec-2.0 Enables speech-to-text transcription using a pre-trained neural network model in MATLAB. 8
bytedance/salmonn A large language model enabling speech, audio event perception and music inputs to achieve multilingual capabilities 1,053
kinwaicheuk/nnaudio An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data 1,032
matlab-deep-learning/deepspeech Enables speech-to-text transcription using a pre-trained Deep Speech model in MATLAB. 7
preritj/segmentation Deep learning models for semantic segmentation of images 100
coedl/elpis A tool that enables language workers to build speech recognition models using multiple systems, including Kaldi and Huggingface Transformers. 152