3D-convolutional-speaker-recognition
Speaker Verifier
Develops deep learning models using 3D convolutional neural networks for speaker verification tasks
Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
782 stars
58 watching
274 forks
Language: Python
Last commit: over 4 years ago
Linked from 1 awesome list
3d, convolutional-neural-networks, deep-learning, speaker-recognition
Related projects:
Repository | Description | Stars |
---|---|---|
astorfi/lip-reading-deeplearning | Deep learning-based system for recognizing speech from lip movements using 3D convolutional neural networks. | 1,833 |
soerenab/audiomnist | A deep learning framework for classifying audio signals, with Explainable AI techniques that give insight into the model's decision-making process. | 347 |
keunwoochoi/auralisation | Reconstructs audio features learned by convolutional neural networks into audible sounds | 42 |
vipl-audio-visual-speech-understanding/lipreading-densenet3d | A software implementation of a deep learning model designed to understand lip movements in videos | 117 |
tinghuiz/sfmlearner | An unsupervised learning framework for depth and ego-motion estimation from monocular videos | 1,967 |
denti/alexnet3d | An implementation of a 3D convolutional neural network based on the AlexNet architecture for image recognition in 3D data. | 43 |
charlesq34/frustum-pointnets | A deep learning framework for 3D object detection from RGB-D data | 1,592 |
vita-epfl/monoloco | A library for 3D vision tasks using 2D keypoints | 428 |
astorfi/speechpy | Provides tools and libraries for extracting speech features from audio data. | 880 |
matlab-deep-learning/wav2vec-2.0 | Enables speech-to-text transcription using a pre-trained neural network model in MATLAB. | 8 |
bytedance/salmonn | A large language model that perceives speech, audio event, and music inputs and offers multilingual capabilities | 1,053 |
kinwaicheuk/nnaudio | An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data | 1,032 |
matlab-deep-learning/deepspeech | Enables speech-to-text transcription using a pre-trained Deep Speech model in MATLAB. | 7 |
preritj/segmentation | Deep learning models for semantic segmentation of images | 100 |
coedl/elpis | A tool that enables language workers to build speech recognition models using multiple systems, including Kaldi and Huggingface Transformers. | 152 |