3D-convolutional-speaker-recognition
Speaker Verifier
Develops deep learning models using 3D convolutional neural networks for speaker verification tasks
Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
783 stars
58 watching
274 forks
Language: Python
last commit: almost 5 years ago
Linked from 1 awesome list
3dconvolutional-neural-networksdeep-learningspeaker-recognition
Related projects:
Repository | Description | Stars |
---|---|---|
astorfi/lip-reading-deeplearning | Deep learning-based system for recognizing speech from lip movements using 3D convolutional neural networks. | 1,840 |
soerenab/audiomnist | This project provides an implementation of a deep learning framework to classify audio signals and offers insights into the model's decision-making process using Explainable Artificial Intelligence (AI) techniques. | 351 |
keunwoochoi/auralisation | Reconstructs audio features learned by convolutional neural networks into audible sounds | 42 |
vipl-audio-visual-speech-understanding/lipreading-densenet3d | A software implementation of a deep learning model designed to understand lip movements in videos | 117 |
tinghuiz/sfmlearner | A framework for unsupervised depth and ego-motion estimation from monocular videos using deep learning | 1,977 |
denti/alexnet3d | An implementation of a 3D convolutional neural network based on the AlexNet architecture for image recognition in 3D data. | 42 |
charlesq34/frustum-pointnets | A deep learning framework for 3D object detection from RGB-D data | 1,598 |
vita-epfl/monoloco | A software framework for 3D vision and computer vision tasks using deep learning and 2D keypoints. | 431 |
astorfi/speechpy | Provides tools and libraries for extracting speech features from audio data. | 881 |
matlab-deep-learning/wav2vec-2.0 | Enables speech-to-text transcription using a pre-trained neural network model in MATLAB. | 7 |
bytedance/salmonn | A large language model enabling speech, audio event perception and music inputs to achieve multilingual capabilities | 1,091 |
kinwaicheuk/nnaudio | An audio processing toolkit using PyTorch convolutional neural networks to generate spectrograms from raw audio data | 1,036 |
matlab-deep-learning/deepspeech | Enables speech-to-text transcription using a pre-trained Deep Speech model in MATLAB. | 7 |
preritj/segmentation | Deep learning models for semantic segmentation of images | 101 |
coedl/elpis | A tool that enables language workers to build speech recognition models using multiple systems, including Kaldi and Huggingface Transformers. | 152 |