DDSP-SVC
Voice converter
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
2k stars
23 watching
250 forks
Language: Python
Last commit: 24 days ago
Linked from 1 awesome list
pytorch
Related projects:
Repository | Description | Stars |
---|---|---|
js-mim/mss_pytorch | A PyTorch implementation of a singing voice separation algorithm using recurrent inference and skip-filtering connections. | 171 |
lifeiteng/vall-e | A PyTorch implementation of a text-to-speech synthesizer based on large language models | 2,049 |
seannaren/deepspeech.pytorch | A deep learning-based speech recognition system built on top of PyTorch Lightning. | 2,104 |
yongxuustc/dcase2017_task4_cvssp | A system for audio classification and detection using machine learning models | 4 |
r9y9/deepvoice3_pytorch | An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch | 1,969 |
r9y9/tacotron_pytorch | An implementation of Tacotron speech synthesis model using PyTorch. | 309 |
thudm/glm-4-voice | An end-to-end speech synthesis model that generates human-like speech in real time | 2,269 |
yashdv/speech-recognition | MATLAB code that recognizes individuals based on their unique vocal characteristics. | 40 |
inisis/brocolli | Converts PyTorch models to various formats for deployment and testing in deep learning frameworks. | 341 |
thecodrr/vspeech | Provides an interface to Mozilla's DeepSpeech TensorFlow-based Speech-to-Text library using V bindings. | 50 |
leviswind/pytorch-transformer | Implementation of a transformer-based translation model in PyTorch | 239 |
cpitclaudel/monospacifier | Converts variable-width fonts to monospace fonts to improve Unicode coverage and formatting consistency in programming. | 381 |
ibm/max-speech-to-text-converter | Converts spoken words into text form using speech recognition technology | 76 |
awni/speech | A PyTorch implementation of end-to-end speech recognition models. | 754 |
byulparan/sc-vst | A Common Lisp library providing VST plugin support for the cl-collider digital signal processing framework. | 6 |