DDSP-SVC

Voice converter

Real-time end-to-end singing voice conversion system using DDSP models and various machine learning techniques

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

GitHub

2k stars
23 watching
250 forks
Language: Python
last commit: 24 days ago
Linked from 1 awesome list

pytorch

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
js-mim/mss_pytorch This project provides a PyTorch implementation of a singing voice separation algorithm using recurrent inference and skip-filtering connections. 171
lifeiteng/vall-e A PyTorch implementation of a text-to-speech synthesizer based on large language models 2,049
seannaren/deepspeech.pytorch A deep learning-based speech recognition system built on top of PyTorch Lightning. 2,104
yongxuustc/dcase2017_task4_cvssp A system for audio classification and detection using machine learning models 4
r9y9/deepvoice3_pytorch An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch 1,969
r9y9/tacotron_pytorch An implementation of Tacotron speech synthesis model using PyTorch. 309
thudm/glm-4-voice An end-to-end speech synthesis model that generates human-like speech in real-time 2,269
yashdv/speech-recognition A Matlab code to recognize individuals based on their unique vocal characteristics. 40
inisis/brocolli Converts PyTorch models to various formats for deployment and testing in deep learning frameworks. 341
thecodrr/vspeech Provides an interface to Mozilla's DeepSpeech TensorFlow-based Speech-to-Text library using V bindings. 50
leviswind/pytorch-transformer Implementation of a transformer-based translation model in PyTorch 239
cpitclaudel/monospacifier Converts variable-width fonts to monospace fonts to improve Unicode coverage and formatting consistency in programming. 381
ibm/max-speech-to-text-converter Converts spoken words into text form using speech recognition technology 76
awni/speech A PyTorch implementation of end-to-end speech recognition models. 754
byulparan/sc-vst A Common Lisp library providing VST plugin support for the cl-collider digital signal processing framework. 6