FFTNet

Text-to-Speech Model

An implementation of a deep learning-based text-to-speech model using the FFTNet architecture.

FFTNet vocoder implementation

GitHub

81 stars
15 watching
8 forks
Language: Jupyter Notebook
last commit: about 6 years ago
deep-learningfftnetpytorchtext2speechvocoder

Related projects:

Repository Description Stars
lifeiteng/vall-e A PyTorch implementation of a text-to-speech synthesizer based on large language models 2,053
soobinseo/tacotron-pytorch A PyTorch implementation of an end-to-end text-to-speech synthesis model. 206
kefirski/bytenet A Pytorch implementation of a neural network model for machine translation 47
awni/speech A PyTorch implementation of end-to-end speech recognition models. 754
r9y9/deepvoice3_pytorch An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch 1,969
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,160
isht7/pytorch-deeplab-resnet A deep learning model implementation of the DeepLab ResNet architecture for image segmentation tasks. 602
l0sg/relational-rnn-pytorch An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling 244
gram-ai/radio-transformer-networks An implementation of a machine learning-based communications system using deep learning techniques. 127
eromera/erfnet A toolbox for training and evaluating real-time semantic segmentation networks using Torch library. 118
matlab-deep-learning/deepspeech Enables speech-to-text transcription using a pre-trained Deep Speech model in MATLAB. 7
randl/shufflenetv2-pytorch An implementation of a lightweight convolutional neural network architecture for mobile devices 191
yuangongnd/ltu An audio and speech large language model implementation with pre-trained models, datasets, and inference options 385
erogol/seg-torch Custom image segmentation implementation using deep learning with Lua and Torch 37
taoxugit/attngan Reproduces text-to-image generation with attentional generative adversarial networks. 1,339