FFTNet

Text-to-Speech Model

An implementation of a deep learning-based text-to-speech model using the FFTNet architecture.

FFTNet vocoder implementation

81 stars

15 watching

8 forks

Language: Jupyter Notebook

Last commit: almost 7 years ago

Tags: deep-learning, fftnet, pytorch, text2speech, vocoder

Related projects:

Repository	Description	Stars
lifeiteng/vall-e	A PyTorch implementation of a text-to-speech synthesizer based on large language models	2,062
soobinseo/tacotron-pytorch	A PyTorch implementation of an end-to-end text-to-speech synthesis model.	207
kefirski/bytenet	A PyTorch implementation of a neural network model for machine translation	47
awni/speech	A PyTorch implementation of end-to-end speech recognition models.	756
r9y9/deepvoice3_pytorch	An implementation of text-to-speech synthesis using convolutional neural networks in PyTorch	1,970
openai/finetune-transformer-lm	Code and model for improving language understanding through generative pre-training with a transformer-based architecture.	2,167
isht7/pytorch-deeplab-resnet	A deep learning model implementation of the DeepLab ResNet architecture for image segmentation tasks.	602
l0sg/relational-rnn-pytorch	An implementation of DeepMind's Relational Recurrent Neural Networks (Santoro et al. 2018) in PyTorch for word language modeling	245
gram-ai/radio-transformer-networks	An implementation of a machine learning-based communications system using deep learning techniques.	127
eromera/erfnet	A toolbox for training and evaluating real-time semantic segmentation networks using the Torch library.	120
matlab-deep-learning/deepspeech	Enables speech-to-text transcription using a pre-trained Deep Speech model in MATLAB.	7
randl/shufflenetv2-pytorch	An implementation of a lightweight convolutional neural network architecture for mobile devices	191
yuangongnd/ltu	An audio and speech large language model implementation with pre-trained models, datasets, and inference options	396
erogol/seg-torch	Custom image segmentation implementation using deep learning with Lua and Torch	37
taoxugit/attngan	Reproduces text-to-image generation with attentional generative adversarial networks.	1,343