piper

TTS system

A fast local neural text-to-speech system optimized for small devices

7k stars
76 watching
490 forks
Language: C++
last commit: about 1 month ago
Linked from 1 awesome list

speech-synthesis, text-to-speech, tts

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| rhasspy/rhasspy | Open-source voice assistant software that uses speech recognition and natural language processing to automate tasks in home automation systems. | 2,410 |
| mozilla/tts | An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. | 9,442 |
| espeak-ng/espeak-ng | A text-to-speech synthesizer that supports multiple languages and is compact in size. | 4,262 |
| plachtaa/vall-e-x | A research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning. | 7,701 |
| seannaren/deepspeech.pytorch | A deep learning-based speech recognition system built on top of PyTorch Lightning. | 2,107 |
| speechbrain/speechbrain | A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities. | 8,992 |
| huggingface/transformers.js | Runs machine learning models directly in the browser without server-side support. | 12,240 |
| mravanelli/pytorch-kaldi | A toolkit for developing state-of-the-art deep learning-based speech recognition systems. | 2,369 |
| rvc-boss/gpt-sovits | An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. | 36,413 |
| xiph/rnnoise | A deep learning-based audio noise reduction system using recurrent neural networks. | 4,156 |
| pytorch/audio | A PyTorch module providing tools and functions for audio signal processing. | 2,547 |
| huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 135,747 |
| amarcu5/piper | An open-source browser extension that adds Picture in Picture support to multiple video streaming services. | 261 |
| ggerganov/whisper.cpp | A high-performance inference implementation of an automatic speech recognition model in C++. | 35,997 |
| nvidia/waveglow | Generates high-quality speech from mel-spectrograms using a flow-based network architecture. | 2,292 |