piper

TTS system

A fast, local neural text-to-speech system optimized for small devices.

7k stars

77 watching

513 forks

Language: C++

Last commit: 10 months ago

Linked from 1 awesome list

speech-synthesis, text-to-speech, tts

Website: rhasspy.github.io/piper-samples/

Backlinks from these awesome lists:

pluja/awesome-privacy

Related projects:

Repository	Description	Stars
rhasspy/rhasspy	An open-source voice assistant that uses speech recognition and natural language processing to automate tasks in home automation systems.	2,419
mozilla/tts	An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis.	9,466
espeak-ng/espeak-ng	A text-to-speech synthesizer that supports multiple languages and is compact in size.	4,311
plachtaa/vall-e-x	A research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning.	7,719
seannaren/deepspeech.pytorch	A deep learning-based speech recognition system built on top of PyTorch Lightning.	2,109
speechbrain/speechbrain	A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities.	9,066
huggingface/transformers.js	An open-source JavaScript library for running machine learning models in the browser without a server.	12,363
mravanelli/pytorch-kaldi	A toolkit for developing state-of-the-art speech recognition systems using PyTorch and Kaldi.	2,370
rvc-boss/gpt-sovits	An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models.	36,977
xiph/rnnoise	A deep learning-based audio noise reduction system using recurrent neural networks.	4,191
pytorch/audio	A PyTorch module providing tools and functions for audio signal processing.	2,561
huggingface/transformers	A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects.	136,357
amarcu5/piper	An open-source browser extension that adds Picture in Picture support to multiple video streaming services.	261
ggerganov/whisper.cpp	A high-performance inference implementation of an automatic speech recognition model in C++.	36,332
nvidia/waveglow	Generates high-quality speech from mel-spectrograms using a flow-based network architecture.	2,294