piper

TTS system

A fast local neural text-to-speech system optimized for small devices

7k stars
77 watching
513 forks
Language: C++
Last commit: 3 months ago
Linked from 1 awesome list

Topics: speech-synthesis, text-to-speech, tts

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| rhasspy/rhasspy | Open-source voice assistant software that uses speech recognition and natural language processing to automate tasks in home automation systems. | 2,419 |
| mozilla/tts | An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. | 9,466 |
| espeak-ng/espeak-ng | A text-to-speech synthesizer that supports multiple languages and is compact in size. | 4,311 |
| plachtaa/vall-e-x | A research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning. | 7,719 |
| seannaren/deepspeech.pytorch | A deep learning-based speech recognition system built on top of PyTorch Lightning. | 2,109 |
| speechbrain/speechbrain | A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities. | 9,066 |
| huggingface/transformers.js | An open-source JavaScript library for running machine learning models in the browser without a server. | 12,363 |
| mravanelli/pytorch-kaldi | A toolkit for developing state-of-the-art speech recognition systems using PyTorch and Kaldi. | 2,370 |
| rvc-boss/gpt-sovits | An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. | 36,977 |
| xiph/rnnoise | A deep learning-based audio noise reduction system using recurrent neural networks. | 4,191 |
| pytorch/audio | A PyTorch module providing tools and functions for audio signal processing. | 2,561 |
| huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 136,357 |
| amarcu5/piper | An open-source browser extension that adds Picture in Picture support to multiple video streaming services. | 261 |
| ggerganov/whisper.cpp | A high-performance inference implementation of an automatic speech recognition model in C++. | 36,332 |
| nvidia/waveglow | A flow-based network that generates high-quality speech from mel-spectrograms. | 2,294 |