piper
TTS system
A fast local neural text-to-speech system optimized for small devices
A fast, local neural text to speech system
7k stars
77 watching
513 forks
Language: C++
last commit: 3 months ago
Linked from 1 awesome list
speech-synthesistext-to-speechtts
Related projects:
Repository | Description | Stars |
---|---|---|
rhasspy/rhasspy | An open-source voice assistant software that uses speech recognition and natural language processing to automate tasks in home automation systems. | 2,419 |
mozilla/tts | An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. | 9,466 |
espeak-ng/espeak-ng | A text-to-speech synthesizer that supports multiple languages and is compact in size. | 4,311 |
plachtaa/vall-e-x | A research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning | 7,719 |
seannaren/deepspeech.pytorch | A deep learning-based speech recognition system built on top of PyTorch Lightning. | 2,109 |
speechbrain/speechbrain | A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities. | 9,066 |
huggingface/transformers.js | An open-source JavaScript library for running machine learning models in the browser without a server. | 12,363 |
mravanelli/pytorch-kaldi | Develops state-of-the-art speech recognition systems using PyTorch and Kaldi toolkits | 2,370 |
rvc-boss/gpt-sovits | An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. | 36,977 |
xiph/rnnoise | A deep learning-based audio noise reduction system using recurrent neural networks | 4,191 |
pytorch/audio | A PyTorch module providing tools and functions for audio signal processing | 2,561 |
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 136,357 |
amarcu5/piper | An open-source browser extension that adds Picture in Picture support to multiple video streaming services | 261 |
ggerganov/whisper.cpp | A high-performance inference implementation of an automatic speech recognition model in C++ | 36,332 |
nvidia/waveglow | Generates high-quality speech from mel-spectrograms using a flow-based network architecture | 2,294 |