piper
TTS system
A fast local neural text-to-speech system optimized for small devices
A fast, local neural text to speech system
7k stars
76 watching
490 forks
Language: C++
last commit: about 1 month ago
Linked from 1 awesome list
speech-synthesistext-to-speechtts
Related projects:
Repository | Description | Stars |
---|---|---|
rhasspy/rhasspy | An open-source voice assistant software that uses speech recognition and natural language processing to automate tasks in home automation systems. | 2,410 |
mozilla/tts | An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. | 9,442 |
espeak-ng/espeak-ng | A text-to-speech synthesizer that supports multiple languages and is compact in size. | 4,262 |
plachtaa/vall-e-x | A research implementation of Microsoft's VALL-E X zero-shot TTS model for multilingual text-to-speech synthesis and voice cloning | 7,701 |
seannaren/deepspeech.pytorch | A deep learning-based speech recognition system built on top of PyTorch Lightning. | 2,107 |
speechbrain/speechbrain | A PyTorch-based toolkit for building conversational AI systems with advanced speech and text processing capabilities. | 8,992 |
huggingface/transformers.js | Runs machine learning models directly in the browser without server-side support. | 12,240 |
mravanelli/pytorch-kaldi | A toolkit for developing state-of-the-art deep learning-based speech recognition systems | 2,369 |
rvc-boss/gpt-sovits | An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. | 36,413 |
xiph/rnnoise | A deep learning-based audio noise reduction system using recurrent neural networks | 4,156 |
pytorch/audio | A PyTorch module providing tools and functions for audio signal processing | 2,547 |
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 135,747 |
amarcu5/piper | An open-source browser extension that adds Picture in Picture support to multiple video streaming services | 261 |
ggerganov/whisper.cpp | A high-performance inference implementation of an automatic speech recognition model in C++ | 35,997 |
nvidia/waveglow | Generates high-quality speech from mel-spectrograms using a flow-based network architecture | 2,292 |