common-voice
Speech collector
A platform that collects and annotates speech data to train voice recognition models
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
3k stars
132 watching
844 forks
Language: TypeScript
last commit: 4 days ago
Linked from 1 awesome list
crowdsourcinginternet-freedomopen-datavoice
Related projects:
Repository | Description | Stars |
---|---|---|
mumble-voip/mumble | A high-quality, low-latency voice chat software with support for multiple platforms and plugins. | 6,428 |
pipecat-ai/pipecat | A framework for building conversational AI agents with voice and multimodal interactions | 3,383 |
melusina-org/make-common-lisp-program | Creates executable Common Lisp programs on GitHub runners with various implementations and systems | 3 |
discourse/discourse | A Ruby-based platform for community discussion with real-time chat and plugins. | 42,337 |
openbmb/chatdev | An interactive software framework built on large language models to facilitate collaborative development and task-oriented interactions among multiple agents. | 25,601 |
mozilla/tts | An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. | 9,401 |
hahahumble/speechgpt | An application that enables users to converse with ChatGPT via speech and text interfaces. | 2,746 |
ggerganov/whisper.cpp | A high-performance implementation of the OpenAI Whisper ASR model in C++ | 35,706 |
jasonppy/voicecraft | A neural codec model for speech editing and text-to-speech synthesis in real-time, using few seconds of reference audio. | 7,638 |
rasahq/rasa | Automates conversations with contextual assistants | 18,956 |
rhasspy/piper | A fast local neural text-to-speech system optimized for small devices | 6,576 |
espeak-ng/espeak-ng | A text-to-speech synthesizer that supports multiple languages and is compact in size. | 4,224 |
cogentapps/chat-with-gpt | An open-source ChatGPT app with added features and customization options | 2,322 |
mahmoudashraf97/whisper-diarization | Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks. | 3,718 |
rvc-boss/gpt-sovits | An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. | 35,728 |