common-voice

Speech collector

A platform that collects and annotates speech data to train voice recognition models

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

GitHub

3k stars
133 watching
843 forks
Language: TypeScript
last commit: about 5 hours ago
Linked from 1 awesome list

crowdsourcinginternet-freedomopen-datavoice

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
mumble-voip/mumble A high-quality, low-latency voice chat software with support for multiple platforms and plugins. 6,484
pipecat-ai/pipecat A modular framework for building conversational AI applications with real-time voice and multimodal interactions. 3,825
melusina-org/make-common-lisp-program Creates executable Common Lisp programs on GitHub runners with various implementations and systems 3
discourse/discourse A Ruby-based platform for community discussion with real-time chat and plugins. 42,613
openbmb/chatdev An interactive software framework built on large language models to facilitate collaborative development and task-oriented interactions among multiple agents. 25,916
mozilla/tts An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. 9,466
hahahumble/speechgpt An application that enables users to converse with ChatGPT via speech and text interfaces. 2,752
ggerganov/whisper.cpp A high-performance inference implementation of an automatic speech recognition model in C++ 36,332
jasonppy/voicecraft A neural codec model for speech editing and text-to-speech synthesis in real-time, using few seconds of reference audio. 7,744
rasahq/rasa Automates conversations with contextual assistants 19,076
rhasspy/piper A fast local neural text-to-speech system optimized for small devices 7,002
espeak-ng/espeak-ng A text-to-speech synthesizer that supports multiple languages and is compact in size. 4,311
cogentapps/chat-with-gpt An open-source ChatGPT app with added features and customization options 2,327
mahmoudashraf97/whisper-diarization Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks. 3,874
rvc-boss/gpt-sovits An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. 36,977