common-voice

Speech collector

A platform that collects and annotates speech data to train voice recognition models.

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

GitHub

3k stars
132 watching
844 forks
Language: TypeScript
Last commit: 4 days ago
Linked from 1 awesome list

Topics: crowdsourcing, internet-freedom, open-data, voice

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| mumble-voip/mumble | A high-quality, low-latency voice chat software with support for multiple platforms and plugins. | 6,428 |
| pipecat-ai/pipecat | A framework for building conversational AI agents with voice and multimodal interactions | 3,383 |
| melusina-org/make-common-lisp-program | Creates executable Common Lisp programs on GitHub runners with various implementations and systems | 3 |
| discourse/discourse | A Ruby-based platform for community discussion with real-time chat and plugins. | 42,337 |
| openbmb/chatdev | An interactive software framework built on large language models to facilitate collaborative development and task-oriented interactions among multiple agents. | 25,601 |
| mozilla/tts | An open-source project providing a suite of deep learning models and tools for advanced text-to-speech synthesis. | 9,401 |
| hahahumble/speechgpt | An application that enables users to converse with ChatGPT via speech and text interfaces. | 2,746 |
| ggerganov/whisper.cpp | A high-performance implementation of the OpenAI Whisper ASR model in C++ | 35,706 |
| jasonppy/voicecraft | A neural codec model for speech editing and text-to-speech synthesis in real-time, using few seconds of reference audio. | 7,638 |
| rasahq/rasa | Automates conversations with contextual assistants | 18,956 |
| rhasspy/piper | A fast local neural text-to-speech system optimized for small devices | 6,576 |
| espeak-ng/espeak-ng | A text-to-speech synthesizer that supports multiple languages and is compact in size. | 4,224 |
| cogentapps/chat-with-gpt | An open-source ChatGPT app with added features and customization options | 2,322 |
| mahmoudashraf97/whisper-diarization | Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks. | 3,718 |
| rvc-boss/gpt-sovits | An AI system for generating human-like voices from text inputs, using deep learning techniques and pre-trained models. | 35,728 |