speech_recognition

Speech Transcription Library

Provides an interface to various speech recognition engines and APIs for text transcription from audio files.

Speech recognition module for Python, supporting several engines and APIs, online and offline.

GitHub

8k stars
277 watching
2k forks
Language: Python
last commit: 12 days ago
Linked from 1 awesome list

audiopythonspeech-recognitionspeech-to-text

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
aofdev/vue-pwa-speech Enables synchronous speech recognition with Google Cloud Speech API on a Progressive Web App 99
aofdev/vue-speech-streaming A Vue2 project providing streaming speech recognition with Google Cloud Speech API 73
hahahumble/speechgpt An application that enables users to converse with ChatGPT via speech and text interfaces. 2,746
googleapis/google-api-nodejs-client Provides a Node.js client library for accessing Google APIs with support for OAuth 2.0 authentication and multiple authorization methods. 11,428
googleapis/google-api-go-client Automatically generated libraries for interacting with Google APIs 4,034
googlecloudplatform/generative-ai Generative AI workflow development and management tools on Google Cloud using Vertex AI. 7,807
rhasspy/rhasspy An open-source voice assistant software that uses speech recognition and natural language processing to automate tasks in home automation systems. 2,395
googlecloudplatform/microservices-demo A sample cloud-first application demonstrating Kubernetes, Istio, and gRPC integration 17,041
evancohen/sonus Software module enabling voice user interfaces with offline hotword detection and cloud-based speech recognition. 627
n0shake/public-apis A curated list of public APIs from around the web 21,505
googleapis/google-cloud-go A set of libraries providing access to Google Cloud Platform services via the Go programming language. 3,779
talater/annyang An open-source JavaScript library that enables voice control of web applications using speech recognition 6,628
pipecat-ai/pipecat A framework for building conversational AI agents with voice and multimodal interactions 3,383
googleapis/googleapis Contains public interface definitions of Google APIs 7,627
jjlawren/sonos_cloud Enables voice alert functionality on Sonos speakers using their cloud API. 121