speech_recognition
Speech Transcription Library
Provides an interface to various speech recognition engines and APIs for text transcription from audio files.
Speech recognition module for Python, supporting several engines and APIs, online and offline.
8k stars
277 watching
2k forks
Language: Python
Last commit: 12 days ago
Linked from 1 awesome list
audio, python, speech-recognition, speech-to-text
Related projects:
Repository | Description | Stars |
---|---|---|
aofdev/vue-pwa-speech | Enables synchronous speech recognition with Google Cloud Speech API on a Progressive Web App | 99 |
aofdev/vue-speech-streaming | A Vue2 project providing streaming speech recognition with Google Cloud Speech API | 73 |
hahahumble/speechgpt | An application that enables users to converse with ChatGPT via speech and text interfaces. | 2,746 |
googleapis/google-api-nodejs-client | Provides a Node.js client library for accessing Google APIs with support for OAuth 2.0 authentication and multiple authorization methods. | 11,428 |
googleapis/google-api-go-client | Automatically generated libraries for interacting with Google APIs | 4,034 |
googlecloudplatform/generative-ai | Generative AI workflow development and management tools on Google Cloud using Vertex AI. | 7,807 |
rhasspy/rhasspy | Open-source voice assistant software that uses speech recognition and natural language processing to automate tasks in home automation systems. | 2,395 |
googlecloudplatform/microservices-demo | A sample cloud-first application demonstrating Kubernetes, Istio, and gRPC integration | 17,041 |
evancohen/sonus | Software module enabling voice user interfaces with offline hotword detection and cloud-based speech recognition. | 627 |
n0shake/public-apis | A curated list of public APIs from around the web | 21,505 |
googleapis/google-cloud-go | A set of libraries providing access to Google Cloud Platform services via the Go programming language. | 3,779 |
talater/annyang | An open-source JavaScript library that enables voice control of web applications using speech recognition | 6,628 |
pipecat-ai/pipecat | A framework for building conversational AI agents with voice and multimodal interactions | 3,383 |
googleapis/googleapis | Contains public interface definitions of Google APIs | 7,627 |
jjlawren/sonos_cloud | Enables voice alert functionality on Sonos speakers using their cloud API. | 121 |