speech_recognition
Speech recognition library
A comprehensive speech recognition library with support for various engines and APIs.
Speech recognition module for Python, supporting several engines and APIs, online and offline.
8k stars
277 watching
2k forks
Language: Python
last commit: about 1 month ago
Linked from 1 awesome list
audiopythonspeech-recognitionspeech-to-text
Related projects:
Repository | Description | Stars |
---|---|---|
aofdev/vue-pwa-speech | Enables synchronous speech recognition with Google Cloud Speech API on a Progressive Web App | 99 |
aofdev/vue-speech-streaming | A Vue2 project providing streaming speech recognition with Google Cloud Speech API | 73 |
hahahumble/speechgpt | An application that enables users to converse with ChatGPT via speech and text interfaces. | 2,752 |
googleapis/google-api-nodejs-client | Provides a Node.js client library for accessing Google APIs with support for OAuth 2.0 authentication and multiple authorization methods. | 11,498 |
googleapis/google-api-go-client | Automatically generated libraries for interacting with Google APIs | 4,056 |
googlecloudplatform/generative-ai | Demonstrates how to use generative AI on Google Cloud with Vertex AI | 8,645 |
rhasspy/rhasspy | An open-source voice assistant software that uses speech recognition and natural language processing to automate tasks in home automation systems. | 2,419 |
googlecloudplatform/microservices-demo | A sample cloud-first application demonstrating Kubernetes, Istio, and gRPC integration | 17,174 |
evancohen/sonus | Software module enabling voice user interfaces with offline hotword detection and cloud-based speech recognition. | 631 |
n0shake/public-apis | A curated list of public APIs from around the web | 21,591 |
googleapis/google-cloud-go | A set of libraries providing access to Google Cloud Platform services via the Go programming language. | 3,809 |
talater/annyang | An open-source JavaScript library that enables voice control of web applications using speech recognition | 6,628 |
pipecat-ai/pipecat | A modular framework for building conversational AI applications with real-time voice and multimodal interactions. | 3,825 |
googleapis/googleapis | Contains public interface definitions of Google APIs | 7,673 |
jjlawren/sonos_cloud | Enables voice alert functionality on Sonos speakers using their cloud API. | 121 |