WhisperLive
Transcription engine
A real-time transcription application built on Whisper's speech-to-text functionality
A nearly-live implementation of OpenAI's Whisper.
2k stars
32 watching
296 forks
Language: Python
Last commit: about 2 months ago
Topics: dictation, obs, openai, tensorrt, tensorrt-llm, text-to-speech, translation, voice-recognition, whisper, whisper-tensorrt
Related projects:
Repository | Description | Stars |
---|---|---|
sandrohanea/whisper.net | A .NET implementation of OpenAI Whisper models for speech recognition and speech-to-text transcription. | 601 |
arthurfdlr/whisper-youtube | Transcribes YouTube videos using OpenAI's Whisper speech recognition model. | 369 |
softcatala/whisper-ctranslate2 | An AI-powered speech recognition and translation tool that utilizes CTranslate2 and Faster-whisper implementations for faster and more efficient processing. | 938 |
purfview/whisper-standalone-win | Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools | 1,405 |
macoron/whisper.unity | Provides a high-performance speech recognition system for Unity3D applications. | 445 |
schibsted/waas | A service for transcribing and processing audio files using OpenAI Whisper, providing both GUI and API options. | 1,854 |
chengsokdara/use-whisper | A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API | 738 |
sharrnah/whispering | An open-source tool for real-time audio and image transcription with support for multiple languages and various applications | 404 |
mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 408 |
m1guelpf/yt-whisper | Automates transcription and subtitle generation from YouTube videos using OpenAI's Whisper model | 1,373 |
thewh1teagle/vibe | An AI-powered audio and video transcription tool with cross-platform support for desktop devices. | 1,390 |
picovoice/rhino | A deep learning-based speech-to-intent engine for on-device voice interaction | 633 |
linto-ai/whisper-timestamped | An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores. | 2,121 |
srijith-rkr/kaust-whisper-adapter | A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. | 32 |
illyism/openai-whisper-api | An OpenAI speech-to-text API service built with Node.js and Typescript, running on Docker and Google Cloud Run. | 110 |