WhisperLive

Transcription engine

An implementation of Whisper's speech-to-text functionality in a real-time transcription application

A nearly-live implementation of OpenAI's Whisper.

GitHub

2k stars
32 watching
296 forks
Language: Python
last commit: about 2 months ago
dictationobsopenaitensorrttensorrt-llmtext-to-speechtranslationvoice-recognitionwhisperwhisper-tensorrt

Related projects:

Repository Description Stars
sandrohanea/whisper.net A .NET implementation of OpenAI Whisper models for speech recognition and text-to-speech conversion. 601
arthurfdlr/whisper-youtube Transcribes Youtube videos using OpenAI's Whisper speech recognition model 369
softcatala/whisper-ctranslate2 An AI-powered speech recognition and translation tool that utilizes CTranslate2 and Faster-whisper implementations for faster and more efficient processing. 938
purfview/whisper-standalone-win Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools 1,405
macoron/whisper.unity Provides a high-performance speech recognition system for Unity3D applications. 445
schibsted/waas A service for transcribing and processing audio files using OpenAI Whisper, providing both GUI and API options. 1,854
chengsokdara/use-whisper A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API 738
sharrnah/whispering An open-source tool for real-time audio and image transcription with support for multiple languages and various applications 404
mybigday/whisper.rn A React Native binding of Whisper's automatic speech recognition model 408
m1guelpf/yt-whisper Automates transcription and subtitle generation from YouTube videos using OpenAI's Whisper model 1,373
thewh1teagle/vibe An AI-powered audio and video transcription tool with cross-platform support for desktop devices. 1,390
picovoice/rhino A deep learning-based speech-to-intent engine for on-device voice interaction 633
linto-ai/whisper-timestamped An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores. 2,121
srijith-rkr/kaust-whisper-adapter A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. 32
illyism/openai-whisper-api An OpenAI speech-to-text API service built with Node.js and Typescript, running on Docker and Google Cloud Run. 110