whispering

Transcription tool

An open-source tool for real-time audio and image transcription with support for multiple languages and various applications

Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRChat and Overlays in most Streaming Applications

GitHub

404 stars

12 watching

30 forks

Language: Python

last commit: over 1 year ago

Linked from 1 awesome list

Backlinks from these awesome lists:

madjin/awesome-vrchat

Related projects:

Repository	Description	Stars
collabora/whisperlive	An implementation of Whisper's speech-to-text functionality in a real-time transcription application	2,186
schibsted/waas	A service for transcribing and processing audio files using OpenAI Whisper, providing both GUI and API options.	1,854
chengsokdara/use-whisper	A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API	738
srijith-rkr/kaust-whisper-adapter	A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods.	32
sandrohanea/whisper.net	A .NET implementation of OpenAI Whisper models for speech recognition and text-to-speech conversion.	601
softcatala/whisper-ctranslate2	An AI-powered speech recognition and translation tool that utilizes CTranslate2 and Faster-whisper implementations for faster and more efficient processing.	938
purfview/whisper-standalone-win	Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools	1,405
shashikg/whispers2t	An optimized speech-to-text pipeline designed to improve inference speed and accuracy	330
thewh1teagle/vibe	An AI-powered audio and video transcription tool with cross-platform support for desktop devices.	1,390
novinfard/transcriptionhelper	An iOS application that assists users in transcribing audio files for writing or language learning purposes.	7
dmort27/epitran	A tool for transcribing written text into the International Phonetic Alphabet (IPA) format.	668
benwbrum/fromthepage	A wiki-like application for collaborative transcription of handwritten documents from scanned pages.	171
arthurfdlr/whisper-youtube	Transcribes Youtube videos using OpenAI's Whisper speech recognition model	369
smallwat3r/shhh	A tool to securely share sensitive information through encrypted links with expiration dates and limited access attempts.	385
yuangongnd/whisper-at	An audio processing model that adds audio event tagging capabilities to an existing speech recognition system with minimal additional computational cost.	343