whispering

Transcription tool

An open-source tool for real-time audio and image transcription with support for multiple languages and various applications

Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRChat and Overlays in most Streaming Applications

GitHub

404 stars
12 watching
30 forks
Language: Python
last commit: about 1 month ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
collabora/whisperlive An implementation of Whisper's speech-to-text functionality in a real-time transcription application 2,186
schibsted/waas A service for transcribing and processing audio files using OpenAI Whisper, providing both GUI and API options. 1,854
chengsokdara/use-whisper A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API 738
srijith-rkr/kaust-whisper-adapter A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. 32
sandrohanea/whisper.net A .NET implementation of OpenAI Whisper models for speech recognition and text-to-speech conversion. 601
softcatala/whisper-ctranslate2 An AI-powered speech recognition and translation tool that utilizes CTranslate2 and Faster-whisper implementations for faster and more efficient processing. 938
purfview/whisper-standalone-win Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools 1,405
shashikg/whispers2t An optimized speech-to-text pipeline designed to improve inference speed and accuracy 330
thewh1teagle/vibe An AI-powered audio and video transcription tool with cross-platform support for desktop devices. 1,390
novinfard/transcriptionhelper An iOS application that assists users in transcribing audio files for writing or language learning purposes. 7
dmort27/epitran A tool for transcribing written text into the International Phonetic Alphabet (IPA) format. 668
benwbrum/fromthepage A wiki-like application for collaborative transcription of handwritten documents from scanned pages. 171
arthurfdlr/whisper-youtube Transcribes Youtube videos using OpenAI's Whisper speech recognition model 369
smallwat3r/shhh A tool to securely share sensitive information through encrypted links with expiration dates and limited access attempts. 385
yuangongnd/whisper-at An audio processing model that adds audio event tagging capabilities to an existing speech recognition system with minimal additional computational cost. 343