whispering
Transcription tool
An open-source tool for real-time audio and image transcription with support for multiple languages and various applications
Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRChat and Overlays in most Streaming Applications
401 stars
13 watching
29 forks
Language: Python
last commit: 23 days ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
collabora/whisperlive | An implementation of Whisper's speech-to-text functionality in a real-time transcription application | 2,050 |
schibsted/waas | A service for transcribing and processing audio files using OpenAI Whisper, providing both GUI and API options. | 1,841 |
chengsokdara/use-whisper | A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API | 733 |
srijith-rkr/kaust-whisper-adapter | A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. | 32 |
sandrohanea/whisper.net | An open-source speech-to-text library built on top of Whisper Models for cross-platform support. | 582 |
softcatala/whisper-ctranslate2 | A Whisper client compatible with the CTranslate2 model, providing faster transcription and translation capabilities than OpenAI Whisper | 914 |
purfview/whisper-standalone-win | Executable standalone versions of Whisper and Faster-Whisper speech recognition tools | 1,326 |
shashikg/whispers2t | An optimized speech-to-text pipeline designed to improve inference speed and accuracy | 310 |
thewh1teagle/vibe | An AI-powered audio and video transcription tool with cross-platform support for desktop devices. | 1,209 |
novinfard/transcriptionhelper | An iOS application that assists users in transcribing audio files for writing or language learning purposes. | 7 |
dmort27/epitran | A tool for transcribing written text into the International Phonetic Alphabet (IPA) format. | 653 |
benwbrum/fromthepage | A wiki-like application for collaborative transcription of handwritten documents from scanned pages. | 171 |
arthurfdlr/whisper-youtube | Transcribes Youtube videos using OpenAI's Whisper speech recognition model | 362 |
smallwat3r/shhh | A tool to securely share sensitive information through encrypted links with expiration dates and limited access attempts. | 378 |
yuangongnd/whisper-at | An audio processing model that adds audio event tagging capabilities to an existing speech recognition system with minimal additional computational cost. | 321 |