awesome-whisper
Speech toolkit
A curated list of tools and resources for building speech-to-text applications
Awesome list for Whisper, an open-source AI-powered speech recognition system developed by OpenAI
1k stars
24 watching
61 forks
last commit: over 1 year ago
Official | |||
| Introduction | |||
| Source code | 72,752 | 12 months ago | |
| White paper | |||
Model variants | |||
| Whisper.cpp | 36,332 | 11 months ago | Port of Whisper in C++ |
Model variants / Whisper.cpp | |||
| Bindings for many languages | 36,332 | 11 months ago | |
Model variants | |||
| WhisperX | 12,894 | 11 months ago | Fast automatic speech recognition with word-level timestamps and speaker diarization |
| faster-whisper | 12,989 | 11 months ago | Faster reimplementation of Whisper using CTranslate2 |
| Whisper JAX | 4,467 | over 1 year ago | JAX implementation of Whisper for up to 70x speed-up on TPU |
| whisper-timestamped | 2,121 | 11 months ago | Adds word-level timestamps and confidence scores |
| whisper-openvino | 163 | about 2 years ago | Whisper running on OpenVINO |
| whisper.tflite | 71 | about 2 years ago | Whisper running on TensorFlow Lite |
| Whisper variants | Various Whisper variants on Hugging Face | ||
| Whisper-AT | 343 | over 1 year ago | Whisper that can recognize non-speech audio events in addition to speech |
Apps | |||
| Aiko | Audio transcription iOS and macOS app | ||
| MacWhisper | Audio transcription macOS app. (Freemium) | ||
| Whisper Memos | Audio transcription iOS app. (Freemium) | ||
| FourYou | Audio journal iOS app | ||
| Jojo Transcribe | Audio transcription macOS app | ||
| Buzz | 12,817 | 11 months ago | Audio transcription and translation macOS app |
| WhisperScript | Audio transcription macOS app. (Freemium · Electron) | ||
| Audio Podium | Audio/video management macOS app | ||
| superwhisper | Global audio transcription macOS menu bar app | ||
| Speech Note | 609 | 11 months ago | Audio transcription Linux app |
| FridayGPT | Dictation macOS app powered by OpenAI API | ||
| EasyWhisper | Windows and macOS app for audio transcription and speaker diarization. (Freemium) | ||
Web apps / Hosted | |||
| bigWav | Audio transcription and annotation tool | ||
| Free Podcast Transcription | Runs locally in your browser | ||
| Gladia | Transcription with real-time processing | ||
Web apps / Self-hosted | |||
| Subs AI | 1,332 | about 1 year ago | Subtitle generation |
| WaaS | 1,854 | 11 months ago | GUI and API for Whisper |
| writeout.ai | 1,479 | over 2 years ago | Laravel app to transcribe and translate audio files |
| Meeper | 58 | about 2 years ago | Transcriptions, summary and more for meetings and any browser tab. (Chrome app) |
CLI tools | |||
| yt-whisper | 1,373 | almost 2 years ago | YouTube subtitle generation |
| phonix | 39 | over 1 year ago | Generate captions for videos |
| whisper-standalone-win | 1,405 | 11 months ago | Standalone Windows executable for Whisper and Faster Whisper |
| whisper-ctranslate2 | 938 | 11 months ago | Whisper command-line tool based on CTranslate2, compatible with the original |
| insanely-fast-whisper-cli | 328 | over 1 year ago | Achieve transcription speeds near 30x real-time with several optimizations |
| whisper-diarization | 3,874 | 11 months ago | Automatic speech recognition with speaker diarization |
Playgrounds | |||
| Hugging Face | Whisper demo running on Hugging Face | ||
| Monster API | Whisper demo running on Monster API | ||
| Web Whisper | Whisper demo by Pluja | ||
| YouTube Video Transcription | 369 | over 1 year ago | Running on Colab |
Packages / JavaScript | |||
| use-whisper | 738 | over 1 year ago | React hook |
Articles | |||
| Whispers of A.I.'s Modular Future | The future of machine learning lies in adaptable and accessible open-source speech-transcription programs | ||
| How to Run Whisper Speech Recognition Model | Explains how to install and run the model and provides a performance analysis comparing Whisper to other models | ||
| Create your own speech to text app using Flask | The tutorial demonstrates Whisper's speech-to-text model, with a demo on running it in a Gradient Notebook and a guide for setting up a Flask app with Gradient Deployments | ||
| Convert Podcasts to Text | Tutorial on using the Whisper API with Python for speech-to-text transcription, showing how a GPU speeds up transcription | ||
Videos | |||
| Open AI's Whisper is Amazing! | Introduction to Whisper | ||
| How to do Free Speech-to-Text Transcription Better Than Google Premium API | Tutorial | ||
| Multilingual AI Speech Recognition Live App | Tutorial | ||
Community | |||
| Discussions | 72,752 | 12 months ago | |
| Discord | |||
Third-party APIs | |||
| Whisper+ | Extension of the Whisper model that adds features such as speaker identification, custom vocabulary, summarization, and chapter generation | ||
| Replicate | Use Whisper running on Replicate | ||
Related lists | |||
| awesome-chatgpt | 5,229 | 11 months ago | ChatGPT resources |