awesome-whisper
Speech toolkit
A curated list of tools and resources for building speech-to-text applications
Awesome list for Whisper, an open-source AI-powered speech recognition system developed by OpenAI
1k stars
24 watching
61 forks
last commit: 7 months ago
Linked from 2 awesome lists
ai, artificial-intelligence, awesome, awesome-list, gpt, openai, speech-to-text, transcription
Official | |||
Introduction | |||
Source code | 72,752 | 17 days ago | |
White paper | |||
Model variants | |||
Whisper.cpp | 36,332 | 1 day ago | Port of Whisper in C++ |
Model variants / Whisper.cpp | |||
Bindings for many languages | 36,332 | 1 day ago | |
Model variants | |||
WhisperX | 12,894 | about 19 hours ago | Fast automatic speech recognition with word-level timestamps and speaker diarization |
faster-whisper | 12,989 | 6 days ago | Faster reimplementation of Whisper using CTranslate2 |
Whisper JAX | 4,460 | 9 months ago | JAX implementation of Whisper for up to 70x speed-up on TPU |
whisper-timestamped | 2,121 | 11 days ago | Adds word-level timestamps and confidence scores |
whisper-openvino | 163 | about 1 year ago | Whisper running on OpenVINO |
whisper.tflite | 71 | over 1 year ago | Whisper running on TensorFlow Lite |
Whisper variants | Various Whisper variants on Hugging Face | ||
Whisper-AT | 341 | 10 months ago | Whisper that can recognize non-speech audio events in addition to speech |
Apps | |||
Aiko | Audio transcription iOS and macOS app | ||
MacWhisper | Audio transcription macOS app. (Freemium) | ||
Whisper Memos | Audio transcription iOS app. (Freemium) | ||
FourYou | Audio journal iOS app | ||
Jojo Transcribe | Audio transcription macOS app | ||
Buzz | 12,817 | 2 days ago | Audio transcription and translation macOS app |
WhisperScript | Audio transcription macOS app. (Freemium · Electron) | ||
Audio Podium | Audio/video management macOS app | ||
superwhisper | Global audio transcription macOS menu bar app | ||
Speech Note | 609 | about 7 hours ago | Audio transcription Linux app |
FridayGPT | Dictation macOS app powered by OpenAI API | ||
EasyWhisper | Windows and macOS app for audio transcription and speaker diarization. (Freemium) | ||
Web apps / Hosted | |||
bigWav | Audio transcription and annotation tool | ||
Free Podcast Transcription | Runs locally in your browser | ||
Gladia | Transcription with real-time processing | ||
Web apps / Self-hosted | |||
Subs AI | 1,332 | about 2 months ago | Subtitle generation |
WaaS | 1,852 | 12 days ago | GUI and API for Whisper |
writeout.ai | 1,479 | almost 2 years ago | Laravel app to transcribe and translate audio files |
Meeper | 58 | over 1 year ago | Transcription, summaries, and more for meetings and any browser tab. (Chrome app) |
CLI tools | |||
yt-whisper | 1,373 | 11 months ago | YouTube subtitle generation |
phonix | 39 | 7 months ago | Generate captions for videos |
whisper-standalone-win | 1,385 | 8 days ago | Standalone Windows executable for Whisper and Faster Whisper |
whisper-ctranslate2 | 931 | 15 days ago | Whisper command-line tool based on CTranslate2, compatible with the original |
insanely-fast-whisper-cli | 328 | 6 months ago | Achieve transcription speeds near 30x real-time with several optimizations |
whisper-diarization | 3,874 | 2 days ago | Automatic speech recognition with speaker diarization |
Playgrounds | |||
Hugging Face | Whisper demo running on Hugging Face. | ||
Monster API | Whisper demo running on Monster API. | ||
Web Whisper | Whisper demo by Pluja. | ||
YouTube Video Transcription | 369 | 8 months ago | Running on Colab |
Packages / JavaScript | |||
use-whisper | 738 | 8 months ago | React hook |
Articles | |||
Whispers of A.I.'s Modular Future | The future of machine learning lies in adaptable and accessible open-source speech-transcription programs | ||
How to Run Whisper Speech Recognition Model | Explains how to install and run the model, and provides a performance analysis comparing Whisper to other models | ||
Create your own speech to text app using Flask | Tutorial on Whisper's speech-to-text model, with a demo of running it in a Gradient Notebook and a guide to setting up a Flask app with Gradient Deployments | ||
Convert Podcasts to Text | Tutorial on using the Whisper API with Python for speech-to-text transcription, including faster transcription on a GPU | ||
Videos | |||
Open AI's Whisper is Amazing! | Introduction to Whisper | ||
How to do Free Speech-to-Text Transcription Better Than Google Premium API | Tutorial | ||
Multilingual AI Speech Recognition Live App | Tutorial | ||
Community | |||
Discussions | 72,752 | 17 days ago | |
Discord | |||
Third-party APIs | |||
Whisper+ | Extension of the Whisper model that adds features such as speaker identification, custom vocabulary, summarization, and chapter generation | ||
Replicate | Use Whisper running on Replicate | ||
Related lists | |||
awesome-chatgpt | 5,229 | 6 days ago | ChatGPT resources |