awesome-whisper

Speech toolkit

A curated list of tools and resources for building speech-to-text applications

🔊 Awesome list for Whisper, an open-source AI-powered speech recognition system developed by OpenAI

GitHub: 1k stars, 24 watching, 61 forks, last commit 7 months ago

ai, artificial-intelligence, awesome, awesome-list, gpt, openai, speech-to-text, transcription

Official

Introduction
Source code 72,752 17 days ago
White paper

Model variants

Whisper.cpp 36,332 1 day ago Port of Whisper in C++

Model variants / Whisper.cpp

Bindings for many languages 36,332 1 day ago

Model variants

WhisperX 12,894 about 19 hours ago Adds fast automatic speech recognition with word-level timestamps and speaker diarization
faster-whisper 12,989 6 days ago Faster reimplementation of Whisper using CTranslate2
Whisper JAX 4,460 9 months ago JAX implementation of Whisper for up to 70x speed-up on TPU
whisper-timestamped 2,121 11 days ago Adds word-level timestamps and confidence scores
whisper-openvino 163 about 1 year ago Whisper running on OpenVINO
whisper.tflite 71 over 1 year ago Whisper running on TensorFlow Lite
Whisper variants Various Whisper variants on Hugging Face
Whisper-AT 341 10 months ago Whisper that can recognize non-speech audio events in addition to speech

Apps

Aiko Audio transcription iOS and macOS app
MacWhisper Audio transcription macOS app. (Freemium)
Whisper Memos Audio transcription iOS app. (Freemium)
FourYou Audio journal iOS app
Jojo Transcribe Audio transcription macOS app
Buzz 12,817 2 days ago Audio transcription and translation macOS app
WhisperScript Audio transcription macOS app. (Freemium · Electron)
Audio Podium Audio/video management macOS app
superwhisper Global audio transcription macOS menu bar app
Speech Note 609 about 7 hours ago Audio transcription Linux app
FridayGPT Dictation macOS app powered by OpenAI API
EasyWhisper Windows and macOS app for audio transcription and speaker diarization. (Freemium)

Web apps / Hosted

bigWav Audio transcription and annotation tool
Free Podcast Transcription Runs locally in your browser
Gladia Transcription with real-time processing

Web apps / Self-hosted

Subs AI 1,332 about 2 months ago Subtitle generation
WaaS 1,852 12 days ago GUI and API for Whisper
writeout.ai 1,479 almost 2 years ago Laravel app to transcribe and translate audio files
Meeper 58 over 1 year ago Transcriptions, summary and more for meetings and any browser tab. (Chrome app)

CLI tools

yt-whisper 1,373 11 months ago YouTube subtitle generation
phonix 39 7 months ago Generate captions for videos
whisper-standalone-win 1,385 8 days ago Standalone Windows executable for Whisper and Faster Whisper
whisper-ctranslate2 931 15 days ago Whisper command-line tool based on CTranslate2, compatible with the original
insanely-fast-whisper-cli 328 6 months ago Achieve transcription speeds near 30x real-time with several optimizations
whisper-diarization 3,874 2 days ago Automatic speech recognition with speaker diarization

Playgrounds

Hugging Face Whisper demo running on Hugging Face
Monster API Whisper demo running on Monster API
Web Whisper Whisper demo by Pluja
YouTube Video Transcription 369 8 months ago Running on Colab

Packages / JavaScript

use-whisper 738 8 months ago React hook

Articles

Whispers of A.I.'s Modular Future The future of machine learning lies in adaptable and accessible open-source speech-transcription programs
How to Run Whisper Speech Recognition Model Explains how to install and run the model, and provides a performance analysis comparing Whisper to other models
Create your own speech to text app using Flask The tutorial demonstrates Whisper's speech-to-text model, with a demo on running it in a Gradient Notebook and a guide for setting up a Flask app with Gradient Deployments
Convert Podcasts to Text Tutorial on using the Whisper API with Python for speech-to-text transcription, showing faster transcription on a GPU

Videos

Open AI's Whisper is Amazing! Introduction to Whisper
How to do Free Speech-to-Text Transcription Better Than Google Premium API Tutorial
Multilingual AI Speech Recognition Live App Tutorial

Community

Discussions 72,752 17 days ago
Discord

Third-party APIs

Whisper+ Extension of the Whisper model that adds features such as speaker identification, custom vocabulary, summarization, and chapter generation
Replicate Use Whisper running on Replicate
awesome-chatgpt 5,229 6 days ago ChatGPT resources
