insanely-fast-whisper

Audio Transcriber

A command-line tool for fast audio transcription using the Whisper AI model

GitHub

8k stars
66 watching
545 forks
Language: Jupyter Notebook
last commit: 5 months ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
systran/faster-whisper | A fast reimplementation of OpenAI's Whisper speech recognition model built on the CTranslate2 inference engine | 12,506
m-bain/whisperx | An automatic speech recognition system with word-level timestamps and speaker diarization | 12,489
const-me/whisper | An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference | 8,460
sanchit-gandhi/whisper-jax | An optimized JAX implementation of OpenAI's Whisper model for speech recognition and speech-to-text tasks | 4,444
huggingface/distil-whisper | A distilled variant of OpenAI's Whisper that transcribes audio to text at high speed and with good accuracy | 3,613
ggerganov/whisper.cpp | A high-performance implementation of the OpenAI Whisper ASR model in C++ | 35,706
openai/whisper | A general-purpose speech recognition system trained on large-scale weak supervision | 71,257
mahmoudashraf97/whisper-diarization | Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks | 3,718
xenova/whisper-web | An open-source speech recognition system built using machine learning models and JavaScript | 2,578
purfview/whisper-standalone-win | Standalone executable versions of the Whisper and Faster-Whisper speech recognition tools | 1,326
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 7,964
rhasspy/piper | A fast local neural text-to-speech system optimized for small devices | 6,576
leetcode-mafia/cheetah | An AI-powered macOS app designed to assist users during remote software engineering interviews | 4,083
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms | 6,515
hyperoslo/whisper | A UI component library for displaying messages and notifications in iOS apps with customizable sounds, colors, and fonts | 3,755