insanely-fast-whisper

Audio Transcriber

A command-line tool for fast audio transcription using the Whisper AI model

8k stars

67 watching

551 forks

Language: Jupyter Notebook

last commit: about 2 years ago

Linked from 1 awesome list

Backlinks from these awesome lists:

filipecalegario/awesome-generative-ai

Related projects:

Repository	Description	Stars
systran/faster-whisper	A fast speech-to-text implementation using CTranslate2 and optimized for inference on CPU and GPU.	12,989
m-bain/whisperx	An automatic speech recognition system with word-level timestamps and speaker diarization.	12,894
const-me/whisper	An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference	8,617
sanchit-gandhi/whisper-jax	An optimized implementation of OpenAI's Whisper Model for speech recognition and speech-to-text tasks using JAX.	4,467
huggingface/distil-whisper	A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy.	3,644
ggerganov/whisper.cpp	A high-performance inference implementation of an automatic speech recognition model in C++	36,332
openai/whisper	A general-purpose speech recognition system trained on large-scale weak supervision	72,752
mahmoudashraf97/whisper-diarization	Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks.	3,874
xenova/whisper-web	An open-source speech recognition system built using machine learning models and JavaScript.	2,651
purfview/whisper-standalone-win	Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools	1,405
sjtu-ipads/powerinfer	An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs	8,011
rhasspy/piper	A fast local neural text-to-speech system optimized for small devices	7,002
leetcode-mafia/cheetah	An AI-powered macOS app designed to assist users during remote software engineering interviews	4,103
facebookresearch/metaseq	A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms.	6,519
hyperoslo/whisper	A UI component library for displaying messages and notifications in iOS apps with customizable sounds, colors, and fonts	3,755