insanely-fast-whisper
Audio Transcriber
A command-line tool for fast audio transcription using the Whisper AI model
8k stars
67 watching
551 forks
Language: Jupyter Notebook
Last commit: 6 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
systran/faster-whisper | A fast speech-to-text implementation using CTranslate2 and optimized for inference on CPU and GPU. | 12,989 |
m-bain/whisperx | An automatic speech recognition system with word-level timestamps and speaker diarization. | 12,894 |
const-me/whisper | An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference | 8,617 |
sanchit-gandhi/whisper-jax | An optimized implementation of OpenAI's Whisper model for speech recognition and speech-to-text tasks using JAX. | 4,460 |
huggingface/distil-whisper | A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy. | 3,644 |
ggerganov/whisper.cpp | A high-performance inference implementation of an automatic speech recognition model in C++ | 36,332 |
openai/whisper | A general-purpose speech recognition system trained on large-scale weak supervision | 72,752 |
mahmoudashraf97/whisper-diarization | Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks. | 3,874 |
xenova/whisper-web | An open-source speech recognition system built using machine learning models and JavaScript. | 2,651 |
purfview/whisper-standalone-win | Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools | 1,405 |
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 8,011 |
rhasspy/piper | A fast local neural text-to-speech system optimized for small devices | 7,002 |
leetcode-mafia/cheetah | An AI-powered macOS app designed to assist users during remote software engineering interviews | 4,103 |
facebookresearch/metaseq | A codebase for working with Open Pre-trained Transformers, enabling deployment and fine-tuning of transformer models on various platforms. | 6,519 |
hyperoslo/whisper | A UI component library for displaying messages and notifications in iOS apps with customizable sounds, colors, and fonts | 3,755 |