whisperX

ASR system

An automatic speech recognition system with word-level timestamps and speaker diarization.

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

GitHub

13k stars

137 watching

1k forks

Language: Python

last commit: 8 months ago

Linked from 1 awesome list

asrspeechspeech-recognitionspeech-to-textwhisper

Backlinks from these awesome lists:

sindresorhus/awesome-whisper

Related projects:

Repository	Description	Stars
mahmoudashraf97/whisper-diarization	Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks.	3,874
huggingface/distil-whisper	A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy.	3,644
ggerganov/whisper.cpp	A high-performance inference implementation of an automatic speech recognition model in C++	36,332
openai/whisper	A general-purpose speech recognition system trained on large-scale weak supervision	72,752
vaibhavs10/insanely-fast-whisper	A command-line tool for fast audio transcription using the Whisper AI model	7,848
systran/faster-whisper	A fast speech-to-text implementation using CTranslate2 and optimized for inference on CPU and GPU.	12,989
linto-ai/whisper-timestamped	An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores.	2,121
const-me/whisper	An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference	8,617
purfview/whisper-standalone-win	Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools	1,405
mybigday/whisper.rn	A React Native binding of Whisper's automatic speech recognition model	408
ochen1/insanely-fast-whisper-cli	A command-line interface for fast and accurate automatic speech recognition using Whisper optimization	328
arthurfdlr/whisper-youtube	Transcribes Youtube videos using OpenAI's Whisper speech recognition model	369
flashlight/wav2letter	An open-source toolkit for automatic speech recognition using deep learning and end-to-end training.	6,398
macoron/whisper.unity	Provides a high-performance speech recognition system for Unity3D applications.	445
xenova/whisper-web	An open-source speech recognition system built using machine learning models and JavaScript.	2,651