whisper-diarization

ASR

Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks.

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

GitHub

4k stars
48 watching
349 forks
Language: Jupyter Notebook
last commit: 2 days ago
Linked from 1 awesome list

asrspeaker-diarizationspeechspeech-recognitionspeech-to-textwhisper

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
m-bain/whisperx An automatic speech recognition system with word-level timestamps and speaker diarization. 12,894
openai/whisper A general-purpose speech recognition system trained on large-scale weak supervision 72,752
ggerganov/whisper.cpp A high-performance inference implementation of an automatic speech recognition model in C++ 36,332
const-me/whisper An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference 8,617
pyannote/pyannote-audio A toolkit for speaker diarization using PyTorch and speech activity detection. 6,508
arthurfdlr/whisper-youtube Transcribes Youtube videos using OpenAI's Whisper speech recognition model 369
systran/faster-whisper A fast speech-to-text implementation using CTranslate2 and optimized for inference on CPU and GPU. 12,989
huggingface/distil-whisper A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy. 3,644
linto-ai/whisper-timestamped An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores. 2,121
purfview/whisper-standalone-win Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools 1,405
collabora/whisperlive An implementation of Whisper's speech-to-text functionality in a real-time transcription application 2,186
srijith-rkr/kaust-whisper-adapter A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. 32
sanchit-gandhi/whisper-jax An optimized implementation of OpenAI's Whisper Model for speech recognition and speech-to-text tasks using JAX. 4,460
vaibhavs10/insanely-fast-whisper A command-line tool for fast audio transcription using the Whisper AI model 7,848
mybigday/whisper.rn A React Native binding of Whisper's automatic speech recognition model 408