whisper-diarization

ASR

Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks.

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
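
As a rough illustration of how an ASR transcript and a diarization result can be combined, the sketch below assigns each timestamped word to the speaker turn it overlaps most, then merges consecutive words from the same speaker. It is not taken from this repository: the `Word` and `Turn` types, the function names, and the sample timings are all hypothetical, and a real pipeline would normally add forced alignment and punctuation handling on top.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """A transcribed word with start/end times in seconds (hypothetical type)."""
    text: str
    start: float
    end: float


@dataclass
class Turn:
    """A diarization turn: one speaker talking over a time span (hypothetical type)."""
    speaker: str
    start: float
    end: float


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two intervals (0.0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(words, turns):
    """Label each word with the speaker whose turn overlaps it the most."""
    labeled = []
    for w in words:
        best = max(
            turns,
            key=lambda t: overlap(w.start, w.end, t.start, t.end),
            default=None,
        )
        if best is None or overlap(w.start, w.end, best.start, best.end) == 0.0:
            speaker = "UNKNOWN"  # no turn covers this word
        else:
            speaker = best.speaker
        labeled.append((speaker, w.text))
    return labeled


def group_by_speaker(labeled):
    """Merge consecutive words from the same speaker into one line."""
    lines = []
    for speaker, text in labeled:
        if lines and lines[-1][0] == speaker:
            lines[-1] = (speaker, lines[-1][1] + " " + text)
        else:
            lines.append((speaker, text))
    return lines


# Made-up sample data, for illustration only.
words = [Word("hello", 0.0, 0.4), Word("there", 0.5, 0.9), Word("hi", 1.2, 1.5)]
turns = [Turn("SPEAKER_00", 0.0, 1.0), Turn("SPEAKER_01", 1.0, 2.0)]

for speaker, text in group_by_speaker(assign_speakers(words, turns)):
    print(f"{speaker}: {text}")
```

Maximum-overlap assignment is only one simple choice; a word that straddles a speaker change goes entirely to whichever turn covers more of it.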

GitHub

4k stars
49 watching
326 forks
Language: Jupyter Notebook
last commit: 9 days ago
Linked from 1 awesome list

asr, speaker-diarization, speech, speech-recognition, speech-to-text, whisper

Related projects:

| Repository | Description | Stars |
|---|---|---|
| m-bain/whisperx | An automatic speech recognition system with word-level timestamps and speaker diarization | 12,489 |
| openai/whisper | A general-purpose speech recognition system trained on large-scale weak supervision | 71,660 |
| ggerganov/whisper.cpp | A high-performance implementation of the OpenAI Whisper ASR model in C++ | 35,706 |
| const-me/whisper | An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference | 8,460 |
| pyannote/pyannote-audio | A PyTorch toolkit for speaker diarization, including speech activity detection | 6,333 |
| arthurfdlr/whisper-youtube | Transcribes YouTube videos using OpenAI's Whisper speech recognition model | 364 |
| systran/faster-whisper | A fast reimplementation of OpenAI's Whisper using CTranslate2, an inference engine for Transformer models | 12,506 |
| huggingface/distil-whisper | A distilled variant of Whisper that transcribes audio faster while keeping good accuracy | 3,621 |
| linto-ai/whisper-timestamped | An extension of the Whisper model that predicts word timestamps and confidence scores with improved accuracy | 2,045 |
| purfview/whisper-standalone-win | Standalone executable versions of the Whisper and Faster-Whisper speech recognition tools | 1,326 |
| collabora/whisperlive | A real-time transcription application built on Whisper's speech-to-text functionality | 2,050 |
| srijith-rkr/kaust-whisper-adapter | A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods | 32 |
| sanchit-gandhi/whisper-jax | An optimized JAX implementation of OpenAI's Whisper model for speech recognition and speech-to-text tasks | 4,444 |
| vaibhavs10/insanely-fast-whisper | A command-line tool for fast audio transcription using the Whisper model | 7,731 |
| mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 395 |