whisper-diarization

ASR

Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks.

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

GitHub

4k stars

48 watching

349 forks

Language: Jupyter Notebook

last commit: 8 months ago

Linked from 1 awesome list

asrspeaker-diarizationspeechspeech-recognitionspeech-to-textwhisper

Backlinks from these awesome lists:

sindresorhus/awesome-whisper

Related projects:

Repository	Description	Stars
m-bain/whisperx	An automatic speech recognition system with word-level timestamps and speaker diarization.	12,894
openai/whisper	A general-purpose speech recognition system trained on large-scale weak supervision	72,752
ggerganov/whisper.cpp	A high-performance inference implementation of an automatic speech recognition model in C++	36,332
const-me/whisper	An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference	8,617
pyannote/pyannote-audio	A toolkit for speaker diarization using PyTorch and speech activity detection.	6,508
arthurfdlr/whisper-youtube	Transcribes Youtube videos using OpenAI's Whisper speech recognition model	369
systran/faster-whisper	A fast speech-to-text implementation using CTranslate2 and optimized for inference on CPU and GPU.	12,989
huggingface/distil-whisper	A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy.	3,644
linto-ai/whisper-timestamped	An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores.	2,121
purfview/whisper-standalone-win	Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools	1,405
collabora/whisperlive	An implementation of Whisper's speech-to-text functionality in a real-time transcription application	2,186
srijith-rkr/kaust-whisper-adapter	A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods.	32
sanchit-gandhi/whisper-jax	An optimized implementation of OpenAI's Whisper Model for speech recognition and speech-to-text tasks using JAX.	4,467
vaibhavs10/insanely-fast-whisper	A command-line tool for fast audio transcription using the Whisper AI model	7,848
mybigday/whisper.rn	A React Native binding of Whisper's automatic speech recognition model	408