whisper-diarization
ASR
Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
4k stars
48 watching
349 forks
Language: Jupyter Notebook
last commit: 2 days ago
Linked from 1 awesome list
asrspeaker-diarizationspeechspeech-recognitionspeech-to-textwhisper
Related projects:
Repository | Description | Stars |
---|---|---|
m-bain/whisperx | An automatic speech recognition system with word-level timestamps and speaker diarization. | 12,894 |
openai/whisper | A general-purpose speech recognition system trained on large-scale weak supervision | 72,752 |
ggerganov/whisper.cpp | A high-performance inference implementation of an automatic speech recognition model in C++ | 36,332 |
const-me/whisper | An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference | 8,617 |
pyannote/pyannote-audio | A toolkit for speaker diarization using PyTorch and speech activity detection. | 6,508 |
arthurfdlr/whisper-youtube | Transcribes Youtube videos using OpenAI's Whisper speech recognition model | 369 |
systran/faster-whisper | A fast speech-to-text implementation using CTranslate2 and optimized for inference on CPU and GPU. | 12,989 |
huggingface/distil-whisper | A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy. | 3,644 |
linto-ai/whisper-timestamped | An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores. | 2,121 |
purfview/whisper-standalone-win | Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools | 1,405 |
collabora/whisperlive | An implementation of Whisper's speech-to-text functionality in a real-time transcription application | 2,186 |
srijith-rkr/kaust-whisper-adapter | A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. | 32 |
sanchit-gandhi/whisper-jax | An optimized implementation of OpenAI's Whisper Model for speech recognition and speech-to-text tasks using JAX. | 4,460 |
vaibhavs10/insanely-fast-whisper | A command-line tool for fast audio transcription using the Whisper AI model | 7,848 |
mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 408 |