whisper-diarization
ASR
Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
4k stars
49 watching
326 forks
Language: Jupyter Notebook
last commit: 9 days ago
Linked from 1 awesome list
asrspeaker-diarizationspeechspeech-recognitionspeech-to-textwhisper
Related projects:
Repository | Description | Stars |
---|---|---|
m-bain/whisperx | An automatic speech recognition system with word-level timestamps and speaker diarization. | 12,489 |
openai/whisper | A general-purpose speech recognition system trained on large-scale weak supervision | 71,660 |
ggerganov/whisper.cpp | A high-performance implementation of the OpenAI Whisper ASR model in C++ | 35,706 |
const-me/whisper | An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference | 8,460 |
pyannote/pyannote-audio | A toolkit for speaker diarization using PyTorch and speech activity detection. | 6,333 |
arthurfdlr/whisper-youtube | Transcribes Youtube videos using OpenAI's Whisper speech recognition model | 364 |
systran/faster-whisper | A fast speech recognition system built on top of the CTranslate2 transformer model | 12,506 |
huggingface/distil-whisper | A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy. | 3,621 |
linto-ai/whisper-timestamped | An extension of the Whisper model to predict word timestamps and confidence scores with improved accuracy | 2,045 |
purfview/whisper-standalone-win | Executable standalone versions of Whisper and Faster-Whisper speech recognition tools | 1,326 |
collabora/whisperlive | An implementation of Whisper's speech-to-text functionality in a real-time transcription application | 2,050 |
srijith-rkr/kaust-whisper-adapter | A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. | 32 |
sanchit-gandhi/whisper-jax | An optimized implementation of OpenAI's Whisper Model for speech recognition and speech-to-text tasks using JAX. | 4,444 |
vaibhavs10/insanely-fast-whisper | A command-line tool for fast audio transcription using the Whisper AI model | 7,731 |
mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 395 |