whisperX
ASR system
An automatic speech recognition system with word-level timestamps and speaker diarization.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
13k stars
137 watching
1k forks
Language: Python
last commit: 2 months ago
Linked from 1 awesome list
asrspeechspeech-recognitionspeech-to-textwhisper
Related projects:
Repository | Description | Stars |
---|---|---|
| Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks. | 3,874 |
| A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy. | 3,644 |
| A high-performance inference implementation of an automatic speech recognition model in C++ | 36,332 |
| A general-purpose speech recognition system trained on large-scale weak supervision | 72,752 |
| A command-line tool for fast audio transcription using the Whisper AI model | 7,848 |
| A fast speech-to-text implementation using CTranslate2 and optimized for inference on CPU and GPU. | 12,989 |
| An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores. | 2,121 |
| An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference | 8,617 |
| Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools | 1,405 |
| A React Native binding of Whisper's automatic speech recognition model | 408 |
| A command-line interface for fast and accurate automatic speech recognition using Whisper optimization | 328 |
| Transcribes Youtube videos using OpenAI's Whisper speech recognition model | 369 |
| An open-source toolkit for automatic speech recognition using deep learning and end-to-end training. | 6,398 |
| Provides a high-performance speech recognition system for Unity3D applications. | 445 |
| An open-source speech recognition system built using machine learning models and JavaScript. | 2,651 |