whisper
Speech Recogniser
A general-purpose speech recognition system trained on large-scale weak supervision
Robust Speech Recognition via Large-Scale Weak Supervision
73k stars
595 watching
9k forks
Language: Python
Last commit: about 2 months ago
Linked from 3 awesome lists
Related projects:
Repository | Description | Stars |
---|---|---|
ggerganov/whisper.cpp | A high-performance inference implementation of an automatic speech recognition model in C++ | 36,332 |
mahmoudashraf97/whisper-diarization | Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks. | 3,874 |
huggingface/distil-whisper | A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy. | 3,644 |
sanchit-gandhi/whisper-jax | An optimized implementation of OpenAI's Whisper Model for speech recognition and speech-to-text tasks using JAX. | 4,467 |
m-bain/whisperx | An automatic speech recognition system with word-level timestamps and speaker diarization. | 12,894 |
const-me/whisper | An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference | 8,617 |
systran/faster-whisper | A fast speech-to-text implementation using CTranslate2 and optimized for inference on CPU and GPU. | 12,989 |
purfview/whisper-standalone-win | Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools | 1,405 |
softcatala/whisper-ctranslate2 | A speech recognition and translation tool that uses CTranslate2 and faster-whisper for faster, more efficient processing. | 938 |
mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 408 |
arthurfdlr/whisper-youtube | Transcribes YouTube videos using OpenAI's Whisper speech recognition model | 369 |
illyism/openai-whisper-api | An OpenAI speech-to-text API service built with Node.js and TypeScript, running on Docker and Google Cloud Run. | 110 |
srijith-rkr/kaust-whisper-adapter | A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. | 32 |
openai/tiktoken | A fast and efficient tokeniser for natural language models based on Byte Pair Encoding (BPE) | 12,703 |
xenova/whisper-web | An open-source speech recognition system built using machine learning models and JavaScript. | 2,651 |