whisper

Speech Recogniser

A general-purpose speech recognition system trained on large-scale weak supervision

Robust Speech Recognition via Large-Scale Weak Supervision

GitHub

73k stars
595 watching
9k forks
Language: Python
Last commit: about 2 months ago
Linked from 3 awesome lists


Related projects:

Repository | Description | Stars
ggerganov/whisper.cpp | A high-performance C++ inference implementation of an automatic speech recognition model | 36,332
mahmoudashraf97/whisper-diarization | Automates speaker diarization of audio recordings using OpenAI Whisper ASR and additional neural networks | 3,874
huggingface/distil-whisper | A distilled version of OpenAI's Whisper that transcribes audio to text at high speed with good accuracy | 3,644
sanchit-gandhi/whisper-jax | An optimized JAX implementation of OpenAI's Whisper model for speech recognition and speech-to-text tasks | 4,467
m-bain/whisperx | An automatic speech recognition system with word-level timestamps and speaker diarization | 12,894
const-me/whisper | An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference | 8,617
systran/faster-whisper | A fast speech-to-text implementation built on CTranslate2 and optimized for inference on CPU and GPU | 12,989
purfview/whisper-standalone-win | Standalone executables of OpenAI's Whisper and Faster-Whisper speech recognition and transcription tools | 1,405
softcatala/whisper-ctranslate2 | A speech recognition and translation tool that uses CTranslate2 and faster-whisper for faster, more efficient processing | 938
mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 408
arthurfdlr/whisper-youtube | Transcribes YouTube videos using OpenAI's Whisper speech recognition model | 369
illyism/openai-whisper-api | An OpenAI speech-to-text API service built with Node.js and TypeScript, running on Docker and Google Cloud Run | 110
srijith-rkr/kaust-whisper-adapter | A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods | 32
openai/tiktoken | A fast and efficient tokeniser for natural language models based on Byte Pair Encoding (BPE) | 12,703
xenova/whisper-web | An open-source speech recognition app that runs machine learning models in JavaScript | 2,651