whisper-jax

Whisper model

An optimized implementation of OpenAI's Whisper Model for speech recognition and speech-to-text tasks using JAX.

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

GitHub

4k stars

43 watching

384 forks

Language: Jupyter Notebook

last commit: over 1 year ago

Linked from 1 awesome list

deep-learningjaxspeech-recognitionspeech-to-textwhisper

Backlinks from these awesome lists:

sindresorhus/awesome-whisper

Related projects:

Repository	Description	Stars
ggerganov/whisper.cpp	A high-performance inference implementation of an automatic speech recognition model in C++	36,332
systran/faster-whisper	A fast speech-to-text implementation using CTranslate2 and optimized for inference on CPU and GPU.	12,989
openai/whisper	A general-purpose speech recognition system trained on large-scale weak supervision	72,752
const-me/whisper	An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference	8,617
huggingface/distil-whisper	A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy.	3,644
vaibhavs10/insanely-fast-whisper	A command-line tool for fast audio transcription using the Whisper AI model	7,848
softcatala/whisper-ctranslate2	An AI-powered speech recognition and translation tool that utilizes CTranslate2 and Faster-whisper implementations for faster and more efficient processing.	938
jax-ml/jax	A library that provides high-performance numerical computing and machine learning capabilities.	30,744
purfview/whisper-standalone-win	Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools	1,405
mahmoudashraf97/whisper-diarization	Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks.	3,874
m-bain/whisperx	An automatic speech recognition system with word-level timestamps and speaker diarization.	12,894
xenova/whisper-web	An open-source speech recognition system built using machine learning models and JavaScript.	2,651
xai-org/grok-1	An implementation of a Mixture of Experts (MoE) model with large parameters and specialized features for natural language processing tasks.	49,727
arthurfdlr/whisper-youtube	Transcribes Youtube videos using OpenAI's Whisper speech recognition model	369
illyism/openai-whisper-api	An OpenAI speech-to-text API service built with Node.js and Typescript, running on Docker and Google Cloud Run.	110