whisper-jax
Whisper model
An optimized implementation of OpenAI's Whisper Model for speech recognition and speech-to-text tasks using JAX.
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
4k stars
43 watching
384 forks
Language: Jupyter Notebook
last commit: 11 months ago
Linked from 1 awesome list
deep-learningjaxspeech-recognitionspeech-to-textwhisper
Related projects:
Repository | Description | Stars |
---|---|---|
| A high-performance inference implementation of an automatic speech recognition model in C++ | 36,332 |
| A fast speech-to-text implementation using CTranslate2 and optimized for inference on CPU and GPU. | 12,989 |
| A general-purpose speech recognition system trained on large-scale weak supervision | 72,752 |
| An implementation of OpenAI's Whisper ASR model using DirectCompute for GPGPU inference | 8,617 |
| A machine learning model that uses audio input to generate text transcriptions at high speeds and with good accuracy. | 3,644 |
| A command-line tool for fast audio transcription using the Whisper AI model | 7,848 |
| An AI-powered speech recognition and translation tool that utilizes CTranslate2 and Faster-whisper implementations for faster and more efficient processing. | 938 |
| A library that provides high-performance numerical computing and machine learning capabilities. | 30,744 |
| Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools | 1,405 |
| Automates speaker diarization from audio recordings using OpenAI Whisper ASR and additional neural networks. | 3,874 |
| An automatic speech recognition system with word-level timestamps and speaker diarization. | 12,894 |
| An open-source speech recognition system built using machine learning models and JavaScript. | 2,651 |
| An implementation of a Mixture of Experts (MoE) model with large parameters and specialized features for natural language processing tasks. | 49,727 |
| Transcribes Youtube videos using OpenAI's Whisper speech recognition model | 369 |
| An OpenAI speech-to-text API service built with Node.js and Typescript, running on Docker and Google Cloud Run. | 110 |