WhisperS2T
ASR pipeline
An optimized speech-to-text pipeline designed to improve inference speed and accuracy
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
310 stars
14 watching
31 forks
Language: Jupyter Notebook
last commit: 3 months ago
Linked from 1 awesome list
asrdeep-learningspeech-recognitionspeech-to-texttensorrttensorrt-llmvadvoice-activity-detectionwhisper
Related projects:
Repository | Description | Stars |
---|---|---|
linto-ai/whisper-timestamped | An extension of the Whisper model to predict word timestamps and confidence scores with improved accuracy | 2,045 |
mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 395 |
bnosac/audio.whisper | Provides an R interface to the Whisper Automatic Speech Recognition model | 118 |
srijith-rkr/kaust-whisper-adapter | A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. | 32 |
ochen1/insanely-fast-whisper-cli | A command-line interface for fast and accurate automatic speech recognition using Whisper optimization | 322 |
rf5/transfusion-asr | An ASR project that uses diffusion models to transcribe speech | 75 |
chengsokdara/use-whisper | A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API | 733 |
macoron/whisper.unity | Provides a high-performance speech recognition system for Unity3D applications. | 433 |
purfview/whisper-standalone-win | Executable standalone versions of Whisper and Faster-Whisper speech recognition tools | 1,326 |
ggerganov/whisper.spm | A Swift package for C implementation of a speech recognition system | 169 |
arthurfdlr/whisper-youtube | Transcribes Youtube videos using OpenAI's Whisper speech recognition model | 364 |
sandrohanea/whisper.net | An open-source speech-to-text library built on top of Whisper Models for cross-platform support. | 582 |
zhuzilin/whisper-openvino | Fork of Whisper ASR with OpenVINO backend for improved performance | 161 |
collabora/whisperlive | An implementation of Whisper's speech-to-text functionality in a real-time transcription application | 2,050 |
sharrnah/whispering | An open-source tool for real-time audio and image transcription with support for multiple languages and various applications | 401 |