WhisperS2T

ASR pipeline

An optimized speech-to-text pipeline designed to improve inference speed and accuracy

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

GitHub

310 stars
14 watching
31 forks
Language: Jupyter Notebook
last commit: 3 months ago
Linked from 1 awesome list

Topics: asr, deep-learning, speech-recognition, speech-to-text, tensorrt, tensorrt-llm, vad, voice-activity-detection, whisper

Related projects:

| Repository | Description | Stars |
|---|---|---|
| linto-ai/whisper-timestamped | An extension of the Whisper model to predict word timestamps and confidence scores with improved accuracy | 2,045 |
| mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 395 |
| bnosac/audio.whisper | Provides an R interface to the Whisper Automatic Speech Recognition model | 118 |
| srijith-rkr/kaust-whisper-adapter | A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods | 32 |
| ochen1/insanely-fast-whisper-cli | A command-line interface for fast and accurate automatic speech recognition using Whisper optimization | 322 |
| rf5/transfusion-asr | An ASR project that uses diffusion models to transcribe speech | 75 |
| chengsokdara/use-whisper | A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API | 733 |
| macoron/whisper.unity | Provides a high-performance speech recognition system for Unity3D applications | 433 |
| purfview/whisper-standalone-win | Executable standalone versions of Whisper and Faster-Whisper speech recognition tools | 1,326 |
| ggerganov/whisper.spm | A Swift package for a C implementation of a speech recognition system | 169 |
| arthurfdlr/whisper-youtube | Transcribes YouTube videos using OpenAI's Whisper speech recognition model | 364 |
| sandrohanea/whisper.net | An open-source speech-to-text library built on top of Whisper models for cross-platform support | 582 |
| zhuzilin/whisper-openvino | Fork of Whisper ASR with OpenVINO backend for improved performance | 161 |
| collabora/whisperlive | An implementation of Whisper's speech-to-text functionality in a real-time transcription application | 2,050 |
| sharrnah/whispering | An open-source tool for real-time audio and image transcription with support for multiple languages and various applications | 401 |