audio.whisper

ASR library

Provides an R interface to the Whisper Automatic Speech Recognition model

Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R

GitHub

118 stars
4 watching
13 forks
Language: C
last commit: about 1 month ago

Related projects:

Repository Description Stars
mybigday/whisper.rn A React Native binding of Whisper's automatic speech recognition model 395
linto-ai/whisper-timestamped An extension of the Whisper model to predict word timestamps and confidence scores with improved accuracy 2,045
arthurfdlr/whisper-youtube Transcribes Youtube videos using OpenAI's Whisper speech recognition model 362
ggerganov/whisper.spm A Swift package for C implementation of a speech recognition system 169
purfview/whisper-standalone-win Executable standalone versions of Whisper and Faster-Whisper speech recognition tools 1,326
chengsokdara/use-whisper A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API 733
sandrohanea/whisper.net An open-source speech-to-text library built on top of Whisper Models for cross-platform support. 582
macoron/whisper.unity Provides a high-performance speech recognition system for Unity3D applications. 433
shashikg/whispers2t An optimized speech-to-text pipeline designed to improve inference speed and accuracy 310
ochen1/insanely-fast-whisper-cli A command-line interface for fast and accurate automatic speech recognition using Whisper optimization 322
srijith-rkr/kaust-whisper-adapter A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. 32
rf5/transfusion-asr An ASR project that uses diffusion models to transcribe speech 75
silentsignal/burp-asn1 An ASN.1 toolbox for parsing and decoding ASN.1 data in Burp Suite 2
yuangongnd/whisper-at An audio processing model that adds audio event tagging capabilities to an existing speech recognition system with minimal additional computational cost. 321
softcatala/whisper-ctranslate2 A Whisper client compatible with the CTranslate2 model, providing faster transcription and translation capabilities than OpenAI Whisper 914