audio.whisper
ASR library
Provides an R interface to the Whisper Automatic Speech Recognition model
Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R
119 stars
4 watching
13 forks
Language: C
last commit: 4 months ago Related projects:
Repository | Description | Stars |
---|---|---|
mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 408 |
linto-ai/whisper-timestamped | An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores. | 2,121 |
arthurfdlr/whisper-youtube | Transcribes Youtube videos using OpenAI's Whisper speech recognition model | 369 |
ggerganov/whisper.spm | A Swift package for C implementation of a speech recognition system | 169 |
purfview/whisper-standalone-win | Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools | 1,405 |
chengsokdara/use-whisper | A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API | 738 |
sandrohanea/whisper.net | A .NET implementation of OpenAI Whisper models for speech recognition and text-to-speech conversion. | 601 |
macoron/whisper.unity | Provides a high-performance speech recognition system for Unity3D applications. | 445 |
shashikg/whispers2t | An optimized speech-to-text pipeline designed to improve inference speed and accuracy | 330 |
ochen1/insanely-fast-whisper-cli | A command-line interface for fast and accurate automatic speech recognition using Whisper optimization | 328 |
srijith-rkr/kaust-whisper-adapter | A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods. | 32 |
rf5/transfusion-asr | An ASR project that uses diffusion models to transcribe speech | 76 |
silentsignal/burp-asn1 | An ASN.1 toolbox for parsing and decoding ASN.1 data in Burp Suite | 2 |
yuangongnd/whisper-at | An audio processing model that adds audio event tagging capabilities to an existing speech recognition system with minimal additional computational cost. | 343 |
softcatala/whisper-ctranslate2 | An AI-powered speech recognition and translation tool that utilizes CTranslate2 and Faster-whisper implementations for faster and more efficient processing. | 938 |