KAUST-Whisper-Adapter
Speech Recognizer Adapter
A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods.
INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!
32 stars
4 watching
2 forks
Language: Python
last commit: about 1 year ago
Tags: automatic-speech-recognition, parameter-efficient-learning
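The description mentions residual adapters as the parameter-efficient method. As a rough illustration only, and not the repository's actual code, a bottleneck adapter computes `x + W_up * relu(W_down * x)`: a small trainable module added next to frozen model weights. The sketch below uses plain Python lists to stay dependency-free. The helper names (`make_adapter`, `adapter_forward`, `count_params`), the sizes, and the zero initialization of the up-projection are assumptions chosen for the example, and biases are left out for brevity.

```python
import random


def make_adapter(d_model, bottleneck, seed=0):
    """Create weights for a bottleneck adapter (hypothetical helper)."""
    rng = random.Random(seed)
    # Down-projection, shape (bottleneck, d_model): small random values.
    w_down = [[rng.gauss(0.0, 0.02) for _ in range(d_model)]
              for _ in range(bottleneck)]
    # Up-projection, shape (d_model, bottleneck): zeros, so the adapter
    # initially leaves its input unchanged (an assumed, common choice).
    w_up = [[0.0] * bottleneck for _ in range(d_model)]
    return {"w_down": w_down, "w_up": w_up}


def matvec(w, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def adapter_forward(adapter, x):
    """Return x + W_up * relu(W_down * x)."""
    hidden = [max(0.0, h) for h in matvec(adapter["w_down"], x)]
    update = matvec(adapter["w_up"], hidden)
    # Residual connection: the adapter output is added to the input.
    return [xi + ui for xi, ui in zip(x, update)]


def count_params(adapter):
    """Count the trainable values in the adapter."""
    return sum(len(row) for w in adapter.values() for row in w)


adapter = make_adapter(d_model=8, bottleneck=2)
x = [0.5, -1.0, 2.0, 0.0, 1.5, -0.5, 0.25, 3.0]
y = adapter_forward(adapter, x)
```

With these toy sizes the adapter holds 2 * 8 * 2 = 32 values, against 8 * 8 = 64 for a single full square projection of the same width. The gap widens as the model dimension grows relative to the bottleneck, which is where the parameter savings come from.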
Related projects:
Repository | Description | Stars |
---|---|---|
mybigday/whisper.rn | A React Native binding of Whisper's automatic speech recognition model | 395 |
chengsokdara/use-whisper | A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API | 733 |
purfview/whisper-standalone-win | Executable standalone versions of Whisper and Faster-Whisper speech recognition tools | 1,326 |
ggerganov/whisper.spm | A Swift package for a C implementation of a speech recognition system | 169 |
softcatala/whisper-ctranslate2 | A Whisper client compatible with the CTranslate2 model, providing faster transcription and translation capabilities than OpenAI Whisper | 914 |
shashikg/whispers2t | An optimized speech-to-text pipeline designed to improve inference speed and accuracy | 310 |
arthurfdlr/whisper-youtube | Transcribes YouTube videos using OpenAI's Whisper speech recognition model | 362 |
arjo129/uspeech | A toolkit for speech recognition on Arduino using C++ | 473 |
bnosac/audio.whisper | Provides an R interface to the Whisper Automatic Speech Recognition model | 118 |
linto-ai/whisper-timestamped | An extension of the Whisper model to predict word timestamps and confidence scores with improved accuracy | 2,045 |
shi-labs/vcoder | An adapter for improving large language models at object-level perception tasks with auxiliary perception modalities | 261 |
collabora/whisperlive | A real-time transcription application built on Whisper's speech-to-text functionality | 2,050 |
sharrnah/whispering | An open-source tool for real-time audio and image transcription with support for multiple languages and various applications | 401 |
seannaren/deepspeech.torch | A speech recognition system based on the DeepSpeech2 architecture | 259 |
opensource-spraakherkenning-nl/kaldi_nl | Scripts and tools for Dutch speech recognition using the Kaldi toolkit | 66 |