KAUST-Whisper-Adapter

Speech Recognizer Adapter

A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods.

INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!

GitHub

32 stars
4 watching
2 forks
Language: Python
last commit: about 1 year ago
automatic-speech-recognitionparameter-efficient-learning

Related projects:

Repository Description Stars
mybigday/whisper.rn A React Native binding of Whisper's automatic speech recognition model 395
chengsokdara/use-whisper A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API 733
purfview/whisper-standalone-win Executable standalone versions of Whisper and Faster-Whisper speech recognition tools 1,326
ggerganov/whisper.spm A Swift package for C implementation of a speech recognition system 169
softcatala/whisper-ctranslate2 A Whisper client compatible with the CTranslate2 model, providing faster transcription and translation capabilities than OpenAI Whisper 914
shashikg/whispers2t An optimized speech-to-text pipeline designed to improve inference speed and accuracy 310
arthurfdlr/whisper-youtube Transcribes Youtube videos using OpenAI's Whisper speech recognition model 362
arjo129/uspeech A toolkit for speech recognition on Arduino using C++ 473
bnosac/audio.whisper Provides an R interface to the Whisper Automatic Speech Recognition model 118
linto-ai/whisper-timestamped An extension of the Whisper model to predict word timestamps and confidence scores with improved accuracy 2,045
shi-labs/vcoder An adapter for improving large language models at object-level perception tasks with auxiliary perception modalities 261
collabora/whisperlive An implementation of Whisper's speech-to-text functionality in a real-time transcription application 2,050
sharrnah/whispering An open-source tool for real-time audio and image transcription with support for multiple languages and various applications 401
seannaren/deepspeech.torch A speech recognition system based on the DeepSpeech2 architecture 259
opensource-spraakherkenning-nl/kaldi_nl This project provides scripts and tools for speech recognition in Dutch using the Kaldi toolkit 66