KAUST-Whisper-Adapter

Speech Recognizer Adapter

A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods.

INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!

GitHub

32 stars
4 watching
2 forks
Language: Python
last commit: over 1 year ago
automatic-speech-recognitionparameter-efficient-learning

Related projects:

Repository Description Stars
mybigday/whisper.rn A React Native binding of Whisper's automatic speech recognition model 408
chengsokdara/use-whisper A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API 738
purfview/whisper-standalone-win Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools 1,405
ggerganov/whisper.spm A Swift package for C implementation of a speech recognition system 169
softcatala/whisper-ctranslate2 An AI-powered speech recognition and translation tool that utilizes CTranslate2 and Faster-whisper implementations for faster and more efficient processing. 938
shashikg/whispers2t An optimized speech-to-text pipeline designed to improve inference speed and accuracy 330
arthurfdlr/whisper-youtube Transcribes Youtube videos using OpenAI's Whisper speech recognition model 369
arjo129/uspeech A toolkit for speech recognition on Arduino using C++ 473
bnosac/audio.whisper Provides an R interface to the Whisper Automatic Speech Recognition model 119
linto-ai/whisper-timestamped An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores. 2,121
shi-labs/vcoder An adapter for improving large language models at object-level perception tasks with auxiliary perception modalities 266
collabora/whisperlive An implementation of Whisper's speech-to-text functionality in a real-time transcription application 2,186
sharrnah/whispering An open-source tool for real-time audio and image transcription with support for multiple languages and various applications 404
seannaren/deepspeech.torch A speech recognition system based on the DeepSpeech2 architecture 259
opensource-spraakherkenning-nl/kaldi_nl This project provides scripts and tools for speech recognition in Dutch using the Kaldi toolkit 66