KAUST-Whisper-Adapter

Speech Recognizer Adapter

A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods.

INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!

GitHub

32 stars

4 watching

2 forks

Language: Python

last commit: almost 2 years ago

automatic-speech-recognitionparameter-efficient-learning

Related projects:

Repository	Description	Stars
mybigday/whisper.rn	A React Native binding of Whisper's automatic speech recognition model	408
chengsokdara/use-whisper	A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API	738
purfview/whisper-standalone-win	Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools	1,405
ggerganov/whisper.spm	A Swift package for C implementation of a speech recognition system	169
softcatala/whisper-ctranslate2	An AI-powered speech recognition and translation tool that utilizes CTranslate2 and Faster-whisper implementations for faster and more efficient processing.	938
shashikg/whispers2t	An optimized speech-to-text pipeline designed to improve inference speed and accuracy	330
arthurfdlr/whisper-youtube	Transcribes Youtube videos using OpenAI's Whisper speech recognition model	369
arjo129/uspeech	A toolkit for speech recognition on Arduino using C++	473
bnosac/audio.whisper	Provides an R interface to the Whisper Automatic Speech Recognition model	119
linto-ai/whisper-timestamped	An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores.	2,121
shi-labs/vcoder	An adapter for improving large language models at object-level perception tasks with auxiliary perception modalities	266
collabora/whisperlive	An implementation of Whisper's speech-to-text functionality in a real-time transcription application	2,186
sharrnah/whispering	An open-source tool for real-time audio and image transcription with support for multiple languages and various applications	404
seannaren/deepspeech.torch	A speech recognition system based on the DeepSpeech2 architecture	259
opensource-spraakherkenning-nl/kaldi_nl	This project provides scripts and tools for speech recognition in Dutch using the Kaldi toolkit	66