KAUST-Whisper-Adapter
Speech Recognizer Adapter
A tool for fine-tuning the OpenAI Whisper speech recognition model using residual adapters and parameter-efficient learning methods.
INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!
32 stars
4 watching
2 forks
Language: Python
Last commit: over 1 year ago
Topics: automatic-speech-recognition, parameter-efficient-learning
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A React Native binding of Whisper's automatic speech recognition model | 408 |
| | A React hook that enables real-time speech-to-text functionality using the OpenAI Whisper API | 738 |
| | Provides standalone executables for OpenAI's Whisper & Faster-Whisper speech recognition and transcription tools | 1,405 |
| | A Swift package for a C implementation of a speech recognition system | 169 |
| | An AI-powered speech recognition and translation tool that uses CTranslate2 and Faster-whisper implementations for faster and more efficient processing | 938 |
| | An optimized speech-to-text pipeline designed to improve inference speed and accuracy | 330 |
| | Transcribes YouTube videos using OpenAI's Whisper speech recognition model | 369 |
| | A toolkit for speech recognition on Arduino using C++ | 473 |
| | Provides an R interface to the Whisper Automatic Speech Recognition model | 119 |
| | An extension to the Whisper speech recognition model that adds word-level timestamps and confidence scores | 2,121 |
| | An adapter for improving large language models at object-level perception tasks with auxiliary perception modalities | 266 |
| | An implementation of Whisper's speech-to-text functionality in a real-time transcription application | 2,186 |
| | An open-source tool for real-time audio and image transcription with support for multiple languages and various applications | 404 |
| | A speech recognition system based on the DeepSpeech2 architecture | 259 |
| | Scripts and tools for Dutch speech recognition using the Kaldi toolkit | 66 |