elpis

Speech Recognition Model Builder

A tool that enables language workers to build speech recognition models using multiple systems, including Kaldi and Huggingface Transformers.

🙊 software for creating speech recognition models.

GitHub

152 stars
15 watching
33 forks
Language: Python
last commit: 6 months ago
Linked from 1 awesome list

automatic-speech-recognitioncomputational-linguisticsdockerkaldilinguisticspythontranscription

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
awni/speech A PyTorch implementation of end-to-end speech recognition models. 754
opensource-spraakherkenning-nl/kaldi_nl This project provides scripts and tools for speech recognition in Dutch using the Kaldi toolkit 66
dodohow1011/speechadvreprogram Developing low-resource speech command recognition systems using adversarial reprogramming and transfer learning 18
nvlabs/eagle Develops high-resolution multimodal LLMs by combining vision encoders and various input resolutions 539
minimaxir/automl-gs Automates machine learning model creation and optimization for complex datasets 1,853
deepgram/kur A system for quickly building and applying state-of-the-art deep learning models to new problems 817
lxtgh/omg-seg Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. 1,300
arjo129/uspeech A toolkit for speech recognition on Arduino using C++ 473
dfki-nlp/gevalm Evaluates German transformer language models with syntactic agreement tests 7
thecodrr/vspeech Provides an interface to Mozilla's DeepSpeech TensorFlow-based Speech-to-Text library using V bindings. 50
egorsmkv/speech-recognition-uk A collection of speech recognition and synthesis models and tools for Ukrainian 342
jgreenemi/parris Automates the setup and training of machine learning algorithms on remote servers 316
ibm/max-speech-to-text-converter Converts spoken words into text form using speech recognition technology 76
keras-team/keras-hub Provides pre-trained models and building blocks for natural language processing, computer vision, audio, and multimodal tasks 797
alankbi/detecto A Python package for building and deploying computer vision models with PyTorch 613