speech2text
Audio converter
Converts audio files to text using the Google Speech API
Provides a simple interface for converting audio files to text using the Google Speech-to-Text API.
363 stars
35 watching
86 forks
Language: Ruby
Last commit: over 3 years ago
Related projects:
Repository | Description | Stars |
---|---|---|
c2h2/tts | Generates .mp3 voice files from input text using the Google Translate service as a speech engine. | 93 |
ibm/max-speech-to-text-converter | Converts spoken words into text form using speech recognition technology | 76 |
adhearsion/att_speech | A Ruby library for interacting with the AT&T Speech API to convert speech and text into formats for use in applications. | 20 |
nfroidure/ttf2woff2 | Converts TTF font files to the WOFF2 format. | 302 |
lamm-mit/pdf2audio | Converts PDF files to audio content using OpenAI's GPT models and text-to-speech conversion. | 1,082 |
gen2brain/malgo | Provides a set of audio APIs for the Go programming language. | 301 |
aofdev/vue-speech-streaming | A Vue2 project providing streaming speech recognition with Google Cloud Speech API | 73 |
audiamus/aaxaudioconverter | Converts Audible proprietary .aax files to plain .mp3 or .m4a/.m4b files with various processing options and meta-tag preservation. | 1,522 |
knowsuchagency/pdf-to-podcast | Converts PDF documents to audio podcast episodes using AI-powered dialogue generation and text-to-speech models | 594 |
algolia/voice-overlay-ios | An iOS library that converts spoken words into text using speech recognition and provides a customizable UI for user input. | 545 |
laoshu133/sfnt2woff | A tool that converts font files from OTF/TTF to WOFF format. | 7 |
sophiajt/audioscope-ng2 | An Angular 2 + TypeScript demo of an audio player | 33 |
sajattack/uf2conv-rs | Converts binary files to Microsoft's UF2 format used in embedded systems | 28 |
chrisguttandin/standardized-audio-context | A cross-browser wrapper for the Web Audio API aiming to closely follow the standard. | 680 |
watson-developer-cloud/speech-to-text-nodejs | An application that converts speech to text using IBM's speech recognition service | 1,106 |