SoniTranslate
Video Dubber
Software that translates videos and dubs them with synchronized audio, using speech-to-text and text-to-speech technologies.
Synchronized Translation for Videos. Video dubbing
924 stars
17 watching
171 forks
Language: Python
last commit: about 1 year ago
Topics: asr, audio-processing, automatic-dubbing, diarization, document-translator, dubbing, speech-to-text, stt, subtitle-to-speech, text-to-speech, translate-audio, translate-video, translation, tts, video-dubbing
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A tool that uses AI to generate subtitles and translate them from one language to another | 19 |
| | Transforms video content into a long document containing visual and audio information that can be used for chat or other applications. | 545 |
| | Automates subtitle syncing by comparing audio patterns to align subtitles with different video sources. | 649 |
| | Generates .mp3 voice files from input text using the Google Translate service as a speech engine. | 93 |
| | Automates transcription and subtitle generation from YouTube videos using OpenAI's Whisper model | 1,373 |
| | A tool that merges multiple video files into one file with chapters and optional FPS scaling. | 127 |
| | A tool for phonetic transcription of languages with close-to-phonetic writing systems | 10 |
| | Converts text to phonetic transcriptions in multiple languages using various backends and algorithms | 1,249 |
| | Software tool to generate pronunciation lexicons for low-resource languages using speech recognition and machine learning algorithms. | 21 |
| | An end-to-end voice model for conversational dialogue in English | 2,467 |
| | A suite of tools for audio and video processing, including pitch-shifting, batch conversion, and background playback. | 144 |
| | An ASR project that uses diffusion models to transcribe speech | 76 |
| | A Node package that translates text into other languages using speech synthesis and private APIs. | 241 |
| | A software framework that matches singing audio to the corresponding music scores using phonetic and duration information. | 27 |
| | Creates clean, user-friendly URLs from input strings by transliterating and manipulating characters. | 1,116 |