SoniTranslate
Video Dubber
Software that allows video translation with synchronized audio, utilizing speech-to-text and text-to-speech technologies.
Synchronized Translation for Videos. Video dubbing
869 stars
17 watching
162 forks
Language: Python
last commit: about 1 month ago asraudio-processingautomatic-dubbingdiarizationdocument-translatordubbingspeech-to-textsttsubtitle-to-speechtext-to-speechtranslate-audiotranslate-videotranslationttsvideo-dubbing
Related projects:
Repository | Description | Stars |
---|---|---|
rongjc/autosubtitle | A tool that uses AI to generate subtitles and translate them from one language to another | 17 |
showlab/vlog | Transforms video content into a long document containing visual and audio information that can be used for chat or other applications. | 538 |
tp7/sushi | Automates subtitle syncing by comparing audio patterns to align subtitles with different video sources. | 646 |
c2h2/tts | Generates .mp3 voice files from input text using the Google Translate service as a speech engine. | 93 |
m1guelpf/yt-whisper | Automates transcription and subtitle generation from YouTube videos using OpenAI's Whisper model | 1,365 |
tgotwig/vidmerger | A tool that merges multiple video files into one file with chapters and optional FPS scaling. | 126 |
ytsvetko/str2ipa | A tool for phonetic transcription of languages with close-to-phonetic writing systems | 10 |
bootphon/phonemizer | Converts text to phonetic transcriptions in multiple languages using various backends and algorithms | 1,231 |
lex4all/lex4all | Software tool to generate pronunciation lexicons for low-resource languages using speech recognition and machine learning algorithms. | 21 |
thudm/glm-4-voice | An end-to-end speech synthesis model that generates human-like speech in real-time | 2,269 |
mysteryx93/hanumaninstituteapps | A suite of tools for audio and video processing, including pitch-shifting, batch conversion, and background playback. | 142 |
rf5/transfusion-asr | An ASR project that uses diffusion models to transcribe speech | 75 |
pawurb/normit | A Node package that translates text into other languages using speech synthesis and private APIs. | 240 |
ronggong/jingjusingingphrasematching | This repository provides a software framework to match singing audio with corresponding music scores based on phonetic and duration information. | 27 |
pid/speakingurl | Creates clean, user-friendly URLs from input strings by transliterating and manipulating characters. | 1,116 |