SoniTranslate

Video Dubber

Software that translates videos and dubs them with synchronized audio, using speech-to-text and text-to-speech technologies.

Synchronized Translation for Videos. Video dubbing
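The description above implies a timing problem: synthesized speech in the target language rarely has the same length as the original utterance. The sketch below illustrates one common way to keep dubbed audio in sync, by speeding up each synthesized clip just enough to fit the gap before the next utterance. It is an illustration under stated assumptions, not SoniTranslate's actual implementation: the `Segment` structure, the `fit_to_slot` function, and the `max_speedup` parameter are all hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcribed utterance (hypothetical structure, not SoniTranslate's)."""
    start: float         # seconds, as reported by a speech-to-text step
    end: float           # seconds, as reported by a speech-to-text step
    text: str            # translated text to be spoken
    tts_duration: float  # seconds of audio a text-to-speech step produced


def fit_to_slot(segments, max_speedup=1.5):
    """Plan playback so each dubbed clip fits before the next utterance.

    Returns one (start, speed_factor, played_duration) tuple per segment.
    Clips are never slowed down (factor >= 1.0) and never sped up beyond
    max_speedup, so very long clips may still overrun their slot.
    """
    plan = []
    for i, seg in enumerate(segments):
        # The slot runs until the next utterance starts; the last segment
        # only gets its own original duration.
        slot_end = segments[i + 1].start if i + 1 < len(segments) else seg.end
        slot = max(slot_end - seg.start, 1e-6)  # guard against zero-length slots
        speed = min(max(seg.tts_duration / slot, 1.0), max_speedup)
        plan.append((seg.start, speed, seg.tts_duration / speed))
    return plan


if __name__ == "__main__":
    demo = [
        Segment(0.0, 2.0, "Hola a todos", 1.5),
        Segment(3.0, 5.0, "Bienvenidos al canal", 2.5),
        Segment(5.0, 6.0, "Empecemos", 4.0),
    ]
    for start, speed, played in fit_to_slot(demo):
        print(f"start={start:.2f}s speed={speed:.2f}x played={played:.2f}s")
```

Capping the speed factor trades perfect alignment for intelligibility; a real tool would also need to decide what to do with clips that still overrun, for example by shortening the translation.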

GitHub

869 stars
17 watching
162 forks
Language: Python
last commit: about 1 month ago
asr, audio-processing, automatic-dubbing, diarization, document-translator, dubbing, speech-to-text, stt, subtitle-to-speech, text-to-speech, translate-audio, translate-video, translation, tts, video-dubbing

Related projects:

Repository | Description | Stars
rongjc/autosubtitle | A tool that uses AI to generate subtitles and translate them from one language to another | 17
showlab/vlog | Transforms video content into a long document containing visual and audio information that can be used for chat or other applications | 538
tp7/sushi | Automates subtitle syncing by comparing audio patterns to align subtitles with different video sources | 646
c2h2/tts | Generates .mp3 voice files from input text using the Google Translate service as a speech engine | 93
m1guelpf/yt-whisper | Automates transcription and subtitle generation from YouTube videos using OpenAI's Whisper model | 1,365
tgotwig/vidmerger | A tool that merges multiple video files into one file with chapters and optional FPS scaling | 126
ytsvetko/str2ipa | A tool for phonetic transcription of languages with close-to-phonetic writing systems | 10
bootphon/phonemizer | Converts text to phonetic transcriptions in multiple languages using various backends and algorithms | 1,231
lex4all/lex4all | Software tool to generate pronunciation lexicons for low-resource languages using speech recognition and machine learning algorithms | 21
thudm/glm-4-voice | An end-to-end speech synthesis model that generates human-like speech in real time | 2,269
mysteryx93/hanumaninstituteapps | A suite of tools for audio and video processing, including pitch-shifting, batch conversion, and background playback | 142
rf5/transfusion-asr | An ASR project that uses diffusion models to transcribe speech | 75
pawurb/normit | A Node package that translates text into other languages using speech synthesis and private APIs | 240
ronggong/jingjusingingphrasematching | A software framework to match singing audio with corresponding music scores based on phonetic and duration information | 27
pid/speakingurl | Creates clean, user-friendly URLs from input strings by transliterating and manipulating characters | 1,116