SoniTranslate

Video Dubber

Software that translates videos and dubs them with synchronized audio, using speech-to-text and text-to-speech technologies.

Synchronized Translation for Videos. Video dubbing

GitHub

924 stars
17 watching
171 forks
Language: Python
Last commit: 4 months ago
asr, audio-processing, automatic-dubbing, diarization, document-translator, dubbing, speech-to-text, stt, subtitle-to-speech, text-to-speech, translate-audio, translate-video, translation, tts, video-dubbing

Related projects:

Repository | Description | Stars
rongjc/autosubtitle | A tool that uses AI to generate subtitles and translate them from one language to another. | 19
showlab/vlog | Transforms video content into a long document containing visual and audio information that can be used for chat or other applications. | 545
tp7/sushi | Automates subtitle syncing by comparing audio patterns to align subtitles with different video sources. | 649
c2h2/tts | Generates .mp3 voice files from input text using the Google Translate service as a speech engine. | 93
m1guelpf/yt-whisper | Automates transcription and subtitle generation from YouTube videos using OpenAI's Whisper model. | 1,373
tgotwig/vidmerger | A tool that merges multiple video files into one file with chapters and optional FPS scaling. | 127
ytsvetko/str2ipa | A tool for phonetic transcription of languages with close-to-phonetic writing systems. | 10
bootphon/phonemizer | Converts text to phonetic transcriptions in multiple languages using various backends and algorithms. | 1,249
lex4all/lex4all | Software tool to generate pronunciation lexicons for low-resource languages using speech recognition and machine learning algorithms. | 21
thudm/glm-4-voice | An end-to-end voice model for conversational dialogue in English. | 2,467
mysteryx93/hanumaninstituteapps | A suite of tools for audio and video processing, including pitch-shifting, batch conversion, and background playback. | 144
rf5/transfusion-asr | An ASR project that uses diffusion models to transcribe speech. | 76
pawurb/normit | A Node package that translates text into other languages using speech synthesis and private APIs. | 241
ronggong/jingjusingingphrasematching | A software framework to match singing audio with corresponding music scores based on phonetic and duration information. | 27
pid/speakingurl | Creates clean, user-friendly URLs from input strings by transliterating and manipulating characters. | 1,116