SoniTranslate

Video Dubber

Software that allows video translation with synchronized audio, utilizing speech-to-text and text-to-speech technologies.

Synchronized Translation for Videos. Video dubbing

GitHub

924 stars

17 watching

171 forks

Language: Python

last commit: 10 months ago

asraudio-processingautomatic-dubbingdiarizationdocument-translatordubbingspeech-to-textsttsubtitle-to-speechtext-to-speechtranslate-audiotranslate-videotranslationttsvideo-dubbing

Related projects:

Repository	Description	Stars
rongjc/autosubtitle	A tool that uses AI to generate subtitles and translate them from one language to another	19
showlab/vlog	Transforms video content into a long document containing visual and audio information that can be used for chat or other applications.	545
tp7/sushi	Automates subtitle syncing by comparing audio patterns to align subtitles with different video sources.	649
c2h2/tts	Generates .mp3 voice files from input text using the Google Translate service as a speech engine.	93
m1guelpf/yt-whisper	Automates transcription and subtitle generation from YouTube videos using OpenAI's Whisper model	1,373
tgotwig/vidmerger	A tool that merges multiple video files into one file with chapters and optional FPS scaling.	127
ytsvetko/str2ipa	A tool for phonetic transcription of languages with close-to-phonetic writing systems	10
bootphon/phonemizer	Converts text to phonetic transcriptions in multiple languages using various backends and algorithms	1,249
lex4all/lex4all	Software tool to generate pronunciation lexicons for low-resource languages using speech recognition and machine learning algorithms.	21
thudm/glm-4-voice	An end-to-end voice model for conversational dialogue in English	2,467
mysteryx93/hanumaninstituteapps	A suite of tools for audio and video processing, including pitch-shifting, batch conversion, and background playback.	144
rf5/transfusion-asr	An ASR project that uses diffusion models to transcribe speech	76
pawurb/normit	A Node package that translates text into other languages using speech synthesis and private APIs.	241
ronggong/jingjusingingphrasematching	This repository provides a software framework to match singing audio with corresponding music scores based on phonetic and duration information.	27
pid/speakingurl	Creates clean, user-friendly URLs from input strings by transliterating and manipulating characters.	1,116