aeneas

Alignment tool

Automatically aligns text with audio, producing a synchronization map

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
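
Forced alignment of this kind is commonly computed with dynamic time warping (DTW), which is also one of the project's topic tags. The sketch below is an illustration of plain DTW on toy one-dimensional sequences, not the aeneas implementation: the function name dtw_path, the absolute-difference distance, and the sample values are all assumptions made for the example. A real aligner would compare audio feature frames rather than single numbers.

```python
def dtw_path(a, b):
    """Return (total_cost, path) for a classic DTW alignment of a and b.

    Hypothetical helper for illustration only; `a` and `b` are lists of
    numbers standing in for per-frame audio features.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # local distance (assumed metric)
            cost[i][j] = d + min(
                cost[i - 1][j - 1],  # advance both sequences
                cost[i - 1][j],      # advance a only
                cost[i][j - 1],      # advance b only
            )
    # Backtrack from the end to recover which indices were matched.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
    path.reverse()
    return cost[n][m], path


# Toy input: the second sequence holds the value 1 for one extra step.
reference = [0, 1, 2, 3]
stretched = [0, 1, 1, 2, 3]
total_cost, alignment = dtw_path(reference, stretched)
print(total_cost, alignment)
```

Each pair in the returned path says which element of the first sequence lines up with which element of the second; in a text-to-audio setting those pairs are what get turned into begin and end times for each text fragment.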

GitHub

3k stars

72 watching

236 forks

Language: Python

last commit: about 1 year ago

Linked from 1 awesome list

Topics: alignment, audio, cli, dtw, espeak, espeak-ng, festival, ffmpeg, forced-alignment, linux, macos, nlp, python, smil, speech, srt, text, text-to-speech, tts, windows

www.readbeyond.it/aeneas/

Backlinks from these awesome lists:

soruly/awesome-acg

Related projects:

Repository	Description	Stars
aubio/aubio	A comprehensive library for audio and music analysis and processing.	3,336
pettarin/forced-alignment-tools	A collection of tools and resources for computing forced alignments between audio files and transcripts.	878
pytorch/audio	A PyTorch module providing tools and functions for audio signal processing	2,561
montrealcorpustools/montreal-forced-aligner	A command-line utility for aligning audio data with written text based on pronunciation rules.	1,364
machinalis/yalign	Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation	127
tp7/sushi	Automates subtitle syncing by comparing audio patterns to align subtitles with different video sources.	649
eleutherai/pythia	Analyzing knowledge development and evolution in large language models during training	2,309
laion-ai/clap	A library for learning audio embeddings from text and audio data using contrastive language-audio pretraining	1,457
smacke/ffsubsync	Automatically synchronizes subtitles with video	6,879
aixander/realtime_pyaudio_fft	An audio analysis tool that extracts and visualizes features from live audio streams using FFTs.	976
iver56/audiomentations	Library for audio data augmentation used in machine learning	1,903
eli64s/readme-ai	Automates the generation of comprehensive README files using AI-powered language models.	1,665
prosodylab/prosodylab-aligner	Tools for forced alignment of laboratory speech production data using HTK and SoX.	333
lowerquality/gentle	A tool that aligns speech audio with its text and outputs a time-aligned transcript.	1,471
pyannote/pyannote-audio	A PyTorch toolkit for speaker diarization, including speech activity detection.	6,508