aeneas

Alignment tool

Automatically synchronizes text and audio to create a synchronized alignment

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

GitHub

3k stars
72 watching
233 forks
Language: Python
last commit: 6 months ago
Linked from 1 awesome list

alignmentaudioclidtwespeakespeak-ngfestivalffmpegforced-alignmentlinuxmacosnlppythonsmilspeechsrttexttext-to-speechttswindows

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
aubio/aubio A comprehensive library for audio and music analysis and processing. 3,336
pettarin/forced-alignment-tools A collection of tools and resources for computing forced alignments between audio files and transcripts. 878
pytorch/audio A PyTorch module providing tools and functions for audio signal processing 2,561
montrealcorpustools/montreal-forced-aligner A command-line utility for aligning audio data with written text based on pronunciation rules. 1,364
machinalis/yalign Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation 127
tp7/sushi Automates subtitle syncing by comparing audio patterns to align subtitles with different video sources. 649
eleutherai/pythia Analyzing knowledge development and evolution in large language models during training 2,309
laion-ai/clap A library for learning audio embeddings from text and audio data using contrastive language-audio pretraining 1,457
smacke/ffsubsync Automatically synchronizes subtitles with video 6,879
aixander/realtime_pyaudio_fft An audio analysis tool that extracts and visualizes features from live audio streams using FFTs. 976
iver56/audiomentations Library for audio data augmentation used in machine learning 1,903
eli64s/readme-ai Automates the generation of comprehensive README files using AI-powered language models. 1,665
prosodylab/prosodylab-aligner A tool for aligning laboratory speech data by forcing audio signals to match speech contours. 333
lowerquality/gentle A tool for aligning speech with text by analyzing audio and providing an output transcript 1,471
pyannote/pyannote-audio A toolkit for speaker diarization using PyTorch and speech activity detection. 6,508