aeneas
Alignment tool
Automatically synchronizes text with audio, producing a map from text fragments to time intervals
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
3k stars
72 watching
233 forks
Language: Python
last commit: 6 months ago
Linked from 1 awesome list
Topics: alignment, audio, cli, dtw, espeak, espeak-ng, festival, ffmpeg, forced-alignment, linux, macos, nlp, python, smil, speech, srt, text, text-to-speech, tts, windows
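The topics include dtw (dynamic time warping), the general technique of stretching one sequence onto another so that matching parts line up. The sketch below is a toy, pure-Python illustration of that idea on short numeric sequences. It is not aeneas's code or API: the function name `dtw_path` and the sample inputs are hypothetical, and a real aligner would compare audio feature vectors rather than plain numbers.

```python
def dtw_path(a, b):
    """Return (total_cost, path) for a classic DTW alignment of a and b.

    Hypothetical helper for illustration only; not part of aeneas.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # local distance between two samples
            cost[i][j] = d + min(
                cost[i - 1][j - 1],  # advance in both sequences
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
            )
    # Backtrack from the end to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
    path.reverse()
    return cost[n][m], path


if __name__ == "__main__":
    reference = [0, 1, 2, 3]    # stand-in for one feature sequence
    recorded = [0, 0, 1, 2, 3]  # same shape, with the first sample held longer
    total, path = dtw_path(reference, recorded)
    print(total, path)
```

Each pair in the returned path says which index of the first sequence corresponds to which index of the second. In a forced-alignment setting, index pairs like these would be converted into timestamps for each text fragment.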
Related projects:
Repository | Description | Stars |
---|---|---|
aubio/aubio | A comprehensive library for audio and music analysis and processing. | 3,336 |
pettarin/forced-alignment-tools | A collection of tools and resources for computing forced alignments between audio files and transcripts. | 878 |
pytorch/audio | A PyTorch module providing tools and functions for audio signal processing | 2,561 |
montrealcorpustools/montreal-forced-aligner | A command-line utility for aligning audio data with written text based on pronunciation rules. | 1,364 |
machinalis/yalign | Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation | 127 |
tp7/sushi | Automates subtitle syncing by comparing audio patterns to align subtitles with different video sources. | 649 |
eleutherai/pythia | Analyzing knowledge development and evolution in large language models during training | 2,309 |
laion-ai/clap | A library for learning audio embeddings from text and audio data using contrastive language-audio pretraining | 1,457 |
smacke/ffsubsync | Automatically synchronizes subtitles with video | 6,879 |
aixander/realtime_pyaudio_fft | An audio analysis tool that extracts and visualizes features from live audio streams using FFTs. | 976 |
iver56/audiomentations | Library for audio data augmentation used in machine learning | 1,903 |
eli64s/readme-ai | Automates the generation of comprehensive README files using AI-powered language models. | 1,665 |
prosodylab/prosodylab-aligner | A tool for forced alignment of laboratory speech data, matching audio recordings to their transcripts. | 333 |
lowerquality/gentle | A tool for aligning speech with text by analyzing audio and producing a time-aligned transcript | 1,471 |
pyannote/pyannote-audio | A PyTorch toolkit for speaker diarization, including speech activity detection. | 6,508 |