Prosodylab-Aligner

Speech alignment tool

Tools for aligning laboratory speech production data to forced audio alignment using HTK and SoX.

Python interface for forced audio alignment using HTK and SoX

GitHub

333 stars
27 watching
77 forks
Language: Python
last commit: over 4 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
prosodylab/prosodylab.alignertools A package of scripts to prepare data for alignment in speech processing 12
lowerquality/gentle A tool for aligning speech with text by analyzing audio and providing an output transcript 1,471
cmesher/inuktitutalignerdata Scripts for aligning laboratory speech production data in Inuktitut 3
pettarin/forced-alignment-tools A collection of tools and resources for computing forced alignments between audio files and transcripts. 878
montrealcorpustools/montreal-forced-aligner A command-line utility for aligning audio data with written text based on pronunciation rules. 1,364
machinalis/yalign Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation 127
pku-alignment/align-anything Aligns large multimodal models with human intentions and values using various algorithms and fine-tuning methods. 270
andrewrosenberg/autobi Automated annotation tool for prosody analysis in speech recordings. 58
tp7/sushi Automates subtitle syncing by comparing audio patterns to align subtitles with different video sources. 649
pku-yuangroup/languagebind Extending pretraining models to handle multiple modalities by aligning language and video representations 751
spatialaudio/python-sounddevice Tools and bindings for playing and recording audio with Python 1,069
clab/fast_align A fast and simple unsupervised word aligner for generating parallel corpus alignments. 740
jcgood/rosetta-pangloss A Python library that uses machine learning and natural language processing to improve translation accuracy by aligning source and target languages 0
guitarbum722/align An application and library for aligning text with flexible formatting options. 84
astorfi/speechpy Provides tools and libraries for extracting speech features from audio data. 881