Prosodylab-Aligner
Speech alignment tool
Tools for forced alignment of laboratory speech production data, built on HTK and SoX.
Python interface for forced audio alignment using HTK and SoX
333 stars
27 watching
77 forks
Language: Python
Last commit: over 4 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
prosodylab/prosodylab.alignertools | A package of scripts to prepare data for alignment in speech processing | 12 |
lowerquality/gentle | A tool for aligning speech with text by analyzing audio and providing an output transcript | 1,471 |
cmesher/inuktitutalignerdata | Scripts for aligning laboratory speech production data in Inuktitut | 3 |
pettarin/forced-alignment-tools | A collection of tools and resources for computing forced alignments between audio files and transcripts. | 878 |
montrealcorpustools/montreal-forced-aligner | A command-line utility for aligning audio data with written text based on pronunciation rules. | 1,364 |
machinalis/yalign | Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation | 127 |
pku-alignment/align-anything | Aligns large multimodal models with human intentions and values using various algorithms and fine-tuning methods. | 270 |
andrewrosenberg/autobi | Automated annotation tool for prosody analysis in speech recordings. | 58 |
tp7/sushi | Automates subtitle syncing by comparing audio patterns to align subtitles with different video sources. | 649 |
pku-yuangroup/languagebind | Extending pretraining models to handle multiple modalities by aligning language and video representations | 751 |
spatialaudio/python-sounddevice | Tools and bindings for playing and recording audio with Python | 1,069 |
clab/fast_align | A fast and simple unsupervised word aligner for generating parallel corpus alignments. | 740 |
jcgood/rosetta-pangloss | A Python library that uses machine learning and natural language processing to improve translation accuracy by aligning source and target languages | 0 |
guitarbum722/align | An application and library for aligning text with flexible formatting options. | 84 |
astorfi/speechpy | Provides tools and libraries for extracting speech features from audio data. | 881 |