gentle

Speech aligner

A tool for aligning speech with text by analyzing audio and providing an output transcript

gentle forced aligner

GitHub

1k stars
44 watching
296 forks
Language: Python
last commit: 7 months ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
prosodylab/prosodylab-aligner A Python tool for aligning laboratory speech data by forced alignment using HTK and SoX. 332
montrealcorpustools/montreal-forced-aligner A command-line utility for aligning audio data with written text based on pronunciation rules. 1,349
cmesher/inuktitutalignerdata A set of scripts for aligning laboratory speech production data using prosodylab-Aligner 3
machinalis/yalign Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation 127
richardlitt/ldc-word-aligner A tool for annotating word alignments in parallel texts 2
prosodylab/prosodylab.alignertools A collection of scripts to prepare data for use in a speech analysis tool by cleaning and formatting audio and transcription files. 12
lowresourcelanguages/champollion A toolkit providing ready-to-use parallel text sentence alignment tools for multiple language pairs. 18
clab/fast_align A fast and simple unsupervised word aligner for generating parallel corpus alignments. 739
pettarin/forced-alignment-tools A collection of tools and resources for computing forced alignments between audio files and transcripts. 875
guitarbum722/align An application and library for aligning text with flexible formatting options. 84
thudm/longalign A framework for training and evaluating large language models on long context inputs 223
sergree/matchering An audio matching and mastering tool that uses machine learning to adapt the sound of one track to match another 1,810
talschuster/crosslingualcontextualemb Enables alignment of word embeddings across multiple languages to facilitate cross-lingual text analysis and machine learning tasks 98
jcgood/rosetta-pangloss A Python library that uses machine learning and natural language processing to improve translation accuracy by aligning source and target languages 0
raphael-group/paste2 A probabilistic alignment method for spatial transcriptomics experiments to reconstruct overlapping data slices 29