fast_align

Word aligner

A fast and simple unsupervised word aligner for generating parallel corpus alignments.

GitHub

738 stars
25 watching
159 forks
Language: C++
Last commit: over 2 years ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
|---|---|---|
| richardlitt/ldc-word-aligner | A tool for annotating manual word alignments in parallel texts | 2 |
| moses-smt/mgiza | A C++ implementation of a word alignment tool with multi-threading and incremental training capabilities for machine translation | 161 |
| guitarbum722/align | An application and library for aligning text with flexible formatting options | 84 |
| talschuster/crosslingualcontextualemb | Enables alignment of word embeddings across multiple languages to facilitate cross-lingual text analysis and machine learning tasks | 98 |
| machinalis/yalign | Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation | 127 |
| vim-scripts/align | A Vim plugin that provides functions to align text based on specified separators and formatting options | 134 |
| braunefe/gargantua | Software tool for aligning sentences between multiple languages | 12 |
| lowerquality/gentle | A tool for aligning speech with text by analyzing audio and providing an output transcript | 1,453 |
| josefnpat/reflowprint | A library that enables character-by-character text alignment in real-time | 46 |
| montrealcorpustools/montreal-forced-aligner | A command-line utility for aligning audio data with written text based on pronunciation rules | 1,343 |
| prosodylab/prosodylab-aligner | A Python tool for aligning audio data from laboratory speech production experiments | 331 |
| lowresourcelanguages/champollion | A toolkit providing ready-to-use parallel text sentence alignment tools for multiple language pairs | 18 |
| wbond/sublime_alignment | A Python library for aligning multi-line and multiple selections in Sublime Text | 523 |
| egtwobits/mesh_mesh_align_plus | An add-on for Blender that allows precise alignment and transformation of 3D mesh objects | 581 |
| raphael-group/paste2 | A software framework for aligning and reconstructing spatial transcriptomics data from non-overlapping samples | 29 |