champollion
Sentence aligner
A toolkit providing ready-to-use parallel text sentence alignment tools for multiple language pairs.
Import of https://sourceforge.net/projects/champollion
18 stars
5 watching
8 forks
Language: Perl
Last commit: over 8 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
machinalis/yalign | Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation | 127 |
tanloong/interlaced.nvim | Aligns bilingual parallel texts by repositioning lines | 6 |
talschuster/crosslingualcontextualemb | Enables alignment of word embeddings across multiple languages to facilitate cross-lingual text analysis and machine learning tasks | 98 |
braunefe/gargantua | Software tool for aligning sentences between multiple languages | 12 |
jcgood/rosetta-pangloss | A Python library that uses machine learning and natural language processing to improve translation accuracy by aligning source and target languages | 0 |
lowerquality/gentle | A tool for aligning speech with text by analyzing audio and providing an output transcript | 1,453 |
montrealcorpustools/montreal-forced-aligner | A command-line utility for aligning audio data with written text based on pronunciation rules | 1,343 |
povilasjurcys/alignment | A Ruby library implementing an alignment algorithm for corpus linguistics | 1 |
lowresourcelanguages/hltdi-morphology | Provides morphological analysis tools for various languages, including verb and noun generation, based on archived web pages | 5 |
zhoux85/staligner | Tool for aligning and integrating spatially resolved transcriptomics data using machine learning algorithms | 29 |
clab/fast_align | A fast and simple unsupervised word aligner for generating parallel corpus alignments | 738 |
cmesher/inuktitutalignerdata | Tools for aligning laboratory speech production data | 3 |
richardlitt/ldc-word-aligner | A tool for annotating manual word alignments in parallel texts | 2 |
dbremner/peg-sharp | Automates C# code generation for arbitrary parsing expression grammars | 3 |
guitarbum722/align | An application and library for aligning text with flexible formatting options | 84 |