CrossLingualContextualEmb

Language embedding aligner

Enables alignment of word embeddings across multiple languages to facilitate cross-lingual text analysis and machine learning tasks

Cross-Lingual Alignment of Contextual Word Embeddings

GitHub

98 stars
8 watching
9 forks
Language: Python
last commit: almost 5 years ago
Linked from 1 awesome list

allennlpbertcontextual-embeddingscrosslingualelmonlppytorchwordembeddingszeroshot-learning

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
hit-scir/elmoformanylangs Provides pre-trained ELMo representations for multiple languages to improve NLP tasks. 1,463
botcenter/spanishwordembeddings This project generates Spanish word embeddings using fastText on large corpora. 9
babylonhealth/fasttext_multilingual A repository providing aligned multilingual word vectors for 78 languages using the SVD method. 1,197
pedrada88/crossembeddings-twitter Pre-trained word embeddings from Twitter for natural language processing tasks 14
pku-yuangroup/languagebind Extending pretraining models to handle multiple modalities by aligning language and video representations 723
tanloong/interlaced.nvim Aligns bilingual parallel texts by repositioning lines. 6
lowresourcelanguages/champollion A toolkit providing ready-to-use parallel text sentence alignment tools for multiple language pairs. 18
ukplab/arxiv2018-xling-sentence-embeddings Reproducible research on cross-lingual sentence embeddings using power mean word embeddings 185
machinalis/yalign Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation 127
clab/fast_align A fast and simple unsupervised word aligner for generating parallel corpus alignments. 738
bheinzerling/bpemb A collection of pre-trained subword embeddings in 275 languages, useful for natural language processing tasks. 1,184
harsh19/spine Transforms existing word embeddings into more interpretable ones by applying a novel extension of k-sparse autoencoder with stricter sparsity constraints 52
cmesher/inuktitutalignerdata Tools for aligning laboratory speech production data 3
guitarbum722/align An application and library for aligning text with flexible formatting options. 84
malllabiisc/wordgcn A deep learning model that generates word embeddings by predicting words based on their dependency context 290