CrossLingualContextualEmb

Language embedding aligner

Aligns word embeddings across multiple languages to support cross-lingual text analysis and machine learning tasks.

Cross-Lingual Alignment of Contextual Word Embeddings
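As a rough illustration of what "aligning" two embedding spaces can mean, here is a minimal sketch of orthogonal Procrustes alignment, one widely used way to map one embedding space onto another. This listing does not say which method the repository uses, so treat the technique as an assumption. The function name `procrustes_align`, the synthetic data, and the dimensions are all hypothetical.

```python
import numpy as np


def procrustes_align(src, tgt):
    """Return the orthogonal matrix W that minimizes ||src @ W - tgt||_F.

    src and tgt are (n_pairs, dim) arrays whose rows are embeddings of
    translation pairs (hypothetical anchors for this sketch).
    """
    # Closed-form solution: SVD of the cross-covariance matrix.
    u, _, vt = np.linalg.svd(src.T @ tgt)
    return u @ vt


# Synthetic data (an assumption for illustration): the "source language"
# space is an exact rotation of the "target language" space.
rng = np.random.default_rng(0)
tgt = rng.normal(size=(50, 8))
rotation, _ = np.linalg.qr(rng.normal(size=(8, 8)))  # random orthogonal matrix
src = tgt @ rotation.T

w = procrustes_align(src, tgt)
aligned = src @ w  # source embeddings mapped into the target space
```

In this toy setup the mapping recovers the target vectors up to floating-point error. With real embeddings the fit is only approximate, and its quality depends on the translation pairs used as anchors.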

GitHub

99 stars

8 watching

9 forks

Language: Python

Last commit: over 6 years ago

Linked from 1 awesome list

allennlp, bert, contextual-embeddings, crosslingual, elmo, nlp, pytorch, wordembeddings, zeroshot-learning

Backlinks from these awesome lists:

separius/awesome-sentence-embedding

Related projects:

Repository	Description	Stars
hit-scir/elmoformanylangs	Provides pre-trained ELMo representations for multiple languages to improve NLP tasks.	1,462
botcenter/spanishwordembeddings	This project generates Spanish word embeddings using fastText on large corpora.	9
babylonhealth/fasttext_multilingual	A repository providing aligned multilingual word vectors for 78 languages using the SVD method.	1,197
pedrada88/crossembeddings-twitter	Pre-trained word embeddings from Twitter for natural language processing tasks.	14
pku-yuangroup/languagebind	Extending pretraining models to handle multiple modalities by aligning language and video representations.	751
tanloong/interlaced.nvim	A plugin for aligning bilingual parallel texts by re-positioning text and applying highlighting.	7
lowresourcelanguages/champollion	A toolkit providing ready-to-use parallel text sentence alignment tools for multiple language pairs.	18
ukplab/arxiv2018-xling-sentence-embeddings	Reproducible research on cross-lingual sentence embeddings using power mean word embeddings.	186
machinalis/yalign	Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation.	127
clab/fast_align	A fast and simple unsupervised word aligner for generating parallel corpus alignments.	740
bheinzerling/bpemb	A collection of pre-trained subword embeddings in 275 languages, useful for natural language processing tasks.	1,189
harsh19/spine	Transforms existing word embeddings into more interpretable ones by applying a novel extension of k-sparse autoencoder with stricter sparsity constraints.	52
cmesher/inuktitutalignerdata	Scripts for aligning laboratory speech production data in Inuktitut.	3
guitarbum722/align	An application and library for aligning text with flexible formatting options.	84
malllabiisc/wordgcn	A deep learning model that generates word embeddings by predicting words based on their dependency context	291