CrossLingualContextualEmb
Language embedding aligner
Enables alignment of word embeddings across multiple languages to facilitate cross-lingual text analysis and machine learning tasks
Cross-Lingual Alignment of Contextual Word Embeddings
98 stars
8 watching
9 forks
Language: Python
last commit: almost 5 years ago
Linked from 1 awesome list
allennlpbertcontextual-embeddingscrosslingualelmonlppytorchwordembeddingszeroshot-learning
Related projects:
Repository | Description | Stars |
---|---|---|
hit-scir/elmoformanylangs | Provides pre-trained ELMo representations for multiple languages to improve NLP tasks. | 1,463 |
botcenter/spanishwordembeddings | This project generates Spanish word embeddings using fastText on large corpora. | 9 |
babylonhealth/fasttext_multilingual | A repository providing aligned multilingual word vectors for 78 languages using the SVD method. | 1,197 |
pedrada88/crossembeddings-twitter | Pre-trained word embeddings from Twitter for natural language processing tasks | 14 |
pku-yuangroup/languagebind | Extending pretraining models to handle multiple modalities by aligning language and video representations | 723 |
tanloong/interlaced.nvim | Aligns bilingual parallel texts by repositioning lines. | 6 |
lowresourcelanguages/champollion | A toolkit providing ready-to-use parallel text sentence alignment tools for multiple language pairs. | 18 |
ukplab/arxiv2018-xling-sentence-embeddings | Reproducible research on cross-lingual sentence embeddings using power mean word embeddings | 185 |
machinalis/yalign | Automates the process of extracting parallel sentences from comparable corpora to aid in statistical machine translation | 127 |
clab/fast_align | A fast and simple unsupervised word aligner for generating parallel corpus alignments. | 738 |
bheinzerling/bpemb | A collection of pre-trained subword embeddings in 275 languages, useful for natural language processing tasks. | 1,184 |
harsh19/spine | Transforms existing word embeddings into more interpretable ones by applying a novel extension of k-sparse autoencoder with stricter sparsity constraints | 52 |
cmesher/inuktitutalignerdata | Tools for aligning laboratory speech production data | 3 |
guitarbum722/align | An application and library for aligning text with flexible formatting options. | 84 |
malllabiisc/wordgcn | A deep learning model that generates word embeddings by predicting words based on their dependency context | 290 |