CrossLingualContextualEmb
Language embedding aligner
Aligns word embeddings across multiple languages to support cross-lingual text analysis and machine learning tasks.
Cross-Lingual Alignment of Contextual Word Embeddings
99 stars
8 watching
9 forks
Language: Python
last commit: about 5 years ago
Linked from 1 awesome list
Topics: allennlp, bert, contextual-embeddings, crosslingual, elmo, nlp, pytorch, wordembeddings, zeroshot-learning
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | Provides pre-trained ELMo representations for multiple languages to improve NLP tasks. | 1,462 |
| | Generates Spanish word embeddings using fastText on large corpora. | 9 |
| | Provides aligned multilingual word vectors for 78 languages using the SVD method. | 1,197 |
| | Pre-trained word embeddings from Twitter for natural language processing tasks. | 14 |
| | Extends pretraining models to handle multiple modalities by aligning language and video representations. | 751 |
| | A plugin for aligning bilingual parallel texts by re-positioning text and applying highlighting. | 7 |
| | A toolkit providing ready-to-use parallel text sentence alignment tools for multiple language pairs. | 18 |
| | Reproducible research on cross-lingual sentence embeddings using power mean word embeddings. | 186 |
| | Automates the extraction of parallel sentences from comparable corpora to aid statistical machine translation. | 127 |
| | A fast and simple unsupervised word aligner for generating parallel corpus alignments. | 740 |
| | A collection of pre-trained subword embeddings in 275 languages, useful for natural language processing tasks. | 1,189 |
| | Transforms existing word embeddings into more interpretable ones by applying a novel extension of the k-sparse autoencoder with stricter sparsity constraints. | 52 |
| | Scripts for aligning laboratory speech production data in Inuktitut. | 3 |
| | An application and library for aligning text with flexible formatting options. | 84 |
| | A deep learning model that generates word embeddings by predicting words based on their dependency context. | 291 |