word-embedding-dimensionality-selection

Word embedding dimensionality selector

An implementation of dimensionality selection procedure for word embeddings in various algorithms.

On the Dimensionality of Word Embedding

GitHub

329 stars
12 watching
44 forks
Language: Python
last commit: over 4 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
vyraun/half-size An algorithm to reduce word embeddings to a specified size while maintaining performance 128
epierson9/zifa An algorithm for reducing dimensionality of single-cell gene expression data while handling zero-inflation 107
vzhong/embeddings Provides fast and efficient word embeddings for natural language processing. 223
plasticityai/magnitude A fast and efficient utility package for utilizing vector embeddings in machine learning models 1,627
chengyuegongr/frequency-agnostic Improves word embeddings by using adversarial training to make them less dependent on word frequencies 118
dsv77/hashembedding Software component providing efficient word representation using hash embeddings 42
ziyuw/rembo An optimization algorithm that uses Bayesian methods and random embeddings to solve complex problems in high-dimensional spaces. 113
zhezhaoa/ngram2vec A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics 846
beringresearch/ivis A dimensionality reduction framework using a Siamese Neural Network to visualize high-dimensional datasets 331
zhangxiann/skip-gram A Python implementation of a neural network model for learning word embeddings from text data 6
seomoz/word2gauss This implementation provides a way to represent words as multivariate Gaussian distributions, allowing scalable word embeddings. 190
malllabiisc/wordgcn A deep learning model that generates word embeddings by predicting words based on their dependency context 290
zjulearning/graph_level_drug_discovery A Python project that uses machine learning to improve the representation of molecules in drug discovery 60
jwieting/charagram A tool for training and using character n-gram based word and sentence embeddings in natural language processing. 125
dfki-interactive-machine-learning/arasif Provides sentence embeddings for Arabic languages using pre-trained word embeddings and Smooth Inverse Frequency algorithm 5