word-embedding-dimensionality-selection

Word embedding dimensionality selector

An implementation of a dimensionality selection procedure for word embeddings produced by various algorithms.

Paper: On the Dimensionality of Word Embedding

GitHub

329 stars
11 watching
44 forks
Language: Python
Last commit: over 4 years ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
|---|---|---|
| vyraun/half-size | An algorithm to reduce word embeddings to a specified size while maintaining performance | 129 |
| epierson9/zifa | An algorithm for dimensionality reduction in single-cell data with applications in genomics and bioinformatics | 108 |
| vzhong/embeddings | Provides fast and efficient word embeddings for natural language processing | 223 |
| plasticityai/magnitude | A fast and efficient utility package for utilizing vector embeddings in machine learning models | 1,635 |
| chengyuegongr/frequency-agnostic | Improves word embeddings by training with adversarial objectives | 118 |
| dsv77/hashembedding | Software component providing efficient word representation using hash embeddings | 42 |
| ziyuw/rembo | An optimization algorithm that uses Bayesian methods and random embeddings to solve complex problems in high-dimensional spaces | 113 |
| zhezhaoa/ngram2vec | A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics | 848 |
| beringresearch/ivis | A dimensionality reduction framework using a Siamese Neural Network to visualize high-dimensional datasets | 332 |
| zhangxiann/skip-gram | A Python implementation of a neural network model for learning word embeddings from text data | 6 |
| seomoz/word2gauss | This implementation provides a way to represent words as multivariate Gaussian distributions, allowing scalable word embeddings | 190 |
| malllabiisc/wordgcn | A deep learning model that generates word embeddings by predicting words based on their dependency context | 291 |
| zjulearning/graph_level_drug_discovery | A Python project that uses machine learning to improve the representation of molecules in drug discovery | 60 |
| jwieting/charagram | A tool for training and using character n-gram based word and sentence embeddings in natural language processing | 125 |
| dfki-interactive-machine-learning/arasif | Provides sentence embeddings for Arabic languages using pre-trained word embeddings and Smooth Inverse Frequency algorithm | 5 |