word-embedding-dimensionality-selection
Word embedding dimensionality selector
An implementation of a dimensionality selection procedure for word embeddings produced by various algorithms.
Paper: On the Dimensionality of Word Embedding
329 stars
12 watching
44 forks
Language: Python
Last commit: over 4 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
vyraun/half-size | An algorithm to reduce word embeddings to a specified size while maintaining performance | 128 |
epierson9/zifa | An algorithm for reducing dimensionality of single-cell gene expression data while handling zero-inflation | 107 |
vzhong/embeddings | Provides fast and efficient word embeddings for natural language processing | 223 |
plasticityai/magnitude | A fast and efficient utility package for utilizing vector embeddings in machine learning models | 1,627 |
chengyuegongr/frequency-agnostic | Improves word embeddings by using adversarial training to make them less dependent on word frequencies | 118 |
dsv77/hashembedding | Software component providing efficient word representation using hash embeddings | 42 |
ziyuw/rembo | An optimization algorithm that uses Bayesian methods and random embeddings to solve complex problems in high-dimensional spaces | 113 |
zhezhaoa/ngram2vec | A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics | 846 |
beringresearch/ivis | A dimensionality reduction framework using a Siamese Neural Network to visualize high-dimensional datasets | 331 |
zhangxiann/skip-gram | A Python implementation of a neural network model for learning word embeddings from text data | 6 |
seomoz/word2gauss | An implementation that represents words as multivariate Gaussian distributions, allowing scalable word embeddings | 190 |
malllabiisc/wordgcn | A deep learning model that generates word embeddings by predicting words based on their dependency context | 290 |
zjulearning/graph_level_drug_discovery | A Python project that uses machine learning to improve the representation of molecules in drug discovery | 60 |
jwieting/charagram | A tool for training and using character n-gram based word and sentence embeddings in natural language processing | 125 |
dfki-interactive-machine-learning/arasif | Provides sentence embeddings for Arabic languages using pre-trained word embeddings and the Smooth Inverse Frequency algorithm | 5 |