mat2vec

Materials Embeddings

Unsupervised word embeddings capture latent knowledge from materials science literature

Supplementary Materials for Tshitoyan et al. "Unsupervised word embeddings capture latent knowledge from materials science literature", Nature (2019).

GitHub

619 stars
40 watching
180 forks
Language: Python
last commit: over 1 year ago

Related projects:

Repository Description Stars
materialsvirtuallab/maml A toolkit for machine learning in materials science, enabling the development of predictive models and simulations. 369
tca19/dict2vec A framework to learn word embeddings using lexical dictionaries 115
plasticityai/magnitude A fast and efficient utility package for utilizing vector embeddings in machine learning models 1,627
satwikkottur/visualword2vec Learning word embeddings from abstract images to improve language understanding 19
wikipedia2vec/wikipedia2vec A tool for learning vector representations of words and entities from Wikipedia text data. 940
malllabiisc/wordgcn A deep learning model that generates word embeddings by predicting words based on their dependency context 290
zhezhaoa/ngram2vec A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics 846
vzhong/embeddings Provides fast and efficient word embeddings for natural language processing. 223
materialsproject/matbench Provides tools and resources for testing machine learning performance on materials science data 122
largelymfs/topical_word_embeddings A codebase implementing topical word embeddings using various NLP techniques as demonstrated in a paper accepted by AAAI'15. 315
gink03/alt-i2v An implementation of a deep learning-based image representation learning approach using a modified fully connected layer and transfer learning from VGG16 34
vefstathiou/so_word2vec This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks. 40
antoine77340/howto100m Provides code and tools for learning joint text-video embeddings using the HowTo100M dataset 250
bigredt/vico Multi-sense word embeddings learned from visual cooccurrences 25
cod3licious/conec A library for training and evaluating a type of word embedding model that extends the original Word2Vec algorithm 20