mat2vec

Materials Embeddings

Unsupervised word embeddings capture latent knowledge from materials science literature

Supplementary Materials for Tshitoyan et al. "Unsupervised word embeddings capture latent knowledge from materials science literature", Nature (2019).

GitHub

624 stars
40 watching
182 forks
Language: Python
last commit: almost 2 years ago

Related projects:

Repository Description Stars
materialsvirtuallab/maml A toolkit for machine learning in materials science, enabling the development of predictive models and simulations. 376
tca19/dict2vec A framework to learn word embeddings using lexical dictionaries 115
plasticityai/magnitude A fast and efficient utility package for utilizing vector embeddings in machine learning models 1,635
satwikkottur/visualword2vec Learning word embeddings from abstract images to improve language understanding 19
wikipedia2vec/wikipedia2vec A tool for learning vector representations of words and entities from Wikipedia text data. 946
malllabiisc/wordgcn A deep learning model that generates word embeddings by predicting words based on their dependency context 291
zhezhaoa/ngram2vec A toolkit for learning high-quality word and text representations from ngram co-occurrence statistics 848
vzhong/embeddings Provides fast and efficient word embeddings for natural language processing. 223
materialsproject/matbench Provides tools and resources for testing machine learning performance on materials science data 132
largelymfs/topical_word_embeddings A Python implementation of a topical word embedding technique used in natural language processing and information retrieval. 314
gink03/alt-i2v An implementation of a deep learning-based image representation learning approach using a modified fully connected layer and transfer learning from VGG16 34
vefstathiou/so_word2vec This is a word embedding model trained on Stack Overflow posts for use in natural language processing tasks. 40
antoine77340/howto100m Provides code and tools for learning joint text-video embeddings using the HowTo100M dataset 254
bigredt/vico Multi-sense word embeddings learned from visual cooccurrences 25
cod3licious/conec A library for training and evaluating a type of word embedding model that extends the original Word2Vec algorithm 20