word-embeddings-benchmarks

Embedding benchmarking tool

Provides methods for evaluating word embeddings on various benchmarks

Package for evaluating word embeddings

GitHub

437 stars
20 watching
110 forks
Language: Python
last commit: almost 4 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
ermlab/polish-word-embeddings-review An evaluation framework for Polish word embeddings prepared by various research groups using analogy tasks. 4
kobkrit/tf-nlp-thai-word-embedding An implementation of a word embedding technique using TensorFlow for Thai language processing 11
embeddings-benchmark/mteb A benchmarking suite for evaluating text embedding models across various NLP tasks and datasets. 1,952
nlprinceton/text_embedding A utility class for generating and evaluating document representations using word embeddings. 54
vzhong/embeddings Provides fast and efficient word embeddings for natural language processing. 223
jwieting/paragram-word Trains word embeddings from a paraphrase database to represent semantic relationships between words. 30
binwang28/sbert-wk-sentence-embedding A method to generate sentence embeddings from pre-trained language models 177
jwieting/charagram A tool for training and using character n-gram based word and sentence embeddings in natural language processing. 125
commonsense/conceptnet-numberbatch A pre-trained word embedding model informed by a large-scale knowledge graph, providing a nuanced representation of word meanings. 1,295
krisselden/ember-macro-benchmark An Ember application benchmarking tool to measure the effects of small changes on web applications. 25
galuhsahid/indonesian-word-embedding Demonstrates word embedding in Indonesian language using pre-trained Word2vec models 20
botcenter/spanishwordembeddings This project generates Spanish word embeddings using fastText on large corpora. 9
logicalparadox/matcha A tool for designing and running benchmarking experiments in JavaScript to measure the performance of code 563
malllabiisc/wordgcn A deep learning model that generates word embeddings by predicting words based on their dependency context 290
songlab-cal/tape Provides pre-trained protein embeddings and benchmarking tools for semi-supervised learning tasks in protein biology 662