wmd

Document distance

Calculates a measure of document similarity based on word embeddings

Word Mover's Distance from Matthew J Kusner's paper "From Word Embeddings to Document Distances"

GitHub

538 stars
25 watching
131 forks
Language: C
last commit: 6 months ago

Related projects:

Repository Description Stars
src-d/wmd-relax Calculates distances between documents based on word embeddings 460
doukremt/distance Library for comparing sequences of characters with various distance metrics. 117
lrcfmd/elmd Calculates similarity between chemical compositions based on the minimal work required to transform one distribution into another. 31
lexmag/simetric Facilities to calculate the distance and similarity between strings using various algorithms 61
wannesm/dtaidistance A library that implements a fast and efficient distance measure for time series data 1,089
mljs/distance-matrix Computes distance/similarity matrices using custom distance functions 4
mateusza/sqlite-levenshtein A utility extension for computing string similarities between two sequences using the Levenshtein distance algorithm 15
schneems/going_the_distance A Ruby library implementing algorithms for calculating the cost of changing one word into another 61
tcrouch/edits.cr A collection of edit distance algorithms and similarity measures for text sequences 16
ikhramts/metrify A browser extension that converts units between imperial and metric systems 6
mwgg/greatcircle A set of reusable functions for calculating distances and bearings between geographic locations 78
cwarden/sqlite-distance Calculates distance between two geographic points using latitude and longitude coordinates 2
sindresorhus/leven A JavaScript implementation of the Levenshtein distance algorithm for measuring string similarity. 715
emijrp/wmcharts Provides data-driven visualizations for Wikimedia projects' activity metrics 3
universalavenue/simhash-ex An Elixir implementation of Simhash, a distance metric for measuring semantic similarity between texts. 22