wmd
Document distance
Calculates a measure of document similarity based on word embeddings
Word Mover's Distance, from the paper "From Word Embeddings to Document Distances" by Matthew J. Kusner et al.
538 stars
25 watching
131 forks
Language: C
Last commit: 6 months ago

Related projects:
Repository | Description | Stars |
---|---|---|
src-d/wmd-relax | Calculates distances between documents based on word embeddings | 460 |
doukremt/distance | Library for comparing sequences of characters with various distance metrics | 117 |
lrcfmd/elmd | Calculates similarity between chemical compositions based on the minimal work required to transform one distribution into another | 31 |
lexmag/simetric | Facilities to calculate the distance and similarity between strings using various algorithms | 61 |
wannesm/dtaidistance | A library that implements a fast and efficient distance measure for time series data | 1,089 |
mljs/distance-matrix | Computes distance/similarity matrices using custom distance functions | 4 |
mateusza/sqlite-levenshtein | A utility extension for computing string similarities between two sequences using the Levenshtein distance algorithm | 15 |
schneems/going_the_distance | A Ruby library implementing algorithms for calculating the cost of changing one word into another | 61 |
tcrouch/edits.cr | A collection of edit distance algorithms and similarity measures for text sequences | 16 |
ikhramts/metrify | A browser extension that converts units between imperial and metric systems | 6 |
mwgg/greatcircle | A set of reusable functions for calculating distances and bearings between geographic locations | 78 |
cwarden/sqlite-distance | Calculates distance between two geographic points using latitude and longitude coordinates | 2 |
sindresorhus/leven | A JavaScript implementation of the Levenshtein distance algorithm for measuring string similarity | 715 |
emijrp/wmcharts | Provides data-driven visualizations for Wikimedia projects' activity metrics | 3 |
universalavenue/simhash-ex | An Elixir implementation of Simhash, a distance metric for measuring semantic similarity between texts | 22 |