textdistance

Sequence distance library

A Python library for computing distances between sequences using a variety of algorithms.

📐 Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional use of external libraries.
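
To illustrate what a "common interface" over several pure-Python algorithms can look like, here is a minimal sketch. It is not textdistance's actual code or API: the class names `Distance`, `Hamming`, and `Levenshtein` and their methods are hypothetical, chosen only to show the idea of exposing every algorithm through the same set of calls.

```python
from abc import ABC, abstractmethod
from typing import Sequence


class Distance(ABC):
    """Hypothetical base class: every algorithm exposes the same methods."""

    @abstractmethod
    def distance(self, a: Sequence, b: Sequence) -> int:
        """Return the raw distance between two sequences."""

    def maximum(self, a: Sequence, b: Sequence) -> int:
        # Upper bound on the distance for both algorithms below.
        return max(len(a), len(b))

    def similarity(self, a: Sequence, b: Sequence) -> int:
        return self.maximum(a, b) - self.distance(a, b)

    def normalized_distance(self, a: Sequence, b: Sequence) -> float:
        maximum = self.maximum(a, b)
        return self.distance(a, b) / maximum if maximum else 0.0


class Hamming(Distance):
    def distance(self, a: Sequence, b: Sequence) -> int:
        # Count mismatched positions; any length difference counts as mismatches.
        mismatches = sum(x != y for x, y in zip(a, b))
        return mismatches + abs(len(a) - len(b))


class Levenshtein(Distance):
    def distance(self, a: Sequence, b: Sequence) -> int:
        # Classic dynamic programming, keeping only the previous row.
        prev = list(range(len(b) + 1))
        for i, x in enumerate(a, 1):
            cur = [i]
            for j, y in enumerate(b, 1):
                cur.append(min(
                    prev[j] + 1,             # deletion
                    cur[j - 1] + 1,          # insertion
                    prev[j - 1] + (x != y),  # substitution (free on a match)
                ))
            prev = cur
        return prev[-1]
```

Because both classes share the same base, calling code can swap one algorithm for another without changing anything else, for example `Hamming().distance("test", "text")` versus `Levenshtein().normalized_distance("kitten", "sitting")`.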

GitHub

3k stars
63 watching
250 forks
Language: Python
last commit: 2 months ago
Linked from 2 awesome lists

Topics: algorithm, algorithms, damerau-levenshtein, damerau-levenshtein-distance, diff, distance, distance-calculation, hamming-distance, jellyfish, levenshtein, levenshtein-distance, python, textdistance

Related projects:

| Repository | Description | Stars |
|---|---|---|
| tdebatty/java-string-similarity | A Java library implementing various string similarity and distance measures. | 2,696 |
| doukremt/distance | Library for comparing sequences of characters with various distance metrics. | 117 |
| ztane/python-levenshtein | Fast string computation and similarity functions for text analysis. | 1,263 |
| roy-ht/editdistance | A fast implementation of Levenshtein distance for calculating string similarity. | 661 |
| prakhar1989/algorithms | An implementation of various algorithms and data structures in Python. | 3,053 |
| hbollon/go-edlib | A comprehensive Go library for calculating string similarity and edit distances between strings. | 481 |
| dbalatero/levenshtein-ffi | Fast string edit distance computation using the Damerau-Levenshtein algorithm. | 149 |
| globalnamesarchitecture/damerau-levenshtein | Calculates edit distance between two strings using the Damerau-Levenshtein algorithm. | 144 |
| feature23/stringsimilarity.net | A .NET port of the Java string similarity library, implementing various distance and similarity measures. | 448 |
| pkinney/distance | Provides functions to calculate distances between points and lines in 2D and 3D spaces. | 62 |
| lrcfmd/elmd | Calculates similarity between chemical compositions based on the minimal work required to transform one distribution into another. | 31 |
| tcrouch/edits.cr | A collection of edit distance algorithms and similarity measures for text sequences. | 16 |
| nektro/zig-leven | Calculates the difference between two strings using the Levenshtein distance algorithm. | 7 |
| tonytonyjan/jaro_winkler | An implementation of the Jaro-Winkler distance algorithm to compare strings. | 195 |
| mateusza/sqlite-levenshtein | An extension for computing the similarity between two strings using the Levenshtein distance algorithm. | 15 |