fuzzy_match
Record matcher
A tool for finding similar records in large datasets using string similarity and regular expression rules.
Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similiarity) and Levenshtein Distance internally.
676 stars
10 watching
46 forks
Language: Ruby
last commit: over 3 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
wouterrutgers/fuzzy-search | A lightweight JavaScript library for searching similar strings in an array of objects | 226 |
jamesturk/jellyfish | A Python library providing algorithms and encoding schemes for approximate string matching. | 2,075 |
glench/fuzzyset.js | A fuzzy string matching library that performs approximate string matching and likely mispellings detection | 1,371 |
danharltey/fastenshtein | An optimized Levenshtein implementation for fast fuzzy matching and string comparison in .NET. | 250 |
hernanmd/fuzzysearcher | An implementation of the ambiguous matching algorithm from Baeta-Yates et al. | 2 |
rapidfuzz/rapidfuzz-cpp | A C++ library for fast string matching using the Levenshtein Distance algorithm | 250 |
jhawthorn/fzy.js | A JavaScript implementation of a fuzzy string matching algorithm | 152 |
rmm5t/liquidmetal | A JavaScript library to improve fuzzy matching in web controls by leveraging a modified Quicksilver scoring algorithm. | 296 |
brianhempel/fuzzy_tools | A toolset for searching and indexing strings in Ruby with fuzzy matching capabilities | 23 |
kiyoka/fuzzy-string-match | Library implementing Jaro-Winkler distance algorithm for fuzzy string matching in Ruby | 285 |
dgrtwo/fuzzyjoin | Package for joining tables based on inexact matching | 673 |
lexmag/simetric | Facilities to calculate the distance and similarity between strings using various algorithms | 61 |
mitsuhiko/insta | A library for comparing expected values against reference data to ensure consistency during development and testing. | 2,293 |
blackrabbitt/mspm | An algorithm implementation for efficient multi-string pattern matching using trie data structures. | 25 |
sindresorhus/matcher | A utility for simple string matching with wildcard patterns | 539 |