fuzzy_match
Record matcher
A tool for finding similar records in large datasets using string similarity and regular expression rules.
Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similarity) and Levenshtein Distance internally.
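To make the two measures concrete, here is a minimal pure-Ruby sketch. It is not the gem's own code: the method names `bigrams`, `dice_coefficient`, `levenshtein`, and `best_match` are hypothetical, and ranking by Dice's coefficient first with Levenshtein distance as a tie-breaker is an assumption made for illustration.

```ruby
# Illustrative sketch only; names and ranking order are assumptions,
# not the fuzzy_match gem's API or implementation.

# Split a string into overlapping two-character pairs (case-insensitive).
def bigrams(str)
  str.downcase.each_char.each_cons(2).map(&:join)
end

# Dice's coefficient: 2 * shared pairs / (pairs in a + pairs in b).
# Ranges from 0.0 (nothing shared) to 1.0 (identical pair multisets).
def dice_coefficient(a, b)
  pairs_a = bigrams(a)
  pairs_b = bigrams(b)
  return 0.0 if pairs_a.empty? || pairs_b.empty?

  remaining = pairs_b.dup
  shared = pairs_a.count do |pair|
    idx = remaining.index(pair)
    remaining.delete_at(idx) if idx # consume each matched pair once
  end
  2.0 * shared / (pairs_a.size + pairs_b.size)
end

# Levenshtein distance: minimum number of single-character insertions,
# deletions, and substitutions turning a into b (case-sensitive here).
def levenshtein(a, b)
  prev = (0..b.length).to_a
  a.each_char.with_index(1) do |char_a, i|
    curr = [i]
    b.each_char.with_index(1) do |char_b, j|
      cost = char_a == char_b ? 0 : 1
      curr << [prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost].min
    end
    prev = curr
  end
  prev.last
end

# Pick the haystack entry with the highest Dice score; on a tie,
# prefer the smaller Levenshtein distance.
def best_match(needle, haystack)
  haystack.max_by do |candidate|
    [dice_coefficient(needle, candidate), -levenshtein(needle, candidate)]
  end
end
```

With these definitions, `dice_coefficient("night", "nacht")` is 0.25 (one shared pair, `ht`, out of eight), `levenshtein("kitten", "sitting")` is 3, and `best_match("Shamus", %w[seamus andy ben])` returns `"seamus"`.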
676 stars
10 watching
46 forks
Language: Ruby
last commit: almost 4 years ago
Linked from 1 awesome list
Related projects:
| Description | Stars |
|---|---|
| A lightweight JavaScript library for searching similar strings in an array of objects | 226 |
| A Python library providing algorithms and encoding schemes for approximate string matching. | 2,075 |
| A fuzzy string matching library that performs approximate string matching and likely misspelling detection | 1,371 |
| An optimized Levenshtein implementation for fast fuzzy matching and string comparison in .NET. | 250 |
| An implementation of the ambiguous matching algorithm from Baeza-Yates et al. | 2 |
| A C++ library for fast string matching using the Levenshtein Distance algorithm | 250 |
| A JavaScript implementation of a fuzzy string matching algorithm | 152 |
| A JavaScript library to improve fuzzy matching in web controls by leveraging a modified Quicksilver scoring algorithm. | 296 |
| A toolset for searching and indexing strings in Ruby with fuzzy matching capabilities | 23 |
| Library implementing Jaro-Winkler distance algorithm for fuzzy string matching in Ruby | 285 |
| Package for joining tables based on inexact matching | 673 |
| Facilities to calculate the distance and similarity between strings using various algorithms | 61 |
| A library for comparing expected values against reference data to ensure consistency during development and testing. | 2,293 |
| An algorithm implementation for efficient multi-string pattern matching using trie data structures. | 25 |
| A utility for simple string matching with wildcard patterns | 539 |