dedupe
A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
4k stars
120 watching
550 forks
Language: Python
last commit: 5 days ago
Linked from 2 awesome lists
clusteringdatamadede-duplicatingdedupededupe-libraryentity-resolutionpythonpython-libraryrecord-linkage