dedupe

id A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

GitHub

4k stars
120 watching
550 forks
Language: Python
last commit: 5 days ago
Linked from 2 awesome lists

clusteringdatamadede-duplicatingdedupededupe-libraryentity-resolutionpythonpython-libraryrecord-linkage

Backlinks from these awesome lists: