splink
Data linker
A Python package that uses statistical models to link and deduplicate data records from datasets lacking unique identifiers.
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
1k stars
20 watching
150 forks
Language: Python
last commit: 3 days ago data-matchingdata-sciencededuplicate-datadeduplicationduckdbem-algorithmentity-resolutionfuzzy-matchingrecord-linkagesparkuk-gov-data-science
Related projects:
Repository | Description | Stars |
---|---|---|
josephfrazier/octopermalinker | A browser extension that automatically updates links to branches on GitHub | 27 |
byrnereese/linkchecker-mkdocs | Tool to validate links in static generated websites with Markdown files | 10 |
mommi84/mandolin | A system that uses Markov Logic Networks to discover links in knowledge graphs | 4 |
rafaelstz/magento2-quicklink | An extension that predicts and preloads links on subsequent pages to improve loading speed | 51 |
rafguns/linkpred | A tool for predicting links in networks | 141 |
rescribet/link-redux | A JavaScript library and React component suite for rendering Linked Data in web applications. | 37 |
blake-regalia/linked-data.syntaxes | A package of syntax highlighters for various linked data formats | 30 |
arbazkiraak/burpblh | An extension for Burp Suite to identify broken links in web responses | 55 |
ged/linkparser | An interface to parse and analyze English sentences using the CMU Link Grammar | 76 |
arbazkiraak/linksdumper | A tool that extracts and filters links from web responses | 86 |
dzonatan/ngx-linky | An Angular pipe to find links in text and turn them into HTML links | 41 |
umbrelladocs/linkspector | A CLI tool that checks for dead hyperlinks in files using multiple markup languages. | 67 |
rmlio/rmlmapper-java | Executes RML rules to generate high-quality Linked Data from multiple data sources | 158 |
archi-doc/valuelink | A C# library for creating and managing flexible links between objects in code. | 8 |
remodoy/clj-postgresql | A Clojure library that provides an interface to PostgreSQL databases with support for connection parameter customization and type conversion | 161 |