Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Libpostal | 3,897 | | | | 3 months ago | | | 315 | mit | C |
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data. | ||||||||||
Dedupe | 3,879 | 39 | 10 | 4 months ago | 174 | February 17, 2023 | 72 | mit | Python | |
A Python library for accurate and scalable fuzzy matching, record deduplication and entity resolution. | ||||||||||
Splink | 939 | 2 | 3 months ago | 119 | November 14, 2023 | 167 | mit | Python | ||
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends | ||||||||||
Recordlinkage | 808 | 9 | 3 | 9 months ago | 23 | July 20, 2023 | 57 | bsd-3-clause | Python | |
A powerful and modular toolkit for record linkage and duplicate detection in Python | ||||||||||
Talisman | 666 | 1,135 | 48 | a year ago | 30 | January 21, 2021 | 80 | mit | JavaScript | |
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript. | ||||||||||
Csvdedupe | 398 | | | | 4 years ago | | | 21 | other | Python |
Command line tool for deduplicating CSV files | ||||||||||
Data Matching Software | 329 | | | | 5 months ago | | | 8 | | |
A list of free data matching and record linkage software. | ||||||||||
Dedupe Examples | 306 | | | | 2 years ago | | | 7 | mit | Python |
Examples for using the dedupe library | ||||||||||
Spark Lucenerdd | 127 | | | | 3 months ago | 39 | June 02, 2021 | 36 | apache-2.0 | Scala |
Spark RDD with Lucene's query and entity linkage capabilities | ||||||||||
Entity Embed | 98 | | | | 2 years ago | 6 | July 16, 2021 | | mit | Jupyter Notebook |
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors. |