Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Splink | 939 | 2 | 3 months ago | 119 | November 14, 2023 | 167 | mit | Python | ||
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends | ||||||||||
Spark Lucenerdd | 127 | 4 months ago | 39 | June 02, 2021 | 36 | apache-2.0 | Scala | |||
Spark RDD with Lucene's query and entity linkage capabilities | ||||||||||
Dblink | 38 | 3 years ago | 4 | other | Scala | |||||
Distributed Bayesian Entity Resolution in Apache Spark | ||||||||||
Spark Matcher | 27 | 6 months ago | 5 | gpl-2.0 | Python | |||||
Record matching and entity resolution at scale in Spark | ||||||||||
Whakapai | 22 | 3 months ago | 1 | Python | ||||||
Various Python Data Science Projects available in PyPi | ||||||||||
Spark Lucenerdd Examples | 15 | 7 months ago | 2 | apache-2.0 | Scala | |||||
Examples of spark-lucenerdd | ||||||||||
Atyimo | 7 | 5 years ago | mit | Python | ||||||
Splink_graph | 6 | a year ago | 41 | March 14, 2022 | 8 | mit | HTML | |||
pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other domains) |