Spark Matcher

Record matching and entity resolution at scale in Spark
Alternatives To Spark Matcher
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Splink93923 months ago119November 14, 2023167mitPython
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Spark Lucenerdd127
4 months ago39June 02, 202136apache-2.0Scala
Spark RDD with Lucene's query and entity linkage capabilities
Dblink38
3 years ago4otherScala
Distributed Bayesian Entity Resolution in Apache Spark
Spark Matcher27
6 months ago5gpl-2.0Python
Record matching and entity resolution at scale in Spark
Whakapai22
3 months ago1Python
Various Python Data Science Projects available in PyPi
Spark Lucenerdd Examples15
7 months ago2apache-2.0Scala
Examples of spark-lucenerdd
Atyimo7
5 years agomitPython
Splink_graph6
a year ago41March 14, 20228mitHTML
pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other domains)
Alternatives To Spark Matcher
Select To Compare


Alternative Project Comparisons
Popular Spark Projects
Popular Record Linkage Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Spark
Deduplication
Record Linkage