Dedupe Alternatives

Name: dedupeio/dedupe
Brand: dedupeio/dedupe
SKU: project/dedupeio/dedupe
Rating: 4.94 (3879 reviews)

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Categories > Data Processing > Deduplication

Suggest Alternative

Stars

3,879

Alternatives

License

mit

Open Issues

Most Recent Commit

over 2 years ago

Programming Language

Python

Monthly Downloads

Dependent Repos

Dependent Packages

Total Releases

174

Latest Release

February 17, 2023

Categories

Programming Languages > Python

Data Processing > Deduplication

Data Processing > Record Linkage

Site

Repo

Alternatives To dedupeio/dedupe

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
openvenues/libpostal	3,897	0	0	over 2 years ago	0		315	mit	C
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
dedupeio/dedupe	3,879	39	10	over 2 years ago	174	February 17, 2023	72	mit	Python
:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
moj-analytical-services/splink	939	0	2	over 2 years ago	119	November 14, 2023	167	mit	Python
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
J535D165/recordlinkage	808	9	3	almost 3 years ago	23	July 20, 2023	57	bsd-3-clause	Python
A powerful and modular toolkit for record linkage and duplicate detection in Python
Yomguithereal/talisman	666	1,135	48	over 3 years ago	30	January 21, 2021	80	mit	JavaScript
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
dedupeio/csvdedupe	393	0	0	over 6 years ago	0		21	other	Python
:id: Command line tool for deduplicating CSV files
J535D165/data-matching-software	329	0	0	over 2 years ago	0		8
A list of free data matching and record linkage software.
dedupeio/dedupe-examples	306	0	0	over 4 years ago	0		7	mit	Python
:id: Examples for using the dedupe library
zouzias/spark-lucenerdd	127	0	0	over 2 years ago	39	June 02, 2021	36	apache-2.0	Scala
Spark RDD with Lucene's query and entity linkage capabilities
vintasoftware/entity-embed	98	0	0	about 4 years ago	6	July 16, 2021	0	mit	Jupyter Notebook
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.

Alternatives To dedupeio/dedupe

Select To Compare

openvenues/libpostal ⭐ 3,897

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

dependent packages 0 total releases 0 most recent commit over 2 years ago

dedupeio/dedupe ⭐ 3,879

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

dependent packages 10 total releases 174 most recent commit over 2 years ago downloads badge

moj-analytical-services/splink ⭐ 939

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

dependent packages 2 total releases 119 most recent commit over 2 years ago downloads badge

J535D165/recordlinkage ⭐ 808

A powerful and modular toolkit for record linkage and duplicate detection in Python

dependent packages 3 total releases 23 most recent commit almost 3 years ago downloads badge

Yomguithereal/talisman ⭐ 666

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

dependent packages 48 total releases 30 most recent commit over 3 years ago downloads badge

dedupeio/csvdedupe ⭐ 393

:id: Command line tool for deduplicating CSV files

dependent packages 0 total releases 0 most recent commit over 6 years ago

J535D165/data-matching-software ⭐ 329

A list of free data matching and record linkage software.

dependent packages 0 total releases 0 most recent commit over 2 years ago

dedupeio/dedupe-examples ⭐ 306

:id: Examples for using the dedupe library

dependent packages 0 total releases 0 most recent commit over 4 years ago

zouzias/spark-lucenerdd ⭐ 127

Spark RDD with Lucene's query and entity linkage capabilities

dependent packages 0 total releases 39 most recent commit over 2 years ago

vintasoftware/entity-embed ⭐ 98

PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.

dependent packages 0 total releases 6 most recent commit about 4 years ago downloads badge

Suggest An Alternative To dedupe

Alternative Project Comparisons

dedupeio/dedupe vs Libpostal

dedupeio/dedupe vs Dedupe

dedupeio/dedupe vs Splink

dedupeio/dedupe vs Recordlinkage

dedupeio/dedupe vs Talisman

dedupeio/dedupe vs Csvdedupe

dedupeio/dedupe vs Data Matching Software

dedupeio/dedupe vs Dedupe Examples

dedupeio/dedupe vs Spark Lucenerdd

dedupeio/dedupe vs Entity Embed

Popular Deduplication Projects

restic/restic⭐ 34,837

Fast, secure, efficient backup program

borgbackup/borg⭐ 10,158

Deduplicating archiver with compression and authenticated encryption.

prometheus/alertmanager⭐ 8,323

Prometheus Alertmanager

kopia/kopia⭐ 5,678

Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.

gilbertchen/duplicacy⭐ 4,900

A new generation cloud backup tool