Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language | Description |
---|---|---|---|---|---|---|---|---|---|---|---|
Lsh | 243 | | | | a year ago | | | 12 | mit | Python | Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents |
Neural Scam Artist | 15 | | | | 2 years ago | | | | mit | Python | Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset. |
Dedup | 10 | | | | a year ago | | | | mit | Python | Find duplicate text files. |
Product Deduplication | 7 | | | | 5 years ago | | | | apache-2.0 | HTML | A practical implementation for product deduplication using TFIDF and Super Bit LSH |
Narrow Down | 6 | | | | a year ago | 18 | May 01, 2023 | 10 | apache-2.0 | Python | Fast fuzzy text search |