Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for similarity search
similarity-search
x
118 search results found
Typesense
⭐
16,670
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
Qdrant
⭐
15,789
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Weaviate
⭐
10,120
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
Gptcache
⭐
5,954
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
Gerev
⭐
2,534
🧠 AI-powered enterprise search engine 🔎
Hora
⭐
2,469
🚀 efficient approximate nearest neighbor search algorithm collections library written in Rust 🦀 .
Lancedb
⭐
2,000
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
Vald
⭐
1,422
Vald. A Highly Scalable Distributed Vector Search Engine
Usearch
⭐
1,335
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
Awesome Vector Search
⭐
1,143
Collections of vector search related libraries, service and research papers
Jvector
⭐
1,115
JVector: the most advanced embedded vector search engine
Similarity
⭐
979
TensorFlow Similarity is a python package focused on making similarity learning quick and easy.
Quaterion
⭐
580
Blazing fast framework for fine-tuning similarity learning models
Setsimilaritysearch
⭐
555
All-pair set similarity search on millions of sets in Python and on a laptop
Deephash
⭐
537
An Open-Source Package for Deep Learning to Hash (DeepHash)
Simsimd
⭐
514
Vector Similarity Functions 3x-200x Faster than SciPy and NumPy — for Python, JavaScript, and C 11, supporting f64, f32, f16, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐
Similarities
⭐
495
Similarities: a toolkit for similarity calculation and semantic search. 相似度计算、匹配搜索工具包,支持亿级数据文搜文、文搜图、图搜图,python3开发,开箱即用。
Dbreeze
⭐
486
C# .NET NOSQL ( key value store embedded ) ACID multi-paradigm database management system.
Arcadedb
⭐
421
ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.
Awesome Metric Learning
⭐
351
😎 A curated list of awesome practical Metric Learning and its applications
Elastiknn
⭐
347
Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.
Deephash Papers
⭐
319
Must-read papers on deep learning to hash (DeepHash)
Tinyvector
⭐
314
A tiny embedding database in pure Rust.
Aquiladb
⭐
311
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
Generalized Kmeans Clustering
⭐
284
Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.
Caiss
⭐
261
跨平台/多语言的 相似向量/相似词/相似句 高性能检索引擎。欢迎star & fork。Build together! Power another !
Voy
⭐
255
🕸️🦀 A WASM vector similarity search written in Rust
Mrpt
⭐
254
Fast and lightweight header-only C++ library (with Python bindings) for approximate nearest neighbor search
Elastik Nearest Neighbors
⭐
242
Go to: https://github.com/alexklibisz/elastiknn
Imgsmlr
⭐
237
Similar images search for PostgreSQL
Aquiladb
⭐
200
Drop in solution for Decentralized Neural Information Retrieval. Index latent vectors along with JSON metadata and do efficient k-NN search.
Kgclue
⭐
199
KgCLUE: 大规模中文开源知识图谱问答
Stocks Pattern Analyzer
⭐
184
This tool should help discover different patterns based on similarity measures in historical (financial) data
Pinecone Ts Client
⭐
144
The official TypeScript/Node client for the Pinecone vector database
Fast
⭐
128
End-to-end earthquake detection pipeline via efficient time series similarity search
Postgres Word2vec
⭐
105
utils to use word embedding models like word2vec vectors in a PostgreSQL database
Awesome Vector Database
⭐
96
A curated list of awesome works related to high dimensional structure/vector search & database
Ivf Hnsw
⭐
96
Code for ECCV2018 paper: Revisiting the Inverted Indices for Billion-Scale Approximate Nearest Neighbors
Citrus
⭐
92
(distributed) vector database
Fpsim2
⭐
89
Simple package for fast molecular similarity searches
Dhash Vips
⭐
80
vips-powered ruby gem to measure images similarity, implementing dHash and IDHash algorithms
Mass Ts
⭐
78
MASS (Mueen's Algorithm for Similarity Search) - a python 2 and 3 compatible library used for searching time series sub-sequences under z-normalized Euclidean distance for similarity.
Talkwithyourfiles
⭐
70
An LLM GUI application; enables you to interact with your files, offering dynamic parameters that can modify response behavior during runtime.
Org Similarity
⭐
66
Emacs package that helps org-mode users (re)discover similar documents
Efficientir
⭐
64
人工智障本地图片检索工具 | An EfficientNet based image retrieval tool
Wordvector_be
⭐
61
Web服务:使用腾讯 800 万词向量模型和 spotify annoy 引擎得到相似关键词
Scenery
⭐
58
photo gallery with extended search capabilities
Faiss Node
⭐
56
Node.js bindings for faiss
Consimilo
⭐
53
A Clojure library for querying large data-sets on similarity
Superinsight Db
⭐
49
Relational Database for Unstructured Data
Cbird
⭐
49
Command-line program for managing a media collection, with focus on Content-Based Image Retrieval (Computer Vision) methods for finding duplicates.
Apollo
⭐
48
Advanced similarity and duplicate source code proof of concept for our research efforts.
Telegram Similar Channels
⭐
44
Telegram similar channels search tool (CLI + Maltego)
Find Simdoc
⭐
42
Finding all pairs of similar documents time- and memory-efficiently
Similaritysearch.jl
⭐
36
A nearest neighbor search library with exact and approximate algorithms
Aaai17 Cdq
⭐
31
The implementation of AAAI-17 paper "Collective Deep Quantization of Efficient Cross-modal Retrieval"
Tf Metric Learning
⭐
28
Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utils as embedding projector.
Matrixprofile.jl
⭐
28
Time-series analysis using the Matrix profile in Julia
Amazon Ml Challenge2021
⭐
28
Scripts and Approach for Amazon ML Challenge
Portable Hnsw
⭐
28
Cottontaildb
⭐
27
Cottontail DB is a column store vector database aimed at multimedia retrieval. It allows for classical boolean as well as vector-space retrieval (nearest neighbour search) used in similarity search using a unified data and query model.
Kawaiisearch
⭐
25
An application to find similar pictures based on the VGG16 and kNN
Visualsearch
⭐
25
Visual Search is a little app to find and cluster similar images using Tagbox
Alvd
⭐
23
alvd = A Lightweight Vald. A lightweight distributed vector search engine works without K8s.
Refinery Sample Projects
⭐
22
Containing examples of projects you can use to test refinery. Please select the use case from the branches.
Whales_descriptors
⭐
22
python code for calculating the WHALES (Weighted Holistic Atom Localization and Entity Shape) molecular descriptors
Elastichash
⭐
21
Semantic Image Similarity Search in Elasticsearch
Simimg
⭐
20
Similar image search
Faiss Mobile
⭐
20
FAISS library compiled for iOS, macOS, tvOS, watchOS
Embedders
⭐
19
With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.
Pause
⭐
19
🍊 PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted by EMNLP'2021 🌴
Movie_bot
⭐
19
https://youtu.be/7hQhPMPNY6A
Go Set Similarity Search
⭐
19
Efficient set similarity search algorithms implemented in Go
Searchly
⭐
18
🎶 Song similarity search API based on lyrics
Topsim
⭐
18
Efficiently search the most similar strings against the query in Python.
Youtubegpt
⭐
16
YouTube GPT is an Android app that allows users to generate video summaries using A.I models. Also capable of answering cross-questions related to the video content!
Data Science Articles
⭐
15
A collection of my data science articles published in Towards Data Science and Towards AI.
Alegre
⭐
14
A text and media analysis service for Meedan Check, a collaborative media annotation platform
Awesome Milvus
⭐
14
A curated list of awesome Milvus projects and resources.
Chatgpt Long Term Memory
⭐
14
The ChatGPT Long Term Memory package is a powerful tool designed to empower your projects with the ability to handle a large number of simultaneous users and external sources.
Amoebae
⭐
14
Workflow for identifying and classifying homologous gene/protein sequences
Google Reverse Image Api
⭐
14
This is a simple API built using Node.js and Express.js that allows you to perform Google Reverse Image Search by providing an image URL. The API uses Cheerio to scrap Google's image search engine's html to get result text and similar images url.
Chromadb Rs
⭐
13
Rust client library for the ChromaDB vector database
Oasysdb
⭐
13
An AI-native lightweight and reliable open-source vector database designed to run on the edge.
Treeminhash
⭐
12
TreeMinHash: Fast Sketching for Weighted Jaccard Similarity Estimation
Donework
⭐
12
📚 Text generator using ML and Search Similarity
Whiplash
⭐
11
Serverless, lightweight, and fast vector database on top of DynamoDB
Anndb
⭐
11
Distributed Approximate Nearest Neighbors Database https://anndb.com
Jni Faiss
⭐
11
java native interface for faiss
Kv Match
⭐
10
ICDE 2019 - KV-match: A Subsequence Matching Approach Supporting Normalization and Time Warping
Contextqa
⭐
10
ContextQA - Open source tool to chat with your data
Soph
⭐
10
Efficiently import pictures while handling duplicates gracefully
Emdrive
⭐
10
💫 Fast similarity search DBMS
Qdrant Operator
⭐
9
Kubernetes operator for Qdrant
Digs Tool
⭐
9
Database-Integrated Genome Screening (DIGS) tool. Explore genomes interactively using BLAST and a relational database.
Nftfoundation
⭐
9
Code for paper "Under the Skin of Foundation NFT Auctions"
St2vec
⭐
9
Source code for Spatio-Temporal Trajectory Similarity Learning in Road Networks. KDD 2022.
Casescan
⭐
8
🔍 Clinical cases search by similarity specialized in Covid-19
Mb_milvus
⭐
8
Image Similarity search build on Milvus
Textsearch.jl
⭐
8
Searching methods and models for textual data; it was designed to work with SimilaritySearch.jl
1-100 of 118 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.