Awesome Open Source
Search results for java similarity
119 search results found
Java String Similarity
⭐
2,417
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Similarity
⭐
1,239
similarity: Text similarity calculation toolkit for Java. Written in Java, it can be used for text similarity calculation, sentiment analysis, and similar tasks, and works out of the box.
Cogcomp Nlp
⭐
448
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.
Fast Elasticsearch Vector Scoring
⭐
348
Score documents using embedding-vectors dot-product or cosine-similarity with ES Lucene engine
Sifarish
⭐
328
Content based and collaborative filtering based recommendation and personalization engine implementation on Hadoop and Storm
Java Lsh
⭐
275
A Java implementation of Locality Sensitive Hashing (LSH)
Simmetrics
⭐
253
Similarity or Distance Metrics, e.g. Levenshtein, for Java
Elasticsearch Vector Scoring
⭐
221
Score documents with pure dot product / cosine similarity with ES
Sourceafis Java
⭐
217
Fingerprint recognition engine for Java that takes a pair of human fingerprint images and returns their similarity score. Supports efficient 1:N search.
Simbase
⭐
211
A vector similarity database
K Nn
⭐
199
🆕 A machine learning plugin which supports an approximate k-NN search algorithm for Open Distro for Elasticsearch
Textanalyzer
⭐
149
A text analyzer based on machine learning, statistics, and dictionaries. So far, it supports hot word extraction, text classification, part-of-speech tagging, named entity recognition, Chinese word segmentation, address extraction, synonyms, text clustering, word2vec models, edit distance, sentence similarity, word sentiment tendency, name recognition, idiom recognition, place name recognition, organization recognition, traditional Chinese recognition,
Cafecompare
⭐
130
Java code comparison tool (jar / class)
Java String Similarity
⭐
117
A Java library that implements several algorithms that calculate similarity between strings.
Fast Cosine Similarity
⭐
85
Fast cosine similarity (vector scoring) ElasticSearch 6.4+ Plugin
Elasticsearch Position Similarity
⭐
68
Elasticsearch term position similarity plugin
Stringdistance
⭐
57
A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more.
Antiplag
⭐
55
Assignment plagiarism-checking software that checks similarity between program code, document text, and images: code-similarity, text-similarity, and image-similarity computation for the code, documents, and images of assignments.
Reactiondecoder
⭐
51
Reaction Decoder Tool (RDT) - Atom Atom Mapping Tool
Similar Request Excluder
⭐
42
A Burp Suite extension that automatically marks similar requests as 'out-of-scope'.
Indra
⭐
38
Indra is a Web Service which allows easy access to different distributional semantics models in several languages.
Summarization
⭐
37
Performs multi-document summarization. Includes a method to generate summaries: the method uses a sentence importance score calculator based on various semantic features and a semantic similarity score to select the sentences most representative of the document. It uses the stack-decoder algorithm as a template and builds on it to produce summaries that are closer to optimal.
Angeleyes
⭐
36
A system for searching for information about missing children. You can search text information and upload images to compute their similarity with the images in the system database, so that you can check whether a child is the person you are looking for.
Esalib
⭐
36
My implementation of Explicit Semantic Analysis (ESA) library that we used at KMi, Open University to produce our submission at the NTCIR-9 CrossLink task.
Ws4j
⭐
36
WordNet Similarity for Java provides an API for several Semantic Relatedness/Similarity algorithms
Relevance Based On Parse Trees
⭐
36
Sentence and paragraph - level relevance and applications
Dv Cosine
⭐
35
Itsjustacoincidenceprofessor
⭐
35
"It's just a coincidence professor!" is a plagiarism checker for source code. It uses the Wagner–Fischer algorithm to precisely and accurately determine percentage similarity of two given strings. We also cross reference common sites like GitHub and Stackoverflow, for potential cheating.
Java Graphs
⭐
33
Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...
Minhash
⭐
33
This provides tools for the b-bit MinHash algorithm.
Simpletextsearch
⭐
31
A lightweight and easy to use full text search implementation for Java. Uses inverted index and cosine similarity w/ TFIDF ranking.
Trie
⭐
31
A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.
Similarity
⭐
31
Similarity is an optical as well as keyword based image similarity search engine built on top of Lire.
Ghidra Patchdiff Correlator
⭐
28
This project tries to provide additional Ghidra Version Tracking Correlators suitable for patch diffing.
Lshdb
⭐
27
LSHDB is a parallel and distributed data engine, which relies on Locality-Sensitive Hashing and noSQL systems, for performing record linkage (and privacy-preserving record linkage) and similarity search tasks.
Ankus
⭐
26
Siamese
⭐
25
Siamese: a scalable code clone search engine
Clanongithub
⭐
24
An implementation of CLAN (Closely reLated ApplicatioNs) on GitHub.
Multsum
⭐
22
Summarization system taking multiple sentence similarity measures into account
Simhashdb
⭐
21
Text retrieval database based on simhash similarity search
K Meanscluster
⭐
21
A Java implementation of the k-means algorithm. It uses a ball tree as its internal data structure to accelerate computation and the 2-norm distance to compute the similarity between instances.
Ws4j
⭐
20
WordNet Similarity for Java provides an API for several Semantic Relatedness/Similarity algorithms
Elasticsearch Flavor
⭐
20
Kelp Additional Kernels
⭐
19
Iterative Cf
⭐
19
storm/trident based highly scalable recommendation engine
Faceverificationandroid
⭐
18
Face verification with mxnet on android
Image Similarity
⭐
18
Canny edges + color histograms + KD-tree indexing
Byblo
⭐
18
Tools for the automatic construction of distributional thesauri
Similarity Uniform Fuzzy Hash
⭐
17
Similarity algorithm (computes the similarity between two files as a 0 to 1 score) with linear complexity, based on context triggered piecewise (fuzzy) hashes.
Itemcf
⭐
17
ItemBased Collaborative Filtering in Apache Spark
Approximate Text Match
⭐
16
APPROX EQUALS AND APPROX CONTAINS
Sikuliwrapper
⭐
16
Melodyshape
⭐
16
A Library and Tool for Symbolic Melodic Similarity based on Shape Similarity
Similarity Search Java
⭐
16
Easy-to-use Java similarity algorithms for text and numeric-series
Libpecker
⭐
16
an obfuscation-resilient, highly precise and reliable library detector for Android applications
Word_similarity
⭐
15
HowNet-based semantic similarity computation, Python 2.7 API
Mahout_collaborative_filtering
⭐
15
Recommendation, Classification, Clustering in Java
Libsim
⭐
15
Libsim is a dictionary of similarity (distance) functions.
Thesaurus
⭐
14
A dynamically generated thesaurus using Syntactic N-grams parsed by Google Research. Rather than providing synonyms, this thesaurus provides words used in similar contexts. It also provides actual values, so certainty of similarity can be properly gauged.
Solr Vector Scoring
⭐
14
Vector Plugin for Solr: calculate dot product / cosine similarity on documents
Greedystringtiling
⭐
13
A Java implementation of the Greedy String Tiling algorithm
Simidroid
⭐
13
Identifying and Explaining Similarities in Android Apps
Dedupe
⭐
13
Java DSL for (online) deduplication
Mi File
⭐
13
Automatically exported from code.google.com/p/mi-file
Serf
⭐
13
Stanford Entity-Resolution Framework
Lazo
⭐
12
Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method
Hesml
⭐
12
HESML Java software library of ontology-based semantic similarity measures and information content models
Jarowinklersimilarity
⭐
12
A Java implementation of the Jaro Winkler Similarity, which is optimized for the search of similar strings in a large set of strings.
Publications Ssnmm
⭐
12
Experiments for the publication 'Scalable Similarity-Based Neighborhood Methods with MapReduce' at the ACM Recommender Systems Conference 2012
Simptextalign
⭐
11
Repo for the simplified text alignment tools.
Drug Target Prediction
⭐
11
Reeb_graph
⭐
11
Topological similarity estimation for 3D models (triangle meshes) using multiresolutional Reeb graphs.
Simmetrics
⭐
11
SimMetrics is a Similarity Metric Library, based on previous work by http://sourceforge.net/projects/simmetrics/
Sikuli Factory
⭐
11
A PageFactory-based model for SikuliX.
Spellblaze
⭐
10
Symmetric Delete spelling correction algorithm using Java
Es Commons Plugin
⭐
10
Similarity and other common Elasticsearch add-ons
Code Similarity
⭐
10
Code Similarity Detection.
Lad
⭐
10
An ongoing project aiming to provide a decentralised distributed recommender system for the discovery and retrieval of new media.
Javaphash
⭐
10
A Java version of the PHash (Perceptual Hash) algorithm (a Java port of the official, classic PHash algorithm)
Es Custom Similarity Provider
⭐
10
Custom Elasticsearch similarity plug-in
Kmeanscluster
⭐
10
A Java implementation of the k-means algorithm. It uses a ball tree as its internal data structure to accelerate computation and the 2-norm distance to compute the similarity between instances.
Gossto
⭐
9
Gene Ontology Semantic Similarity Tool
Nilsimsa
⭐
9
A Java library for computing and comparing Nilsimsa string similarity hashes.
Smarttrace
⭐
8
SmartTrace is a prototype crowdsourced trajectory similarity search framework for smartphones. More info: http://smarttrace.cs.ucy.ac.cy/
Biosses
⭐
8
Domain Discovery D4
⭐
8
Data-Driven Domain Discovery for Structured Datasets
Kdd Music Recommender Mapreduce
⭐
8
KDD Music Recommender with MapReduce!
Citation Context Summarization
⭐
8
An algorithm for extractive summarization of citation contexts of a paper based on C-LexRank
Elasticsearch Similarity Plugin
⭐
8
elasticsearch-similarity-plugin
Comparestring2
⭐
8
String comparison made easy
Disco
⭐
7
compute semantic similarity between arbitrary words and phrases in many languages
Related Searches
⭐
7
Related Searches - get queries related or similar to a given query
Audiomerge
⭐
7
Merge multiple scattered music collections into one, taking only the best version of duplicates
Similarity
⭐
7
Calculate similarity between two contents
Naisc
⭐
7
Naisc - Automated Linking Tool
Twitter Data Miner
⭐
7
Simrank
⭐
7
An implementation of the SimRank algorithm in Java
Semsim
⭐
7
Java library for building vector space models and calculating distributional similarity
Fit3d
⭐
7
🔍 Fit3D - An application for template-based detection of small structural motifs in protein structures and macromolecular structure data.
Stsmodule
⭐
6
STSModule (Semantic Textual Similarity Module) aims to help users compute the semantic similarity between sentences and documents in English. Similarity measures play an important role in a wide variety of NLP applications. By way of example, Information Retrieval (IR) relies on semantic similarity to determine the best result for a related query. Semantic similarity also plays a crucial role in other applications such as Paraphrasing and Translation Memory (TM). However, computi
1-100 of 119 search results