Awesome Open Source
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210
The Top 171 Clustering Open Source Projects
Smile
⭐
5,219
Statistical Machine Intelligence & Learning Engine
Tensorflow Book
⭐
4,433
Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.
Hazelcast
⭐
4,295
Open Source In-Memory Data Grid
Protoactor Go
⭐
3,585
Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Leaflet.markercluster
⭐
3,129
Marker Clustering plugin for Leaflet
Dedupe
⭐
2,978
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Orange3
⭐
2,695
🍊 📊 💡 Orange: Interactive data analysis
Machine Learning With Python
⭐
1,863
Practice and tutorial-style notebooks covering a wide variety of machine learning techniques
Hdbscan
⭐
1,852
A high performance implementation of HDBSCAN clustering.
Practical Machine Learning With Python
⭐
1,723
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Awesome Single Cell
⭐
1,665
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
Awesome Community Detection
⭐
1,635
A curated list of community detection research papers with implementations.
Dat8
⭐
1,490
General Assembly's 2015 Data Science course in Washington, DC
Mlr
⭐
1,477
Machine Learning in R
Uis Rnn
⭐
1,316
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Ml
⭐
1,300
A high-level machine learning and deep learning library for the PHP language.
Libcluster
⭐
1,296
Automatic cluster formation/healing for Elixir applications
Supercluster
⭐
1,262
A very fast geospatial point clustering library for browsers and Node.
Text Analytics With Python
⭐
1,185
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Protoactor Dotnet
⭐
1,138
Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Cluster
⭐
1,134
Easy Map Annotation Clustering 📍
Bottleneck
⭐
1,125
Job scheduler and rate limiter, supports Clustering
Mlj.jl
⭐
1,045
A Julia machine learning framework
Moosefs
⭐
1,037
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Swarm
⭐
1,010
Easy clustering, registration, and distribution of worker processes for Erlang/Elixir
Tribuo
⭐
897
Tribuo - A Java machine learning library
Minisom
⭐
814
🔴 MiniSom is a minimalistic implementation of the Self Organizing Maps
Pyclustering
⭐
813
pyclustering is a Python, C++ data mining library.
Agoo
⭐
683
A High Performance HTTP Server for Ruby
Complexheatmap
⭐
675
Make Complex Heatmaps
Depth_clustering
⭐
674
🚕 Fast and robust clustering of point clouds generated with a Velodyne sensor.
Scikit Multilearn
⭐
650
A scikit-learn based module for multi-label et al. classification
Machine Learning Octave
⭐
649
🤖 MatLab/Octave examples of popular machine learning algorithms with code examples and mathematics being explained
Unsupervised Classification
⭐
624
SCAN: Learning to Classify Images without Labels (ECCV 2020), incl. SimCLR.
Elki
⭐
617
ELKI Data Mining Toolkit
Eliasdb
⭐
614
EliasDB: a graph-based database.
Talisman
⭐
588
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Cilantro
⭐
587
A lean C++ library for working with point cloud data
Superpoint_graph
⭐
539
Large-scale Point Cloud Semantic Segmentation with Superpoint Graphs
Clustergcn
⭐
536
A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).
Lopq
⭐
530
Training of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
Dtaidistance
⭐
517
Time series distances: Dynamic Time Warping (DTW)
Lidar_for_ad_references
⭐
465
A list of references on lidar point cloud processing for autonomous driving
React Native Map Clustering
⭐
460
React Native map clustering both for Android and iOS.
Vsearch
⭐
448
Versatile open-source tool for microbiome analysis
Shaman
⭐
428
Small, lightweight, API-driven DNS server.
Akkatecture
⭐
427
A CQRS and event sourcing framework for .NET Core using Akka.NET
Moa
⭐
416
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
Stats Maths With Python
⭐
397
General statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
Cdp
⭐
395
Code for our ECCV 2018 work.
Dogvscat
⭐
379
Sample Docker Swarm cluster stack of tools
Self Label
⭐
335
Self-labelling via simultaneous clustering and representation learning. (ICLR 2020)
Coherence
⭐
332
Oracle Coherence Community Edition
Cdhit
⭐
327
Automatically exported from code.google.com/p/cdhit
Malheur
⭐
316
A Tool for Automatic Analysis of Malware Behavior
R
⭐
304
Collection of various algorithms implemented in R.
Elasticluster
⭐
302
Create clusters of VMs on the cloud and configure them with Ansible.
Gcn_clustering
⭐
287
Code for CVPR'19 paper Linkage-based Face Clustering via GCN
Rabbitmq Peer Discovery K8s
⭐
286
Kubernetes-based peer discovery mechanism for RabbitMQ
React Native Maps Super Cluster
⭐
286
A Clustering-enabled map for React Native
Supervizer
⭐
281
The most simple NodeJS application manager with RESTful API.
2018 Machinelearning Lectures Esa
⭐
281
Machine Learning Lectures at the European Space Agency (ESA) in 2018
Google Maps Clustering
⭐
277
Fast marker clustering library for Google Maps Android API.
Dagsfm
⭐
274
Distributed and Graph-based Structure from Motion
L2c
⭐
267
Learning to Cluster. A deep clustering strategy.
Mongodb_consistent_backup
⭐
256
A tool for performing consistent backups of MongoDB Clusters or Replica Sets
Clustering With Deep Learning
⭐
240
Generic implementation for clustering with deep learning : representation learning (DNN) + clustering
Clustering.jl
⭐
228
A Julia package for data clustering
Spectralcluster
⭐
221
Python re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"
Gemsec
⭐
212
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Keras_deep_clustering
⭐
204
How to do Unsupervised Clustering with Keras
Vectorai
⭐
202
Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.
Timeseries Clustering Vae
⭐
201
Variational Recurrent Autoencoder for time series clustering in PyTorch
Uci Ml Api
⭐
194
Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)
Pqkmeans
⭐
191
Fast and memory-efficient clustering
Clustergrammer
⭐
188
An interactive heatmap visualization built using D3.js
Dtwclust
⭐
187
R Package for Time Series Clustering Along with Optimizations for DTW
Gsdmm
⭐
182
GSDMM: Short text clustering
Dcc
⭐
179
This repository contains the source code and data for reproducing results of Deep Continuous Clustering paper
Splitter
⭐
177
A PyTorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Micro Cluster
⭐
173
Run multiple micro servers and a front proxy at a time
Rayo.js
⭐
172
Micro framework for Node.js
Flexsearch Server
⭐
172
High-performance FlexSearch Server for Node.js (Cluster)
Mars
⭐
168
Asynchronous Block-Level Storage Replication
Khiva
⭐
168
An open-source library of algorithms to analyse time series in GPU and CPU.
Slot Attention
⭐
168
Implementation of Slot Attention from GoogleAI
Newsrecommender
⭐
165
A news recommendation system tailored for user communities
Dbscan
⭐
163
Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and Related Algorithms - R package
Danmf
⭐
161
A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
Instance Segmentation With Discriminative Loss Tensorflow
⭐
159
TensorFlow implementation of "Semantic Instance Segmentation with a Discriminative Loss Function"
Isee
⭐
157
R/shiny interface for interactive visualization of data in SummarizedExperiment objects
Ml Course
⭐
156
Starter code of Prof. Andrew Ng's machine learning MOOC in R statistical language
Python Clustering Exercises
⭐
155
Jupyter Notebook exercises for k-means clustering with Python 3 and scikit-learn
Docker Blinkt Workshop
⭐
151
Get into physical computing with Docker and Raspberry Pi
Matrixprofile
⭐
149
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Docker Rabbitmq Cluster
⭐
144
Cluster RabbitMQ (official docker image)
Machine Learning Projects
⭐
144
This repository consists of all my Machine Learning Projects.
Hazelcast Go Client
⭐
141
Hazelcast IMDG Go Client
Meanrecipe
⭐
139
Get a consensus recipe for your next meal. 🍪 🍰
Qlik Py Tools
⭐
137
Data Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
1-100 of 171 projects