Awesome Open Source
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210
The Top 159 Clustering Open Source Projects
Smile
⭐
5,120
Statistical Machine Intelligence & Learning Engine
Tensorflow Book
⭐
4,422
Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.
Hazelcast
⭐
4,148
Open Source In-Memory Data Grid
Protoactor Go
⭐
3,471
Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Leaflet.markercluster
⭐
3,049
Marker Clustering plugin for Leaflet
Dedupe
⭐
2,869
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Orange3
⭐
2,568
🍊 📊 💡 Orange: Interactive data analysis
Hdbscan
⭐
1,788
A high performance implementation of HDBSCAN clustering.
Machine Learning With Python
⭐
1,739
Practice and tutorial-style notebooks covering a wide variety of machine learning techniques
Practical Machine Learning With Python
⭐
1,665
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Awesome Single Cell
⭐
1,580
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
Awesome Community Detection
⭐
1,572
A curated list of community detection research papers with implementations.
Dat8
⭐
1,484
General Assembly's 2015 Data Science course in Washington, DC
Mlr
⭐
1,454
Machine Learning in R
Uis Rnn
⭐
1,282
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Libcluster
⭐
1,236
Automatic cluster formation/healing for Elixir applications
Supercluster
⭐
1,212
A very fast geospatial point clustering library for browsers and Node.
Ml
⭐
1,187
A high-level machine learning and deep learning library for the PHP language.
Cluster
⭐
1,120
Easy Map Annotation Clustering 📍
Text Analytics With Python
⭐
1,089
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Bottleneck
⭐
1,078
Job scheduler and rate limiter, supports Clustering
Protoactor Dotnet
⭐
1,036
Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Moosefs
⭐
995
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System
Swarm
⭐
979
Easy clustering, registration, and distribution of worker processes for Erlang/Elixir
Mlj.jl
⭐
932
A Julia machine learning framework
Tribuo
⭐
820
Tribuo - A Java machine learning library
Pyclustering
⭐
776
pyclustering is a Python, C++ data mining library.
Minisom
⭐
757
🔴 MiniSom is a minimalistic implementation of the Self Organizing Maps
Agoo
⭐
671
A High Performance HTTP Server for Ruby
Machine Learning Octave
⭐
631
🤖 MatLab/Octave examples of popular machine learning algorithms with code examples and mathematics being explained
Depth_clustering
⭐
624
🚕 Fast and robust clustering of point clouds generated with a Velodyne sensor.
Complexheatmap
⭐
621
Make Complex Heatmaps
Scikit Multilearn
⭐
618
A scikit-learn based module for multi-label et al. classification
Elki
⭐
601
ELKI Data Mining Toolkit
Eliasdb
⭐
601
EliasDB is a graph-based database.
Talisman
⭐
576
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Cilantro
⭐
563
A lean C++ library for working with point cloud data
Unsupervised Classification
⭐
541
SCAN: Learning to Classify Images without Labels (ECCV 2020), incl. SimCLR.
Lopq
⭐
527
Training of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
Superpoint_graph
⭐
522
Large-scale Point Cloud Semantic Segmentation with Superpoint Graphs
Clustergcn
⭐
503
A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).
Dtaidistance
⭐
477
Time series distances: Dynamic Time Warping (DTW)
Vsearch
⭐
433
Versatile open-source tool for microbiome analysis
Shaman
⭐
422
Small, lightweight, API-driven DNS server.
React Native Map Clustering
⭐
421
React Native map clustering both for Android and iOS.
Lidar_for_ad_references
⭐
412
A list of references on lidar point cloud processing for autonomous driving
Moa
⭐
399
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
Akkatecture
⭐
388
A CQRS and event sourcing framework for .NET Core using Akka.NET
Cdp
⭐
386
Code for our ECCV 2018 work.
Dogvscat
⭐
355
Sample Docker Swarm cluster stack of tools
Stats Maths With Python
⭐
350
General statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
Malheur
⭐
313
A Tool for Automatic Analysis of Malware Behavior
Cdhit
⭐
306
Automatically exported from code.google.com/p/cdhit
Self Label
⭐
297
Self-labelling via simultaneous clustering and representation learning. (ICLR 2020)
Elasticluster
⭐
292
Create clusters of VMs on the cloud and configure them with Ansible.
Rabbitmq Peer Discovery K8s
⭐
282
Kubernetes-based peer discovery mechanism for RabbitMQ
React Native Maps Super Cluster
⭐
282
A Clustering-enabled map for React Native
Supervizer
⭐
278
NodeJS Application Manager
2018 Machinelearning Lectures Esa
⭐
278
Machine Learning Lectures at the European Space Agency (ESA) in 2018
Gcn_clustering
⭐
278
Code for CVPR'19 paper Linkage-based Face Clustering via GCN
Google Maps Clustering
⭐
275
Fast marker clustering library for Google Maps Android API.
Coherence
⭐
263
Oracle Coherence Community Edition
Dagsfm
⭐
256
Distributed and Graph-based Structure from Motion
Mongodb_consistent_backup
⭐
253
A tool for performing consistent backups of MongoDB Clusters or Replica Sets
L2c
⭐
248
Learning to Cluster. A deep clustering strategy.
Clustering With Deep Learning
⭐
223
Generic implementation for clustering with deep learning: representation learning (DNN) + clustering
Clustering.jl
⭐
218
A Julia package for data clustering
Gemsec
⭐
207
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Keras_deep_clustering
⭐
199
How to do Unsupervised Clustering with Keras
Spectralcluster
⭐
196
Python re-implementation of the spectral clustering algorithm in the paper "Speaker Diarization with LSTM"
Pqkmeans
⭐
186
Fast and memory-efficient clustering
Uci Ml Api
⭐
185
Simple API for UCI Machine Learning Dataset Repository (search, download, analyze)
Dtwclust
⭐
184
R Package for Time Series Clustering Along with Optimizations for DTW
Clustergrammer
⭐
184
An interactive heatmap visualization built using D3.js
Vectorai
⭐
180
Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.
Timeseries Clustering Vae
⭐
179
Variational Recurrent Autoencoder for time series clustering in PyTorch
Dcc
⭐
174
This repository contains the source code and data for reproducing results of Deep Continuous Clustering paper
Micro Cluster
⭐
173
Run multiple micro servers and a front proxy at a time
Splitter
⭐
170
A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).
Rayo.js
⭐
165
Micro framework for Node.js
Gsdmm
⭐
164
GSDMM: Short text clustering
Flexsearch Server
⭐
164
High-performance FlexSearch Server for Node.js (Cluster)
Mars
⭐
160
Asynchronous Block-Level Storage Replication
Newsrecommender
⭐
160
A news recommendation system tailored for user communities
Danmf
⭐
159
A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
Instance Segmentation With Discriminative Loss Tensorflow
⭐
155
Tensorflow implementation of "Semantic Instance Segmentation with a Discriminative Loss Function"
Slot Attention
⭐
155
Implementation of Slot Attention from GoogleAI
Dbscan
⭐
154
Density Based Clustering of Applications with Noise (DBSCAN) and Related Algorithms - R package
Khiva
⭐
154
An open-source library of algorithms to analyse time series in GPU and CPU.
Ml Course
⭐
153
Starter code of Prof. Andrew Ng's machine learning MOOC in R statistical language
Python Clustering Exercises
⭐
153
Jupyter Notebook exercises for k-means clustering with Python 3 and scikit-learn
Docker Blinkt Workshop
⭐
151
Get into physical computing with Docker and Raspberry Pi
Isee
⭐
147
R/shiny interface for interactive visualization of data in SummarizedExperiment objects
Docker Rabbitmq Cluster
⭐
140
Cluster RabbitMQ (official docker image)
Meanrecipe
⭐
137
Get a consensus recipe for your next meal. 🍪 🍰
Hazelcast Go Client
⭐
135
Hazelcast IMDG Go Client
Qlik Py Tools
⭐
131
Data Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
Hawk
⭐
129
A web-based GUI for managing and monitoring the Pacemaker High-Availability cluster resource manager
Libcluster
⭐
128
An extensible C++ library of Hierarchical Bayesian clustering algorithms, such as Bayesian Gaussian mixture models, variational Dirichlet processes, Gaussian latent Dirichlet allocation and more.
Clustering
⭐
126
Clustering / Subspace Clustering Algorithms on MATLAB
1-100 of 159 projects