Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for machine learning data mining
data-mining
x
machine-learning
x
285 search results found
Ml From Scratch
⭐
23,095
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.
Awesome Datascience
⭐
23,007
📝 An awesome Data Science repository to learn and apply for real world problems.
Easyocr
⭐
20,438
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Lightgbm
⭐
15,999
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Awesome Production Machine Learning
⭐
15,804
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Gensim
⭐
15,180
Topic Modelling for Humans
Python Machine Learning Book
⭐
11,645
The "Python Machine Learning (1st edition)" book code repository and info resource
Ai Learn
⭐
8,256
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Py tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Pyod
⭐
7,751
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
Anomaly Detection Resources
⭐
7,616
Anomaly detection related books, papers, videos, and toolboxes
Catboost
⭐
7,564
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Sktime
⭐
7,368
A unified framework for machine learning with time series
Awesome Ml For Cybersecurity
⭐
6,564
:octocat: Machine Learning for Cyber Security
Mlxtend
⭐
4,669
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Orange3
⭐
4,469
🍊 📊 💡 Orange: Interactive data analysis
Datascience
⭐
3,955
Curated list of Python resources for data science.
Rath
⭐
3,717
Next generation of automated data exploratory analysis and visualization platform.
Kaggle Solutions
⭐
3,579
🏅 Collection of Kaggle Solutions and Ideas 🏅
Alink
⭐
3,479
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
Machinelearning
⭐
3,016
Machine learning resources
Awesome Ts Anomaly Detection
⭐
2,320
List of tools & datasets for anomaly detection on time-series data.
Bolt
⭐
2,312
10x faster matrix and vector operations
Papers Literature Ml Dl Rl Ai
⭐
1,798
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Ai For Security Learning
⭐
1,571
安全场景、基于AI的安全算法和安全数据分析业界实践
Pycm
⭐
1,413
Multi-class confusion matrix library in Python
Vvedenie Mashinnoe Obuchenie
⭐
1,187
📝 Подборка ресурсов по машинному обучению
Clevercsv
⭐
1,168
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Graph Fraud Detection Papers
⭐
1,148
A curated list of graph-based fraud, anomaly, and outlier detection papers & resources
Awesome Fl
⭐
1,103
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
Awesome Ai Books
⭐
1,086
Some awesome AI related books and pdfs for learning and downloading, also apply some playground models for learning
Nfstream
⭐
1,015
NFStream: a Flexible Network Data Analysis Framework.
Pyclustering
⭐
853
pyclustring is a Python, C++ data mining library.
Feature Engineering And Feature Selection
⭐
798
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
Cookbook 2nd
⭐
773
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Graph Adversarial Learning Literature
⭐
772
A curated list of adversarial attacks and defenses papers on graph-structured data.
Elki
⭐
746
ELKI Data Mining Toolkit
R
⭐
745
Collection of various algorithms implemented in R.
Aeon
⭐
723
A toolkit for conducting machine learning tasks with time series data
Interpretable_machine_learning_with_python
⭐
629
Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Awesome Ai For Time Series Papers
⭐
627
A professional list of Papers, Tutorials, and Surveys on AI for Time Series in top AI conferences and journals.
Awesome Deep Graph Clustering
⭐
626
Awesome Deep Graph Clustering is a collection of SOTA, novel deep graph clustering methods (papers, codes, and datasets).
Pm4py Core
⭐
617
Public repository for the PM4Py (Process Mining for Python) project.
Adbench
⭐
609
Official Implement of "ADBench: Anomaly Detection Benchmark".
Combo
⭐
607
(AAAI' 20) A Python Toolbox for Machine Learning Model Combination
Kam1n0 Community
⭐
601
The Kam1n0 Assembly Analysis Platform
Timetk
⭐
594
Time series analysis in the `tidyverse`
Pypots
⭐
558
A Python toolbox/library for reality-centric machine learning/deep learning on partially-observed time series with PyTorch, including SOTA models supporting tasks of imputation, classification, clustering, and forecasting on incomplete (irregularly-sampled) multivariate time series with NaN missing values/data. https://arxiv.org/abs/2305.18811
Cookbook 2nd Code
⭐
532
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Text_mining_resources
⭐
511
Resources for learning about Text Mining and Natural Language Processing
Amazing Feature Engineering
⭐
485
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Dgfraud
⭐
432
A Deep Graph-based Toolbox for Fraud Detection
Chefboost
⭐
428
A Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4,5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting, Random Forest and Adaboost w/categorical features support for Python
Matminer
⭐
422
Data mining for materials science
Rmdl
⭐
409
RMDL: Random Multimodel Deep Learning for Classification
Mli Resources
⭐
405
H2O.ai Machine Learning Interpretability Resources
Suod
⭐
371
(MLSys' 21) An Acceleration System for Large-scare Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)
Fraud Detection Handbook
⭐
352
Reproducible Machine Learning for Credit Card Fraud Detection - Practical Handbook
Pydatalab
⭐
347
open source for wechat-official-account (ID: PyDataLab)
Automlpipeline.jl
⭐
331
A package that makes it trivial to create and evaluate machine learning pipeline architectures.
Ml And Dm In Action
⭐
318
Share my code during learning machine learning and data mining
Artificial Adversary
⭐
317
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Pyss3
⭐
307
A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI :octocat:)
Efficient Apriori
⭐
286
An efficient Python implementation of the Apriori algorithm.
Lagoujob
⭐
250
Job data mining repo for lagou.com
Awesome Python Data Science Books
⭐
242
Probably the best curated list of data science books in Python
Pzad
⭐
235
Курс "Прикладные задачи анализа данных" (ВМК, МГУ имени М.В. Ломоносова)
Imbalanced Ensemble
⭐
234
Class-imbalanced Ensemble Learning in Python. | 类别不平衡/长尾机器学习库
Gwu_data_mining
⭐
228
Materials for GWU DNSC 6279 and DNSC 6290.
Qminer
⭐
217
Analytic platform for real-time large-scale streams containing structured and unstructured data.
Awesome Deep Graph Anomaly Detection
⭐
215
Awesome graph anomaly detection techniques built based on deep learning frameworks. Collections of commonly used datasets, papers as well as implementations are listed in this github repository. We also invite researchers interested in anomaly detection, graph representation learning, and graph anomaly detection to join this project as contributors and boost further research in this area.
Data Science Resources
⭐
197
👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Toloka Kit
⭐
195
Toloka-Kit is a Python library for working with Toloka API.
Data Science Toolkit
⭐
185
Collection of stats, modeling, and data science tools in Python and R.
Estadistica Con R
⭐
178
Apuntes personales sobre estadística, machine learning y lenguaje de programación R
Tipdm
⭐
178
TipDM建模平台,开源的数据挖掘工具。
Wrapper Feature Selection Toolbox Python
⭐
170
This toolbox offers 13 wrapper feature selection methods (PSO, GA, GWO, HHO, BA, WOA, and etc.) with examples. It is simple and easy to implement.
Machine_learning_for_good
⭐
145
Machine learning fundamentals lesson in interactive notebooks
Transtab
⭐
140
NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables
Wekadeeplearning4j
⭐
139
Weka package for the Deeplearning4j java library
Hust Homeworks
⭐
134
HUST Homeworks(Course design / Reports / Labs / etc. )
Pre Modern_chinese_corpus_dataset
⭐
132
近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言
Awesome Ensemble Learning
⭐
129
Ensemble learning related books, papers, videos, and toolboxes
Sparselsh
⭐
128
A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Care Gnn
⭐
127
Code for CIKM 2020 paper Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters
Pandora
⭐
127
PANDORA Advanced Machine Learning for Data Integration, Analysis, and Insightful Discoveries in Health and Disease 💻
Lab Workshops
⭐
120
Materials for workshops on text mining, machine learning, and data visualization
Dh Core
⭐
118
Functional data science
Lola
⭐
118
LoL (League of Legends) game data analysis / analytics
Emotion Recognition From Speech
⭐
116
A machine learning application for emotion recognition from speech
Tiger
⭐
108
Python toolbox to evaluate graph vulnerability and robustness (CIKM 2021)
Books
⭐
106
Books related to AI/ML/DL/GENAI
Pathpy
⭐
102
pathpy is an OpenSource python package for the modeling and analysis of pathways and temporal networks using higher-order and multi-order graphical models
Vizuka
⭐
100
Explore high-dimensional datasets and how your algo handles specific regions.
Bdc2019
⭐
98
2019中国高校计算机大赛——大数据挑战赛 第三名解决方案
Dataminingnotesandpractice
⭐
88
记录我学习数据挖掘过程的笔记和见到的奇技,持续更新~
Osdt
⭐
85
Optimal Sparse Decision Trees
Network Intrusion Detection
⭐
85
Machine Learning with the NSL-KDD dataset for Network Intrusion Detection
Classix
⭐
85
Fast and explainable clustering in Python
Practicalmachinelearning
⭐
79
A curated collection of machine learning resources, including notebooks, code, and books, all of which are either free or open-source
Deep Learning For Bci
⭐
78
Resources for Book: Deep Learning for EEG-based Brain-Computer Interface: Representations, Algorithms and Applications
Related Searches
Python Machine Learning (14,099)
Jupyter Notebook Machine Learning (12,247)
Machine Learning Neural Network (4,397)
Machine Learning Tensorflow (4,050)
Machine Learning Natural Language Processing (3,891)
Machine Learning Artificial Intelligence (3,877)
Machine Learning Data Science (3,802)
Machine Learning Pytorch (2,910)
Machine Learning Dataset (2,298)
Machine Learning Classification (2,099)
1-100 of 285 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.