Awesome Open Source
Search results for python data mining
394 search results found
Easyocr
⭐
20,438
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Lightgbm
⭐
16,053
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Gensim
⭐
15,180
Topic Modelling for Humans
Python Machine Learning Book
⭐
11,645
The "Python Machine Learning (1st edition)" book code repository and info resource
Ai Learn
⭐
8,256
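An AI learning roadmap that collects nearly 200 hands-on cases and projects, with free accompanying course materials; suitable for complete beginners and geared toward job-ready practice. Covers popular areas including: Py tensorflow machine-learning, deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv, and more.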
Pyod
⭐
7,751
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
Anomaly Detection Resources
⭐
7,616
Anomaly detection related books, papers, videos, and toolboxes
Catboost
⭐
7,564
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Mlxtend
⭐
4,669
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Orange3
⭐
4,469
🍊 📊 💡 Orange: Interactive data analysis
Datascience
⭐
3,955
Curated list of Python resources for data science.
Textract
⭐
3,699
extract text from any document. no muss. no fuss.
Machinelearning
⭐
3,016
Machine learning resources
Awesome Ts Anomaly Detection
⭐
2,320
List of tools & datasets for anomaly detection on time-series data.
Bolt
⭐
2,312
10x faster matrix and vector operations
Pdftabextract
⭐
1,994
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Invoice2data
⭐
1,570
Extract structured data from PDF invoices
Research
⭐
1,550
novel deep learning research works with PaddlePaddle
Pycm
⭐
1,413
Multi-class confusion matrix library in Python
Awesome Fraud Detection Papers
⭐
1,364
A curated list of data mining papers about fraud detection.
Clevercsv
⭐
1,168
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Awesome Fl
⭐
1,103
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
Nfstream
⭐
1,015
NFStream: a Flexible Network Data Analysis Framework.
Astroml
⭐
984
Machine learning, statistics, and data mining for astronomy and astrophysics
Deep_gcns_torch
⭐
940
Pytorch Repo for DeepGCNs (ICCV'2019 Oral, TPAMI'2021), DeeperGCN (arXiv'2020) and GNN1000(ICML'2021): https://www.deepgcns.org
Pyclustering
⭐
853
pyclustering is a Python, C++ data mining library.
Pyhealth
⭐
825
A Deep Learning Python Toolkit for Healthcare Applications.
Feature Engineering And Feature Selection
⭐
798
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
My Tensorflow Tutorials
⭐
794
This repo contains all of my TensorFlow tutorials
Cookbook 2nd
⭐
773
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Aeon
⭐
723
A toolkit for conducting machine learning tasks with time series data
Unitypy
⭐
665
UnityPy is a Python module that makes it possible to extract/unpack and edit Unity assets
Interpretable_machine_learning_with_python
⭐
629
Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Awesome Deep Graph Clustering
⭐
626
Awesome Deep Graph Clustering is a collection of SOTA, novel deep graph clustering methods (papers, codes, and datasets).
Pm4py Core
⭐
617
Public repository for the PM4Py (Process Mining for Python) project.
Adbench
⭐
609
Official implementation of "ADBench: Anomaly Detection Benchmark".
Combo
⭐
607
(AAAI' 20) A Python Toolbox for Machine Learning Model Combination
Python Twitter Examples
⭐
570
Examples of using Python for Twitter social data mining, using the python-twitter-tools framework.
Pypots
⭐
558
A Python toolbox/library for reality-centric machine learning/deep learning on partially-observed time series with PyTorch, including SOTA models supporting tasks of imputation, classification, clustering, and forecasting on incomplete (irregularly-sampled) multivariate time series with NaN missing values/data. https://arxiv.org/abs/2305.18811
Instascrape
⭐
554
Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Cookbook 2nd Code
⭐
532
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Jekyll
⭐
498
Jekyll-based static site for The Programming Historian
Osintbuddy
⭐
498
Node graphs, OSINT data mining, and plugins. Connect unstructured and public data for transformative insights
Hearthbreaker
⭐
474
A Hearthstone: Heroes of WarCraft Simulator for the purposes of Machine Learning and Data Mining
Ail Framework
⭐
440
AIL framework - Analysis Information Leak framework
Rong360
⭐
438
User loan risk prediction
Dgfraud
⭐
432
A Deep Graph-based Toolbox for Fraud Detection
Chefboost
⭐
428
A Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4.5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting, Random Forest and Adaboost w/categorical features support for Python
Book Socialmediaminingpython
⭐
415
Companion code for the book "Mastering Social Media Mining with Python"
Rmdl
⭐
409
RMDL: Random Multimodel Deep Learning for Classification
Mli Resources
⭐
405
H2O.ai Machine Learning Interpretability Resources
Suod
⭐
371
(MLSys' 21) An Acceleration System for Large-scale Unsupervised Heterogeneous Outlier Detection (Anomaly Detection)
Pydatalab
⭐
347
open source for wechat-official-account (ID: PyDataLab)
Ml And Dm In Action
⭐
318
Code shared while learning machine learning and data mining
Artificial Adversary
⭐
317
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Python Fp Growth
⭐
316
An implementation of the FP-growth algorithm in pure Python.
Lasio
⭐
315
Python library for reading and writing well data using Log ASCII Standard (LAS) files
Pyss3
⭐
307
A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI)
Matrixprofile
⭐
297
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Welly
⭐
294
Welly helps with well loading, wireline logs, log quality, data science
Efficient Apriori
⭐
286
An efficient Python implementation of the Apriori algorithm.
Grimoirelab Perceval
⭐
281
Send Sir Perceval on a quest to retrieve and gather data from software repositories.
Smartproxy
⭐
276
HTTP(S)/SOCKS5 Rotating Residential proxies - Code examples & General information
Deepgraph
⭐
274
Analyze Data with Pandas-based Networks. Documentation:
Lagoujob
⭐
250
Job data mining repo for lagou.com
Tradingview Data Scraper
⭐
250
Extract price and indicator data from TradingView charts to create ML datasets
Tweetfeels
⭐
246
Real-time sentiment analysis in Python using Twitter's streaming API
Awesome Python Data Science Books
⭐
242
Probably the best curated list of data science books in Python
Imbalanced Ensemble
⭐
234
Class-imbalanced Ensemble Learning in Python. | A class-imbalanced/long-tail machine learning library
Gwu_data_mining
⭐
228
Materials for GWU DNSC 6279 and DNSC 6290.
Chirp
⭐
225
Interface to manage and centralize Google Alert information
Rightmove_webscraper.py
⭐
219
Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Lihang_algorithms
⭐
219
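Implementations of the algorithms in Li Hang's "Statistical Learning Methods" (《统计学习方法》), done two ways: in plain Python and with sklearn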
Python Dlpy
⭐
215
The SAS Deep Learning Python (DLPy) package provides the high-level Python APIs to deep learning methods in SAS Visual Data Mining and Machine Learning. It allows users to build deep learning models using friendly Keras-like APIs.
Zhihu Analysis Python
⭐
202
Social Network Analysis of Zhihu with Python
Data Science Resources
⭐
197
👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Toloka Kit
⭐
195
Toloka-Kit is a Python library for working with Toloka API.
Data Science Toolkit
⭐
185
Collection of stats, modeling, and data science tools in Python and R.
Learning Data Mining With Python
⭐
183
Code repo for Learning Data Mining with Python, published by Packt Publishing
Python_practice_of_data_analysis_and_mining
⭐
179
Source code and data accompanying the book "Python Data Analysis and Mining in Practice" (《Python数据分析与挖掘实战》)
Tipdm
⭐
178
TipDM modeling platform, an open-source data mining tool.
Prefixspan Py
⭐
176
The shortest yet efficient Python implementation of the sequential pattern mining algorithm PrefixSpan, closed sequential pattern mining algorithm BIDE, and generator sequential pattern mining algorithm FEAT.
Crowd Kit
⭐
174
Control the quality of your labeled data with the Python tools you already know.
Data Mining Conferences
⭐
173
Ranking, acceptance rate, deadline, and publication tips
Wrapper Feature Selection Toolbox Python
⭐
170
This toolbox offers 13 wrapper feature selection methods (PSO, GA, GWO, HHO, BA, WOA, etc.) with examples. It is simple and easy to implement.
Msnoise
⭐
156
A Python Package for Monitoring Seismic Velocity Changes using Ambient Seismic Noise | http://www.msnoise.org
Kddcup 2020
⭐
154
6th Solution for 2020-KDDCUP: Multi-Channel Retrieve and Sorting for Debiasing Recommender System
Spypi
⭐
145
An (un-)ethical hacking-station based on Raspberry Pi and Python
Machine_learning_for_good
⭐
145
Machine learning fundamentals lesson in interactive notebooks
Alimusic
⭐
143
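🎼 Tianchi Alibaba Music popularity trend prediction competition. The project covers all of the core code from the preliminary round through the semifinal round. The aggregated semifinal data is available on Baidu Net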
Transtab
⭐
140
NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables
Awesome Ensemble Learning
⭐
129
Ensemble learning related books, papers, videos, and toolboxes
Sparselsh
⭐
128
A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Care Gnn
⭐
127
Code for CIKM 2020 paper Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters
Dcrn
⭐
126
[AAAI 2022] An official source code for paper Deep Graph Clustering via Dual Correlation Reduction.
Lab Workshops
⭐
120
Materials for workshops on text mining, machine learning, and data visualization
Lola
⭐
118
LoL (League of Legends) game data analysis / analytics
Emotion Recognition From Speech
⭐
116
A machine learning application for emotion recognition from speech
Bee University
⭐
111
Project that collects university admission cutoff scores for 2014-2018 and analyzes the data
Eyes
⭐
110
Public Opinion Mining System of Taiwanese Forums
1-100 of 394 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.