Awesome Open Source
Search results for data mining
926 search results found
Automlpipeline.jl
⭐
331
A package that makes it trivial to create and evaluate machine learning pipeline architectures.
Ml And Dm In Action
⭐
318
Code shared while learning machine learning and data mining
Artificial Adversary
⭐
317
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Python Fp Growth
⭐
316
An implementation of the FP-growth algorithm in pure Python.
Lasio
⭐
315
Python library for reading and writing well data using Log ASCII Standard (LAS) files
Ldetool
⭐
308
Code generator for fast log file parsers
Pyss3
⭐
307
A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainable AI)
Matrixprofile
⭐
297
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Welly
⭐
294
Welly helps with well loading, wireline logs, log quality, data science
Efficient Apriori
⭐
286
An efficient Python implementation of the Apriori algorithm.
Grimoirelab Perceval
⭐
281
Send Sir Perceval on a quest to retrieve and gather data from software repositories.
Smartproxy
⭐
276
HTTP(S)/SOCKS5 Rotating Residential proxies - Code examples & General information
Deepgraph
⭐
274
Analyze Data with Pandas-based Networks. Documentation:
2018 Dc Datagrand Textintelprocess
⭐
253
2018-DC "Datagrand Cup" Text Intelligent Processing Challenge: champion (1st/3131)
Tradingview Data Scraper
⭐
250
Extract price and indicator data from TradingView charts to create ML datasets
Lagoujob
⭐
250
Job data mining repo for lagou.com
Tweetfeels
⭐
246
Real-time sentiment analysis in Python using twitter's streaming api
Awesome Python Data Science Books
⭐
242
Probably the best curated list of data science books in Python
Pzad
⭐
235
Course "Applied Problems of Data Analysis" (Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University)
Imbalanced Ensemble
⭐
234
Class-imbalanced Ensemble Learning in Python. | A library for class-imbalanced / long-tail machine learning
Gwu_data_mining
⭐
228
Materials for GWU DNSC 6279 and DNSC 6290.
Chirp
⭐
225
Interface to manage and centralize Google Alert information
Lihang_algorithms
⭐
219
Implementations of the algorithms in Li Hang's "Statistical Learning Methods", done two ways: in plain Python and with sklearn
Rightmove_webscraper.py
⭐
219
Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Qminer
⭐
217
Analytic platform for real-time large-scale streams containing structured and unstructured data.
Awesome Deep Graph Anomaly Detection
⭐
215
Awesome graph anomaly detection techniques built based on deep learning frameworks. Collections of commonly used datasets, papers as well as implementations are listed in this github repository. We also invite researchers interested in anomaly detection, graph representation learning, and graph anomaly detection to join this project as contributors and boost further research in this area.
Python Dlpy
⭐
215
The SAS Deep Learning Python (DLPy) package provides the high-level Python APIs to deep learning methods in SAS Visual Data Mining and Machine Learning. It allows users to build deep learning models using friendly Keras-like APIs.
Statistical Learning
⭐
206
Lecture Slides and R Sessions for Trevor Hastie and Rob Tibshirani's "Statistical Learning" Stanford course
Zhihu Analysis Python
⭐
202
Social Network Analysis of Zhihu with Python
Data Science Resources
⭐
199
👨🏽‍🏫 Learn what data science is and why it's important in today's world. Are you interested in data science? 🔋
Striplog
⭐
199
Lithology and stratigraphic logs for wells or outcrop.
Raven
⭐
197
RAVEN is a flexible and multi-purpose probabilistic risk analysis, validation and uncertainty quantification, parameter optimization, model reduction and data knowledge-discovering framework.
Toloka Kit
⭐
195
Toloka-Kit is a Python library for working with Toloka API.
Data Science Toolkit
⭐
185
Collection of stats, modeling, and data science tools in Python and R.
Learning Data Mining With Python
⭐
183
Code repo for Learning Data Mining with Python, published by Packt Publishing
Emuto
⭐
183
manipulate JSON files
Sourced Ce
⭐
181
source{d} Community Edition (CE)
Python_practice_of_data_analysis_and_mining
⭐
179
Source code and data accompanying the book "Python Data Analysis and Mining in Action" (《Python数据分析与挖掘实战》)
Bella
⭐
179
Bella is a pure python post-exploitation data mining tool & remote administration tool for macOS. 🍎💻
Estadistica Con R
⭐
178
Personal notes on statistics, machine learning, and the R programming language
Tipdm
⭐
178
TipDM modeling platform, an open-source data mining tool.
Ayakashi
⭐
177
⚡ Ayakashi.io - The next generation web scraping framework
Prefixspan Py
⭐
176
The shortest yet efficient Python implementation of the sequential pattern mining algorithm PrefixSpan, closed sequential pattern mining algorithm BIDE, and generator sequential pattern mining algorithm FEAT.
Crowd Kit
⭐
174
Control the quality of your labeled data with the Python tools you already know.
Data Mining Conferences
⭐
173
Ranking, acceptance rate, deadline, and publication tips
Wrapper Feature Selection Toolbox Python
⭐
170
This toolbox offers 13 wrapper feature selection methods (PSO, GA, GWO, HHO, BA, WOA, etc.) with examples. It is simple and easy to implement.
Catmandu
⭐
170
Catmandu - a data processing toolkit
2017 Ccf Bdci Aijudge
⭐
170
2017-CCF-BDCI "Let AI Be the Judge" (preliminary round): 7th/415 (Top 1.68%)
Pipeline
⭐
169
the `pipeline` shell command
Cikm 2019 Analyticup
⭐
167
1st Solution for 2019-CIKM-Analyticup, Efficient and Novel Item Retrieval for Large-scale Online Shopping Recommendation
Openhistorian
⭐
166
The Open Source Time-Series Data Historian
Msnoise
⭐
156
A Python Package for Monitoring Seismic Velocity Changes using Ambient Seismic Noise | http://www.msnoise.org
Etl_unicorn
⭐
156
Data visualization, data mining, data processing, ETL
Kddcup 2020
⭐
154
6th Solution for 2020-KDDCUP: Multi-Channel Retrieve and Sorting for Debiasing Recommender System
Accelerator
⭐
150
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Machine_learning_for_good
⭐
145
Machine learning fundamentals lesson in interactive notebooks
Spypi
⭐
145
An (un-)ethical hacking-station based on Raspberry Pi and Python
Alimusic
⭐
143
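🎼 Tianchi Ali Music popularity trend prediction competition. The project covers all the core code from the preliminary round through the semifinal. The aggregated semifinal data can be found on 百度网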
Transtab
⭐
140
NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables
Xioc
⭐
140
Extract indicators of compromise from text, including "escaped" ones.
Wekadeeplearning4j
⭐
139
Weka package for the Deeplearning4j java library
Hefei_ecg_top1
⭐
137
"Hefei High-Tech Cup" ECG Human-Machine Intelligence Competition: ECG abnormal event prediction, TOP1 Solution
Xivapi.com
⭐
137
Source code for XIVAPI.com
Hust Homeworks
⭐
134
HUST Homeworks (Course design / Reports / Labs / etc.)
Pre Modern_chinese_corpus_dataset
⭐
132
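Pre-modern Chinese corpus dataset: natural language processing, corpus, ancient Chinese, classical Chinese, literary Chinese, digital humanities, computational language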
Awesome Ensemble Learning
⭐
129
Ensemble learning related books, papers, videos, and toolboxes
Sparselsh
⭐
128
A Locality Sensitive Hashing (LSH) library with an emphasis on large, high-dimensional datasets.
Care Gnn
⭐
127
Code for CIKM 2020 paper Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters
Pandora
⭐
127
PANDORA Advanced Machine Learning for Data Integration, Analysis, and Insightful Discoveries in Health and Disease 💻
Research_spatio Temporal Data Mining
⭐
126
A collection of research on spatio-temporal data mining
Dcrn
⭐
126
[AAAI 2022] An official source code for paper Deep Graph Clustering via Dual Correlation Reduction.
Lab Workshops
⭐
120
Materials for workshops on text mining, machine learning, and data visualization
Dh Core
⭐
118
Functional data science
Lola
⭐
118
LoL (League of Legends) game data analysis / analytics
Evine
⭐
117
Interactive CLI Web Crawler
Emotion Recognition From Speech
⭐
116
A machine learning application for emotion recognition from speech
Dataminingnotebooks
⭐
113
This is a collection of iPython notebooks from my course on data mining. Data used in the notebooks can be downloaded from the given links in the notebooks.
Bee University
⭐
111
Project that collects university admission cutoff scores for 2014-2018 and analyzes the data
Eyes
⭐
110
Public Opinion Mining System of Taiwanese Forums
Tiger
⭐
108
Python toolbox to evaluate graph vulnerability and robustness (CIKM 2021)
Cogcomp Nlpy
⭐
108
CogComp's light-weight Python NLP annotators
St Ssl
⭐
107
ST-SSL (STSSL): Spatio-Temporal Self-Supervised Learning for Traffic Flow Forecasting/Prediction
Books
⭐
106
Books related to AI/ML/DL/GENAI
Data Mining On Social Media
⭐
105
Python scripts to extract tweets and facebook posts from public users.
Pyprobables
⭐
104
Probabilistic data structures in python http://pyprobables.readthedocs.io/en/latest/index.
Graphcare
⭐
103
[ICLR'24] Enhancing Healthcare Predictions with Personalized Knowledge Graphs
Pathpy
⭐
102
pathpy is an OpenSource python package for the modeling and analysis of pathways and temporal networks using higher-order and multi-order graphical models
Gitlogg
⭐
102
💾 🧮 🤯 Parse the 'git log' of multiple repos to 'JSON'
Gundam
⭐
102
GUNDAM is a data management system that prioritizes data using language models.
Vizuka
⭐
100
Explore high-dimensional datasets and how your algo handles specific regions.
Socialite
⭐
99
SociaLite: query language for large-scale graph analysis and data mining
Anomaly Detection
⭐
99
Unsupervised and Semi-Supervised Anomaly Detection / IsolationForest / KernelPCA Detection / ADOA / etc.
Awesome Time Series Analysis
⭐
99
This list collects learning resources, tools, and datasets for time series analysis/time series data mining.
Instagram Comments Scraper
⭐
98
Instagram comment scraper using Python and Selenium. Saves the comments to Excel.
Bdc2019
⭐
98
2019 China Collegiate Computing Contest, Big Data Challenge: third-place solution
Blinkist M4a Downloader
⭐
97
Grabs all of the audio files from all of the Blinkist books
Teanaps
⭐
92
An open-source Python library for natural language processing and text analysis.
Introduction_to_data_mining_r_examples
⭐
91
R Code to accompany the book Introduction to Data Mining by Tan, Steinbach and Kumar (Code by Michael Hahsler)
Evalne
⭐
91
Source code for EvalNE, a Python library for evaluating Network Embedding methods.
Data Competitions
⭐
90
Data competition experience and solutions
101-200 of 926 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.