Awesome Open Source
Search results for data analysis
2,015 search results found
Superset
⭐
58,778
Apache Superset is a Data Visualization and Data Exploration Platform
Scikit Learn
⭐
57,160
scikit-learn: machine learning in Python
Pandas
⭐
41,935
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Metabase
⭐
35,600
The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Streamlit
⭐
29,794
Streamlit — A faster way to build and share data apps.
Ai Expert Roadmap
⭐
27,583
Roadmap to becoming an Artificial Intelligence Expert in 2022
Gradio
⭐
25,823
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Cyberchef
⭐
25,521
The Cyber Swiss Army Knife - a web app for encryption, encoding, compression and data analysis
Data Science For Beginners
⭐
25,362
10 Weeks, 20 Lessons, Data Science for All!
Goaccess
⭐
17,131
GoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
Best Of Ml Python
⭐
14,990
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Dataease
⭐
14,295
An open source data visualization and analysis tool that anyone can use.
Airbyte
⭐
12,918
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Ydata Profiling
⭐
11,983
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Openrefine
⭐
10,106
OpenRefine is a free, open source power tool for working with messy data and improving it
Pandas_exercises
⭐
9,667
Practice your pandas skills!
Pandas Ai
⭐
9,533
Chat with your data (SQL, CSV, pandas, polars, noSQL, etc). PandasAI makes data analysis conversational
Mlcourse.ai
⭐
9,376
Open Machine Learning Course
Statsmodels
⭐
9,242
Statsmodels: statistical modeling and econometrics in Python
Pygwalker
⭐
8,698
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
Akshare
⭐
8,269
AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open source financial data interface library.
Ai Learn
⭐
8,256
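An artificial intelligence learning roadmap that collects nearly 200 hands-on cases and projects, with free accompanying teaching materials, suitable for complete beginners and geared toward job-ready practice! Covers popular areas including: Py tensorflow machine-learning, deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv, and more.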
Cleanlab
⭐
8,182
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Pyod
⭐
7,751
A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
Gonum
⭐
6,979
Gonum is a set of numeric libraries for the Go programming language. It contains libraries for matrices, statistics, optimization, and more
Cudf
⭐
6,936
cuDF - GPU DataFrame Library
Imbalanced Learn
⭐
6,680
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Alluxio
⭐
6,612
Alluxio, data orchestration for analytics and machine learning in the cloud
Pachyderm
⭐
6,035
Data-Centric Pipelines and Data Versioning
Awesome R
⭐
5,645
A curated list of awesome R packages, frameworks and software.
Data Analysis And Machine Learning Projects
⭐
5,596
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Jimureport
⭐
5,467
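🔥 A data visualization reporting tool with an Excel-like operating style: design reports online by drag and drop! Features cover report design, chart reports, print design, large-screen dashboard design, and more, completely free! Following the product philosophy of "simple, easy to use, professional", greatly lowering the report…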
Cloudquery
⭐
5,380
The open source high performance data integration platform built for developers.
Knowledge Repo
⭐
5,344
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
Growthbook
⭐
5,285
Open Source Feature Flagging and A/B Testing Platform
Weibospider
⭐
4,788
⚡ A distributed crawler for Weibo, built with Celery and Requests.
Datasciencepython
⭐
4,776
common data analysis and machine learning tasks using python
Octosql
⭐
4,600
OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.
Orange3
⭐
4,469
🍊 📊 💡 Orange: Interactive data analysis
Danfojs
⭐
4,416
Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating and processing structured data.
Dtale
⭐
4,407
Visualizer for pandas data structures
Machine Learning Mindmap
⭐
4,400
A mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.
Flyte
⭐
4,380
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
Machine_learning_complete
⭐
4,296
A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.
Datascience
⭐
3,955
Curated list of Python resources for data science.
Data Science
⭐
3,898
Collection of useful data science topics along with articles, videos, and code
Sql Translator
⭐
3,842
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source.
Rath
⭐
3,717
Next generation of automated data exploratory analysis and visualization platform.
Awesome Geospatial
⭐
3,703
Long list of geospatial tools and resources
Plotnine
⭐
3,677
A Grammar of Graphics for Python
Pydata Notebook
⭐
3,666
Chinese translation notes for Python for Data Analysis, 2nd Edition (2017)
Matplotplusplus
⭐
3,496
Matplot++: A C++ Graphics Library for Data Visualization 📊🗾
Missingno
⭐
3,472
Missing data visualization module for Python.
Tablesaw
⭐
3,328
Java dataframe and visualization library
Ml Workspace
⭐
3,197
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Running_page
⭐
3,052
Make your own running home page
Igel
⭐
3,037
a delightful machine learning tool that allows you to train, test, and use models without writing code
Lance
⭐
3,003
Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, and PyArrow, with more integrations coming.
Xlearn
⭐
3,000
High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
Tad
⭐
2,939
A desktop application for viewing and analyzing tabular data
Datastation
⭐
2,760
App to easily query, script, and visualize data from every database, file, and API.
Pandas Datareader
⭐
2,733
Extract data from a wide range of Internet sources into a pandas DataFrame.
Sweetviz
⭐
2,687
Visualize and compare datasets, target values and associations, with one line of code.
Evadb
⭐
2,561
Database system for AI-powered apps
Quadratic
⭐
2,485
Quadratic | Data Science Spreadsheet with Python & SQL
Gopup
⭐
2,451
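Data interfaces: Baidu, Google, Toutiao, and Weibo indices, macroeconomic data, interest rate data, currency exchange rates, up-and-coming ("qianlima") and unicorn companies, and Xinwen Lianbo broadcast transcripts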
Data Science Roadmap
⭐
2,445
Data Science Roadmap from A to Z
Financial Knowledge Graphs
⭐
2,370
Workflow for building a small-scale financial knowledge graph
Aachartkit Swift
⭐
2,343
📈📊📱💻🖥️ An elegant modern declarative data visualization chart framework for iOS, iPadOS and macOS. Extremely powerful, supports line, spline, area, areaspline, column, bar, pie, scatter, angular gauges, arearange, areasplinerange, columnrange, bubble, box plot, error bars, funnel, waterfall and polar chart types.
Root
⭐
2,329
The official repository for ROOT: analyzing, storing and visualizing big data, scientifically
Incubator Devlake
⭐
2,322
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
Awesome Ts Anomaly Detection
⭐
2,320
List of tools & datasets for anomaly detection on time-series data.
Mito
⭐
2,201
The mitosheet package, trymito.io, and other public Mito code.
Linear Algebra With Python
⭐
2,160
Lecture notes for linear algebra featuring Python. This series of lecture notes walks you through the must-know concepts that form the foundation of data science and advanced quantitative skill sets. Suitable for statisticians, econometricians, quantitative analysts, data scientists, and others who want to quickly refresh their linear algebra with the help of Python computation and visualization.
Dataframe
⭐
2,129
C++ DataFrame for statistical, financial, and ML analysis, written in modern C++ using native types and contiguous memory storage
Awesome Python Data Science
⭐
2,126
Probably the best curated list of data science software in Python.
Secretflow
⭐
2,101
A unified framework for privacy-preserving data analysis and machine learning
Graphic Walker
⭐
2,077
An open source alternative to Tableau. Easily embedded in any web apps.
100 Pandas Puzzles
⭐
1,977
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Data_hacks
⭐
1,875
Command line utilities for data analysis
Awesome Business Intelligence
⭐
1,862
Actively curated list of awesome BI tools. PRs welcome!
Pymc Resources
⭐
1,838
PyMC educational resources
Vizzu Lib
⭐
1,827
Library for animated data visualizations and data stories.
Tabix
⭐
1,811
Tabix.io UI
Sqliteviz
⭐
1,811
Instant offline SQL-powered data visualisation in your browser
Pandas Videos
⭐
1,808
Jupyter notebook and datasets from the pandas Q&A video series
Plots.jl
⭐
1,775
Powerful convenience for Julia visualizations and data analysis
Octosuite
⭐
1,763
GitHub Data Analysis Framework.
Datatable
⭐
1,763
A Python package for manipulating 2-dimensional tabular data structures
Elementary
⭐
1,721
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Tiledb
⭐
1,700
The Universal Storage Engine
Nannyml
⭐
1,695
nannyml: post-deployment data science in python
Tsne Cuda
⭐
1,610
GPU Accelerated t-SNE for CUDA with Python bindings
Datart
⭐
1,593
Datart is a next generation Data Visualization Open Platform
Ai For Security Learning
⭐
1,571
Security scenarios, AI-based security algorithms, and industry practice in security data analysis
Dat8
⭐
1,549
General Assembly's 2015 Data Science course in Washington, DC
Spark Py Notebooks
⭐
1,515
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Re Data
⭐
1,499
re_data - fix data issues before your users & CEO discover them 😊
Hacknical
⭐
1,498
Hacknical, hacker & technical. A website for GitHub users to build a better resume.
Bdash
⭐
1,468
Simple SQL Client for lightweight data analysis.
Related Searches
Python Data Analysis (1,858)
Jupyter Notebook Data Analysis (1,768)
Machine Learning Data Analysis (931)
1-100 of 2,015 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.