Awesome Open Source
Awesome Open Source
Combined Topics
statistical-analysis
x
Advertising
📦 9
All Projects
Application Programming Interfaces
📦 120
Applications
📦 181
Artificial Intelligence
📦 72
Blockchain
📦 70
Build Tools
📦 111
Cloud Computing
📦 79
Code Quality
📦 28
Collaboration
📦 30
Command Line Interface
📦 48
Community
📦 81
Companies
📦 60
Compilers
📦 60
Computer Science
📦 74
Configuration Management
📦 39
Content Management
📦 167
Control Flow
📦 197
Data Formats
📦 77
Data Processing
📦 266
Data Storage
📦 132
Economics
📦 60
Frameworks
📦 198
Games
📦 122
Graphics
📦 103
Hardware
📦 148
Integrated Development Environments
📦 47
Learning Resources
📦 147
Legal
📦 28
Libraries
📦 119
Lists Of Projects
📦 21
Machine Learning
📦 336
Mapping
📦 61
Marketing
📦 15
Mathematics
📦 55
Media
📦 228
Messaging
📦 97
Networking
📦 304
Operating Systems
📦 84
Operations
📦 120
Package Managers
📦 52
Programming Languages
📦 229
Runtime Environments
📦 96
Science
📦 42
Security
📦 375
Social Media
📦 26
Software Architecture
📦 70
Software Development
📦 68
Software Performance
📦 57
Software Quality
📦 127
Text Editors
📦 45
Text Processing
📦 131
User Interface
📦 310
User Interface Components
📦 465
Version Control
📦 29
Virtualization
📦 68
Web Browsers
📦 38
Web Servers
📦 25
Web User Interface
📦 194
The Top 331 Statistical Analysis Open Source Projects on Github
Categories
>
Mathematics
>
Statistical Analysis
Pymc3
⭐
6,025
Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Aesara
Git Quick Stats
⭐
4,963
▁▅▆▃▅ Git quick statistics is a simple and efficient way to access various statistics in git repository.
Miller
⭐
4,387
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Tablesaw
⭐
2,698
Java dataframe and visualization library
Gitinspector
⭐
1,997
📊 The statistical analysis tool for git repositories
Ggstatsplot
⭐
1,251
Enhancing `ggplot2` plots with statistical analysis 📊🎨📣
Pycm
⭐
1,150
Multi-class confusion matrix library in Python
Dataframe
⭐
1,137
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Python For Probability Statistics And Machine Learning
⭐
423
Jupyter Notebooks for Springer book "Python for Probability, Statistics, and Machine Learning"
Atsd Use Cases
⭐
335
Axibase Time Series Database: Usage Examples and Research Articles
Expan
⭐
235
Open-source Python library for statistical analysis of randomised control trials (A/B tests)
Scikit Posthocs
⭐
219
Multiple Pairwise Comparisons (Post Hoc) Tests in Python
Morpheus Core
⭐
186
The foundational library of the Morpheus data science framework
Ee Outliers
⭐
178
Open-source framework to detect outliers in Elasticsearch events
Data Science Toolkit
⭐
164
Collection of stats, modeling, and data science tools in Python and R.
Hdrhistogram_rust
⭐
131
A port of HdrHistogram to Rust
Methylkit
⭐
118
R package for DNA methylation analysis
Ck Autotuning
⭐
57
CK automation actions to let users implement portable, customizable and reusable program workflows for reproducible, collaborative and multi-objective benchmarking, optimization and SW/HW co-design:
Sccoda
⭐
56
A statistical test for compositional changes in scRNA
Metanumerics
⭐
50
Meta.Numerics is library for advanced numerical computing on the .NET platform. It offers an object-oriented API for statistical analysis, advanced functions, Fourier transforms, numerical integration and optimization, and matrix algebra.
Lisp Stat
⭐
46
Lisp-Stat main system
Webmc3
⭐
43
A web interface for exploring PyMC3 traces
Srqm
⭐
39
An introductory statistics course for social scientists, using Stata
Cd Diagram
⭐
37
Critical difference diagram with Wilcoxon-Holm post-hoc analysis.
Loon
⭐
35
A Toolkit for Interactive Statistical Data Visualization
Statkit
⭐
35
A collection of statistical analysis tools for your Swift programs.
Kalepy
⭐
33
Kernel Density Estimation and (re)sampling
Powershell Statistics
⭐
33
Statistical analysis of data on the command line
Pyautofit
⭐
32
Datadoubleconfirm
⭐
31
Simple datasets and notebooks for data visualization, statistical analysis and modelling - with write-ups here: http://projectosyo.wix.com/datadoubleconfirm.
Query2report
⭐
31
Query2Report is a simple open source business intelligence platform that allows users to build report/dashboard for business analytics or enterprise reporting
Statistical Modeling Examples
⭐
28
Basic statistical modelling examples.
Boba
⭐
28
Specifying and executing multiverse analysis
Wink Statistics
⭐
27
Fast & numerically stable statistical analysis
Deepstats
⭐
27
deepStats: a stastitical toolbox for deeptools and genomic signals
Kalmanpy
⭐
27
Implementation of Kalman Filter in Python
Calm
⭐
24
Conditional Associative Logic Memory
Pydtmc
⭐
24
A framework for discrete-time Markov chains analysis.
Treecut
⭐
23
Find nodes in hierarchical clustering that are statistically significant
Springboard Data Science Immersive
⭐
23
Mitre
⭐
22
The Microbiome Interpretable Temporal Rule Engine
Matrix
⭐
18
C++ Matrix -- High performance and accurate (e.g. edge cases) matrix math library with expression template arithmetic operators
Jupyter Notebooks Statistic Walk Throughs Using R
⭐
18
Jupyter notebooks with examples of statistical methods and analyses using R.
Binninganalysis.jl
⭐
17
Statistical standard error estimation tools for correlated data
Microscope
⭐
17
ChIP-seq/RNA-seq analysis software suite for gene expression heatmaps
Titanic_survival_exploration
⭐
16
Udacity Machine Learning Nano degree Program Project Predicting Passenger Survival
Experimentaldesign.jl
⭐
14
Design of Experiments in Julia
Id2t
⭐
13
Official ID2T repository. ID2T creates labeled IT network datasets that contain user defined synthetic attacks.
Nemene
⭐
13
A practical nonparametric statistical tests library for JavaScript
Statisticalhypothesistests
⭐
13
統計的仮説検定&信頼区間推定用スクリプトまとめ
Appliedstats
⭐
12
A repo with homeworks and labs from a course on applied stats taken by me during my bachelor's degree in MIPT, Ru. Course authors: Andrii Hraboviy, @andriygav and Oleg Bakhteev, @bahleg.
Thermodynamicanalyticstoolkit
⭐
12
Sampling-based approach to analyse neural networks using TensorFlow
Uoft_sta130
⭐
11
Introduction to Statistical Reasoning and Data Science
Mlr
⭐
11
Multiple linear regression with statistical inference, residual analysis, direct CSV loading, and other features
Atomai
⭐
11
Deep and machine learning for atomic-scale and mesoscale data
Statistics
⭐
10
A Crystal shard to perform descriptive statistics and sampling on popular distributions
Est Computacional 2019
⭐
10
Class notes for the computational statistics class (Spanish), master in Data Science ITAM
Mcmcdiagnostictools.jl
⭐
10
Roaster
⭐
10
R - Fetch, build and deploy.
Whatscloud
⭐
10
WhatsCloud is an android app which allows you to analyze your WhatsApp chat history on the fly with only one click
Srm
⭐
10
This Chrome Extension automatically performs SRM checks and flags potential data quality issues on supported experimentation platforms.
Kitsu Season Trends
⭐
10
🦊 Kitsu seasonal anime trends
Edx
⭐
10
Data Science courses in R from HarvardX
Triceratops
⭐
9
Tool for Rating Interesting Candidate Exoplanets and Reliability Analysis of Transits Originating from Proximate Stars
Chinaipo Statistic
⭐
9
📊 新三板企业相关数据分析
Hilostwitter
⭐
9
Códigos R de los hilos de Twitter (R codes of Twitter activity) https://twitter.com/dadosdelaplace
Pomashiny
⭐
9
🍎 Web-based User-friendly Workflow for Metabolomics and Proteomics Data Analysis
Ciencia De Dados Projetos
⭐
9
Repositório para meus projetos de ciência de dados (Web scraping e automação + análise exploratória de dados + machine learning + sistema de recomendação)
Terapca
⭐
8
TeraPCA is a multithreaded C++ software suite based on Intel's MKL library (or any other BLAS and/or LAPACK distribution). TeraPCA features no dependencies to external libraries and combines the robustness of subspace iteration with the power of randomization.
Statistics For Data Science
⭐
8
Learning Statistics is one of the most Important step to get into the World of Data Science and Machine Learning. Statistics helps us to know data in a much better way and explains the behavior of the data based upon certain factors. It has many Elements which help us to understand the data better that includes Probability, Distributions, Descriptive Analysis, Inferential Analysis, Comparative Analysis, Chi-Square Test, T Test, Z test, AB Testing etc.
Spotifystatistics
⭐
8
Personalized stats for your Spotify profile.
Imap4
⭐
8
iMap4 - Spatial mapping of eye movement data (e.g., fixation map) using Linear Mixed Models
Anki_revlog_analysis
⭐
8
Anki 复习数据处理与分析
Sparklyr.flint
⭐
8
Sparklyr extension making Flint time series library functionalities (https://github.com/twosigma/flint) easily accessible through R
P2n V3
⭐
8
Open source patent analytics toolkit
Doex
⭐
8
Python library for Design and Analysis of Experiments
Autostopwordgen
⭐
7
stopwordgen automatically builds the stop words for a given dataset.
Fhirextinguisher
⭐
7
FHIR Search Interface & Flatten FHIR resources into CSVs/DataFrames using FHIRPath.
Network Intrusion Detection Using Machine Learning
⭐
7
A Novel Statistical Analysis and Autoencoder Driven Intelligent Intrusion Detection Approach
Shingho
⭐
7
Shingho is a PySpark based statistical library designed for Big Data applications.
Dgoogleanalytics
⭐
7
Classe criada para facilitar a integração do Delphi com Google Analytics
Volbx
⭐
7
Graphical tool for data manipulation written in C++/Qt
Covid19rj
⭐
7
Repositório da iniciativa COVID19: Observatório Fluminense, com dados, gráficos e relatórios sobre a evolução pandemia de COVID-19, com especial interesse no Estado do RJ.
Lfq Analyst
⭐
7
The repo for LFQ-Analyst
Batbstats
⭐
6
🛹 A GraphQL API for Battle at the Berrics data
Risk_calculation_using_backward_elimination_algorithm_in_life_insurance
⭐
6
Implementation of backward elimination algorithm used for dimensionality reduction for improving the performance of risk calculation in life insurance industry.
Mentalhealthdataanalysis
⭐
6
Data Analysis on Mental Health.
Pdsphere
⭐
6
A Riemannian framework for statistical analysis of topological persistence diagrams
Histogram
⭐
6
A python histogram object for scientific data-reduction and statistical analysis
Wilson Interval
⭐
6
A comprehensive module used to calculate the high bound, low bound, and center of a Wilson score interval.
Infectious_disease_predictability
⭐
6
Code and data for On the Predictability of Infectious Disease Outbreaks by SV Scarpino & G Petri
Bitcoin Analysis
⭐
6
Compile, analyse, plot cryptocurrency protocols, networks, marketplaces
Iv And Woe Python
⭐
6
This repository contains analysis of churn in telephone service company (using IV and WOE), comparison of effect size and information value and quick tutorial how to use information value module (created for this analysis).
Camps
⭐
6
Community Atmospheric Model Post-Processing System
Netodyssey
⭐
6
A C# tool to compute windowed statistical estimations of network traffic.
Datascience_in_small_notes
⭐
6
Some little notes from the author for everyone who wants to know or learn about the process that a data scientist must do from the beginning of data collection to making predictions with a model that has been built. These notes are based on the knowledge that the authors have learned and implemented. Enjoy it!
Randomjs
⭐
6
A JavaScript module for generating random seeded distributions and its statistical analysis.
Data Frame
⭐
6
Data frames for Common Lisp
Data Analysis And Visualization Training
⭐
6
Data Analysis and Visualization Training for computer science students.
Powsc
⭐
5
power analysis for scRNA-seq
1-100 of 331 projects
Next >
Advertising
📦 9
All Projects
Application Programming Interfaces
📦 120
Applications
📦 181
Artificial Intelligence
📦 72
Blockchain
📦 70
Build Tools
📦 111
Cloud Computing
📦 79
Code Quality
📦 28
Collaboration
📦 30
Command Line Interface
📦 48
Community
📦 81
Companies
📦 60
Compilers
📦 60
Computer Science
📦 74
Configuration Management
📦 39
Content Management
📦 167
Control Flow
📦 197
Data Formats
📦 77
Data Processing
📦 266
Data Storage
📦 132
Economics
📦 60
Frameworks
📦 198
Games
📦 122
Graphics
📦 103
Hardware
📦 148
Integrated Development Environments
📦 47
Learning Resources
📦 147
Legal
📦 28
Libraries
📦 119
Lists Of Projects
📦 21
Machine Learning
📦 336
Mapping
📦 61
Marketing
📦 15
Mathematics
📦 55
Media
📦 228
Messaging
📦 97
Networking
📦 304
Operating Systems
📦 84
Operations
📦 120
Package Managers
📦 52
Programming Languages
📦 229
Runtime Environments
📦 96
Science
📦 42
Security
📦 375
Social Media
📦 26
Software Architecture
📦 70
Software Development
📦 68
Software Performance
📦 57
Software Quality
📦 127
Text Editors
📦 45
Text Processing
📦 131
User Interface
📦 310
User Interface Components
📦 465
Version Control
📦 29
Virtualization
📦 68
Web Browsers
📦 38
Web Servers
📦 25
Web User Interface
📦 194
"GitHub" is a registered trademark of GitHub, Inc. Awesome Open Source is not affiliated with GitHub.