edaSQL is a Python library that links SQL to exploratory data analysis (EDA) and, beyond that, to data engineering. It aims to address many limitations of the SQL studios available on the market. Use SQL to query your tables, then pass the results to the EDA tools.
Install the dependency packages before installing edaSQL:
pip install pyodbc
pip install ipython
Optional dependency for better visualization (Jupyter Notebook):
pip install notebook
Now install the official Python package using pip:
pip install edaSQL
Alternatively, clone this repository and run the following from its root directory to install:
python setup.py install
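After installing, it can help to confirm that the packages are importable from the current environment. The helper below is a minimal sketch and not part of edaSQL: the name `check_installed` is hypothetical, and the import names (`pyodbc`, `IPython`, `notebook`, `edaSQL`) are assumed from the pip commands and the import statement used in this README.

```python
from importlib.util import find_spec


def check_installed(modules):
    """Map each top-level module name to True if it can be located, else False."""
    return {name: find_spec(name) is not None for name in modules}


if __name__ == "__main__":
    # Import names assumed from the install commands above.
    wanted = ["pyodbc", "IPython", "notebook", "edaSQL"]
    for name, ok in check_installed(wanted).items():
        print(f"{name}: {'found' if ok else 'missing'}")
```

Any package reported as missing can be installed with the corresponding `pip install` command above.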
Read the detailed documentation on readthedocs.io (still under development).
edaSQL is released under the MIT license.
Stuck on your edaSQL code or have any other questions? Don't hesitate to send me an email ([email protected]).
Access the sample Jupyter Notebook here.
Access the sample data used in this repo.
edaSQL for DataFrames: if your source is a CSV or Excel file, read it with pandas and start from step 3, Data Overview.
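For the CSV case, a minimal sketch might look like the following. The inline CSV text and its column names are made up for illustration; with a real file you would pass its path to `pd.read_csv` (or use `pd.read_excel` for Excel). The resulting DataFrame can then be handed to `edaSQL.EDA(dataFrame=...)` exactly as in the Data Overview step.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for a real CSV file.
csv_text = """employee_id,age,department
1,34,Sales
2,29,Development
3,41,Sales
"""

# pd.read_csv accepts a file path or any file-like object.
data = pd.read_csv(io.StringIO(csv_text))
print(data.shape)

# With edaSQL installed, the DataFrame can be passed straight to the EDA step:
# eda = edaSQL.EDA(dataFrame=data)
```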
# Import packages
import edaSQL
import pandas as pd

# Connect to the database
edasql = edaSQL.SQL()
edasql.connectToDataBase(server='your server name',
                         database='your database',
                         user='username',
                         password='password',
                         sqlDriver='ODBC Driver 17 for SQL Server')

# Query the data into a pandas DataFrame
sampleQuery = "select * from INX"
data = pd.read_sql(sampleQuery, edasql.dbConnection)
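Hard-coded credentials in a notebook are easy to leak, so one option is to read them from environment variables. The sketch below only builds an ODBC connection string of the kind pyodbc accepts. Whether edaSQL assembles its connection this way internally is an assumption, and the helper name `build_connection_string` and the `EDASQL_*` variable names are hypothetical.

```python
import os


def build_connection_string(server, database, user, password,
                            driver="ODBC Driver 17 for SQL Server"):
    """Assemble a semicolon-separated ODBC connection string."""
    parts = {
        "DRIVER": "{" + driver + "}",
        "SERVER": server,
        "DATABASE": database,
        "UID": user,
        "PWD": password,
    }
    return ";".join(f"{key}={value}" for key, value in parts.items())


# Hypothetical environment variable names; placeholders are used when unset.
conn_str = build_connection_string(
    server=os.environ.get("EDASQL_SERVER", "localhost"),
    database=os.environ.get("EDASQL_DATABASE", "mydb"),
    user=os.environ.get("EDASQL_USER", "username"),
    password=os.environ.get("EDASQL_PASSWORD", "password"),
)
# With pyodbc installed: connection = pyodbc.connect(conn_str)
```

The same `os.environ.get(...)` values could equally be passed to `edasql.connectToDataBase(...)` in place of string literals. Values containing `;` or braces need extra escaping, which this sketch does not handle.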
# 3. Data Overview
insights = edaSQL.EDA(dataFrame=data, HTMLDisplay=True)
dataInsights = insights.dataInsights()
deepInsights = insights.deepInsights()
# Correlation (Pearson, Spearman, Kendall)
eda = edaSQL.EDA(dataFrame=data)
eda.pearsonCorrelation()
eda.spearmanCorrelation()
eda.kendallCorrelation()
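To make the difference between these coefficients concrete, here is a small sketch that uses pandas directly rather than edaSQL; how edaSQL computes or renders its correlations internally is not shown in this README. The data is made up: `y` grows monotonically but not linearly with `x`, so the rank-based Spearman coefficient reaches 1 while Pearson stays slightly lower.

```python
import pandas as pd

# Hypothetical data: y = x ** 3, monotonic but non-linear.
df = pd.DataFrame({"x": [1, 2, 3, 4, 5], "y": [1, 8, 27, 64, 125]})

# Pearson measures linear association; Spearman measures monotonic (rank) association.
pearson = df.corr(method="pearson").loc["x", "y"]
spearman = df.corr(method="spearman").loc["x", "y"]

print(f"pearson={pearson:.3f} spearman={spearman:.3f}")
```

pandas also accepts `method="kendall"`; in current pandas versions that option relies on SciPy being installed.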
# Missing values
eda.missingValuesPlot(plot='matrix')
eda.missingValuesPlot(plot='bar')
eda.missingValuesPlot(plot='heatmap')
eda.missingValuesPlot(plot='dendrogram')
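The plots above show where values are missing. For a numeric summary of the same information, plain pandas is enough; the sketch below is independent of edaSQL and uses made-up data.

```python
import pandas as pd

# Hypothetical data with a few gaps.
df = pd.DataFrame({
    "age": [34, None, 41, 29],
    "department": ["Sales", "Development", None, None],
    "salary": [5200, 4800, 6100, 4500],
})

# Count and percentage of missing entries per column.
missing_count = df.isna().sum()
missing_pct = df.isna().mean() * 100

summary = pd.DataFrame({"missing": missing_count, "percent": missing_pct})
print(summary)
```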
# Outliers
eda.outliersVisualization(plot='box')
eda.outliersVisualization(plot='scatter')
outliers = eda.getOutliers()
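This README does not state which rule `getOutliers()` applies. As an illustration only, the sketch below implements the common 1.5 × IQR rule (the rule most box plots use for their whiskers) with the standard library. The function name `iqr_outliers` and the sample values are hypothetical.

```python
from statistics import quantiles


def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR below Q1 or above Q3."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Hypothetical ages with one implausible entry.
ages = [29, 31, 33, 34, 35, 36, 38, 41, 95]
print(iqr_outliers(ages))  # -> [95]
```

Here Q1 is 33 and Q3 is 38, so the accepted range is 25.5 to 45.5 and only 95 falls outside it.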