Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Agroanalytics | 13 | 5 years ago | Python | |||||||
Agro Analytics - Data Mining/Machine Learning Project based on Agricultural datasets. For more info, go to www.agroanalytics.info | ||||||||||
Melodist | 12 | 2 | 3 years ago | 5 | May 05, 2020 | 2 | gpl-3.0 | Python | ||
MELODIST is an open-source toolbox written in Python for disaggregating daily meteorological time series to hourly time steps. It is licensed under GPLv3 (see license file). The software framework consists of disaggregation functions for each variable including temperature, humidity, precipitation, shortwave radiation, and wind speed. These functions can simply be called from a station object, which includes all relevant information about site characteristics. The data management of time series is handled using data frame objects as defined in the pandas package. In this way, input and output data can be easily prepared and processed. For instance, the pandas package is data i/o capable and includes functions to plot time series using the matplotlib library. | ||||||||||
Course Pandas | 11 | 3 years ago | Awk | |||||||
Data processing with Pandas course for the CM Hub at Imperial College | ||||||||||
Ut330b | 8 | 2 years ago | mit | Python | ||||||
Operating system independent controller and data visualizer for the Uni-Trend UT330B temperature and humidity logger. Full documentation provided. Uses Python, Pandas, and Bokeh. | ||||||||||
Raspberrypi Automatically Report | 8 | 9 months ago | gpl-3.0 | Python | ||||||
Easy, lightweight, Python Script for monitoring Raspberry Pi | ||||||||||
Pandas_degreedays | 8 | 4 years ago | 4 | June 22, 2015 | 4 | bsd-3-clause | Python | |||
A Python package to calculate degree days (DD or in french DJU - degré jour unifié) from measured outdoor temperatures and to make it possible to quantify drift of energy consumption for heating (or cooling) | ||||||||||
Air Quality Analysis | 8 | 2 years ago | mit | HTML | ||||||
Jupyter notebooks and Python code for analyzing air quality (fine particle, PM2.5) | ||||||||||
Hawaii Flask Api | 3 | 5 years ago | Jupyter Notebook | |||||||
Creating an API app with flask, python and sqlAlchemy | ||||||||||
Agrimine | 3 | 4 years ago | Python | |||||||
[CSIAR Inspire Challenge 2018] Using Machine Learning to improve agriculture in India | ||||||||||
Thesis | 2 | 5 years ago | Python | |||||||
Mining Sensor Data to Evaluate Indoor Environmental Quality of Public Educational Buildings |
Jupyter notebooks and Python code for analyzing air quality (fine particles, PM2.5)
1. Basic data visualization
2. Correlation of PM2.5
2.1 Correlation of PM2.5 with time
2.2. Correlation of PM2.5 with wind and temperature (data cleaning)
2.2. Correlation with wind and temperature (analysis)
2.3 Correlation with MERRA-2 data
2.4 Conversion wind (U,V) component, RH from temperatures
3.1 Data selection
3.2 Regression
TODO
Tool and packages
4. Credits
PDF version is in PDF folder, likewise HTML's
introduce to basic setup of folder, install pandas
, matplotlib
, seaborn
(using pip
for Python package), Anaconda
is a good choice if you are using Windows (or even Mac, Linux). Alternatively, try out Google Colaboratory
basic use of those tools (clean, explore, plot, interpret)
work with a CSV file from Airnow.gov
here are some graphs produced from this exercise
explore data source (specifically working with archieved meteorologcal data from NOAA.GOV
clean the data (which is formatted with Integrated Surface Data (ISD) style)
use windrose
package to make windrose plot
explore correlation between meteorological paramters to observed PM2.5 concentration such wind, temperature, height above ground
capture espisode and examine relevant inputs with PM2.5
some examples from this exercise
work with MERRA-2 reanalysis data from NASA
find the correlation from main groups (single level, surface turbulent flux, aerosols mixing ratio) and PM2.5
here is the 3 summary graphs:
a detour to look at conversion of wind data (U, V) vectors to speed and direction in degree
how to use MetPy packages calculate such conversion instead of manually undertake
explore data for the next which is selecting relevant data for predicting PM2.5
some graph examples:
compare values from different sources (such as from observed station, a public API, or reanalysis product)
correlation of wind speed in different altitude to PM2.5 concentration
preliminary heatmap (of all most input parameters, don't worry about the name just yet):
a final version of selected data with correlation with PM2.5
and if you are curious about the full name of each parameter, here it is. Note that in the final version of CSV data, all temperature was converted from Kelvin (K) to Celsius (C).
{'TQV': 'total_precipitable_water_vapor, kg m-2',
'T2MDEW': 'dew_point_temperature_at_2_m, K',
'HLML': 'surface_layer_height, m',
'FRCAN': 'areal_fraction_of_anvil_showers, 1',
'T2M': '2-meter_air_temperature, K',
'WS': 'observed ground wind speed, m/s',
'DISPH': 'zero_plane_displacement_height, m',
'TQL': 'total_precipitable_liquid_water, kg m-2',
'v_50m': 'wind speed at 50m, m/s',
'v_850': 'wind speed at 850hPa (~1450m)',
'v_2m': 'wind speed at 2m, m/s',
'CLDCR': 'cloud cover, 1',
'CIG': 'ceiling height dimension, m',
'PS': 'surface_pressure, Pa',
'RHOA': 'air_density_at_surface, kg m-3',
'H1000': 'height_at_1000_mb, m'}
Work with Scikit-learn
library with regression models such Linear, DecisionTree, RandomForest
Evaluate performance of each model and an ensamble by PM2.5 and meteorological data for Hanoi, 2018. Datasets are cleaned and reduced from the previous excercise
Apply a model with less feastures (DarkSky), but easiler to extract via API.
Graphs from this excercise:
perfomance on train dataset (using ensemble regression)
performance on test dataset
relative standard deviation on each model (lower is better)
an hourly update web-interface using the same concept can be found here with selected sites at my personal website b-io.info
Binh Nguyen, Air Quality Analysis, GitHub repository: https://github.com/binh-bk/air-quality-analysis