Awesome Open Source
Awesome Open Source

Python-for-Data-Analysis-and-Machine-Learning

This repo contains the exercises made for the course of Jose Portilla on Udemy.

The course teaches how to use NumPy, Pandas, Seaborn , Matplotlib , Plotly , Scikit-Learn , Machine Learning models, Tensorflow, Deep Neural Networks and more.

The exercises offer the possibility to reproduce the real analysis process using real data (like datasets on Kaggle) or fake data (created for the scope of the course).

Details of each project:

  1. 911 Calls Data Capstone Project .ipynb : Analysis of the Kaggle 911 Dataset with visualizations in Seaborn and Matplotlib.

  2. Choropleth Maps Exercise .ipynb: choropleth maps with Plotly.

  3. Decision Trees and Random Forest Project .ipynb: Analysis of a dataset from LendingClub.com with visualizations in Seaborn and Matplotlib and Machine Learning models with Scikit-Learn.

  4. Ecommerce Purchases Exercise .ipynb: analysis of a dataset with fake Amazon data with Pandas.

  5. K Nearest Neighbors Project .ipynb: using Numpy, Maplotlib, Pandas, Seaborn to analyze a dataset and the applying the KNN model to the data using Scikit-Learn.

  6. Linear Regression - Project Exercise .ipynb: using Numpy, Maplotlib, Pandas, Seaborn to analyze a dataset about Ecommerce customers and the applying the Linear Regression model to the data using Scikit-Learn.

  7. Logistic Regression Project .ipynb: using Numpy, Maplotlib, Pandas, Seaborn to work with a fake advertising data set, indicating whether or not a particular internet user clicked on an Advertisement. Cresating a Logistic Regression model with Scikit-Learn to predict whether or not the users will click on an ad based off the features of the users.

  8. NLP Project .ipynb: Classify Yelp Reviews into 1 star or 5 star categories based off the text content in the reviews. The analysis has been done using the Yelp Review Data Set from Kaggle.

  9. SF Salaries Exercise .ipynb: simple Pandas analysis using the SF Salaries datased from Kaggle.

  10. Support Vector Machines Project .ipynb: SVM application with Scikit-Learn on the Iris dataset.

  11. Tensorflow Project Exercise .ipynb: Tensorflow application on the UCI dataset - created a Deep Neural Network using the contrib.learn module.

Related Awesome Lists
Top Programming Languages
Top Projects

Get A Weekly Email With Trending Projects For These Topics
No Spam. Unsubscribe easily at any time.
Python (805,863
Jupyter Notebook (153,914
Machine Learning (37,019
Dataset (33,042
Pandas (6,601
Numpy (5,903
Data Analysis (4,798
Kaggle (4,416
Deep Neural Networks (3,855
Scikit Learn (2,352
Udemy (1,603
Ipynb (593
Jose (581
Seaborn (574