Ames_house_price_prediction

This is a house price prediction study which utilized Exploratory Data Analysis, Dealing with Missing Values, Linear Regression with LASSO and Ridge regularization to predict house prices in the Ames Housing Data Set
Alternatives To Ames_house_price_prediction
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
100 Days Of Ml Code201
2 months agoJupyter Notebook
A day to day plan for this challenge. Covers both theoritical and practical aspects
Ineuron Full Stack Data Science Assignments68
4 months agoJupyter Notebook
This Repository consists of Assignments and projects of the iNeuron Full Stack Data Science Course
Automobile Dataset Analysis20
2 years agoJupyter Notebook
This project analyzes and visualizes the Used Car Prices from the Automobile dataset in order to predict the most probable car price
Walmart Sales Prediction19
5 years agoJupyter Notebook
The Data Analysis Workshop18
2 years agomitJupyter Notebook
A New Interactive Approach to Learning Data Analysis
Machine Learning Bookcamp 202215
4 months agocc0-1.0Jupyter Notebook
Solutions for the Machine Learning Zoomcamp 2022 by DataTalks.Club.
Bangalore House Prediction App7
2 years agoJupyter Notebook
Predicts home prices of Bangalore. Used Flutter, Flask and Jupyter Notebook.
Predicting Baseball Statistics7
2 years agoJupyter Notebook
Predicting Baseball Statistics: Classification and Regression Applications in Python Using scikit-learn
Data Science7
10 months agoJupyter Notebook
EDA and Machine Learning Models in R and Python (Regression, Classification, Clustering, SVM, Decision Tree, Random Forest, Time-Series Analysis, Recommender System, XGBoost)
Olist_ecom_analysis6
a year ago4Jupyter Notebook
Data analysis about Brazilian e-commerce business Olist
Alternatives To Ames_house_price_prediction
Select To Compare


Alternative Project Comparisons
Readme

Case study: Feature Engineering-- Ames house price prediction

1 Problem statement

In this case study, you will prepare Ames Housing Dataset in a csv file in a way that it is suitable for a ML algorithm. You will achieve this by first exploring the data and performing feature transformations on provided dataset of house price prediction ML problem. You are required to train a ML model by using linear regression, ridge regression and lasso regression for predicting house prices.

2 Steps

  • 2.1 Load data set
  • 2.2 Exploratory Data Analysis (EDA)
    1. Histograms
    1. Heatmap
    1. Scatterplots

scatter-view

    1. Scatter matrix

scatter_matrix-view

    1. Correlation between other features and 'SalePrice'

The target 'SalePrice' variable is highly correlated with features such as OverallQual, GrLivArea, GarageCars, GarageArea and TotalBsmtSF among others.

  • 2.3 Process dataset for ML

Steps:

    1. Handle missing values
    1. Fill nulls for 'LotFrontage' with median value calculated after grouping by 'Neighborhood'
    1. Fill nulls for 'GarageYrBlt','MasVnrArea' with 0
    1. Apply log-transform on target feature 'SalePrice'
    1. One-hot encoding

3 Train Linear Regression

Split dataset in training set (X_train, y_train) and test set (X_test, y_test)

4 Evaluate Linear Regression model

R^2 score on trainig set: 0.94609, MSE score on trainig set: 0.00808

R^2 score on test set: 0.89136, MSE score on test set: 0.01472

linear_regression-view

5 Model refinement with Ridge regression and Lasso regression

Ridge regression (alpha=0.05): R^2 score on training set: 0.94598, R^2 score on test set: 0.89410

Lasso regression (alpha= 0.0001): R^2 score on trainig set: 0.94169, R^2 score on test set: 0.90843

6 Conslusion:

6.1 In practice, ridge regression is usually the first choice between two models.

6.2 However, if you have a large amount of features and expect only a few of them to be important, Lasso might be a better choice.

R^2 score Linear Regression Ridge Regression Lasso Regression
training set 0.94609 0.94598 0.94169
test set 0.89136 0.89410 0.90843
Popular Linear Regression Projects
Popular Exploratory Data Analysis Projects
Popular Machine Learning Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Jupyter Notebook
Linear Regression
Exploratory Data Analysis