Guide to Using Excel 2007 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 15: Multiple Regression and Model Building By Groebner, Shannon, Fry, & Smith Prentice-Hall Publishing Company Copyright, 2011


Page 1

Guide to Using Excel 2007 For Basic Statistical Applications


To Accompany

Business Statistics: A Decision Making Approach, 8th Ed.

Chapter 15:

Multiple Regression and Model Building

By

Groebner, Shannon, Fry, & Smith

Prentice-Hall Publishing Company

Copyright, 2011

Page 2

Chapter 15 Excel Examples

Multiple Regression – First City Real Estate

Multiple Regression – Dummy Variable – First City Real Estate

Curvilinear Regression – Ashley Investment Services

Interaction Effects – Ashley Investment Services


Page 3

Chapter 15 Excel Examples

Residual Analysis – First City Real Estate

Page 4

Multiple Regression – First City Real Estate

Issue: First City management wishes to build a model that can be used to predict sales prices for residential property.

Objective: Use Excel 2007 to build a multiple regression model relating sales price to a set of measurable variables.

Data file is FirstCity.xls

Page 5

Multiple Regression – First City Real Estate

• Open the file FirstCity.xls, Sheet Homes-Sample 1

• Select the Data tab
• Select Data Analysis
• Select Correlation
• OK

Page 6

Multiple Regression – First City Real Estate

• Input Range: A1:G320
• Grouped by: Columns
• New Worksheet Ply: corr-1
• OK

Page 7

Multiple Regression – First City Real Estate

This cell shows the correlation, 0.7477, between Price and Square Feet
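As a cross-check on what the Correlation tool reports, the sample correlation between two columns can be sketched in a few lines of Python. This is only an illustration: the function name pearson_r and the five data points are made up and are not taken from FirstCity.xls.

```python
import math

def pearson_r(x, y):
    """Sample Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Made-up values for illustration only; not the First City data.
square_feet = [1500, 1800, 2100, 2400, 3000]
price = [150000, 172000, 205000, 210000, 275000]

r = pearson_r(square_feet, price)
print(round(r, 4))
```

A value near +1 indicates a strong positive linear relationship, which is how the Price and Square Feet cell in the correlation matrix is read.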

Page 8

Multiple Regression – First City Real Estate

• Return to Homes-Sample 1
• Select the Data tab
• Select Data Analysis
• Select Regression
• OK

Page 9

Multiple Regression – First City Real Estate

• Input Y Range: A1:A320
• Input X Range: B1:F320
• Select Labels
• New Worksheet Ply: regress-1
• OK

Page 10

Multiple Regression – First City Real Estate

Excel produces the Regression Model
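For readers who want to see what sits behind the coefficient column of the regression output, the following is a minimal sketch of a least-squares fit using the normal equations. It is not Excel's internal routine; the helper names (solve_linear_system, fit_ols) and the data are hypothetical, and the toy prices are generated from a known equation so the recovered coefficients can be checked.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix, copied so the inputs are not modified.
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x

def fit_ols(x_rows, y):
    """Return [intercept, b1, b2, ...] that minimize the squared error."""
    design = [[1.0] + list(row) for row in x_rows]
    k = len(design[0])
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(design, y)) for i in range(k)]
    return solve_linear_system(xtx, xty)

# Made-up rows: (square feet in thousands, bedrooms). Prices in $1000s are
# generated from price = 20 + 80*sqft + 10*beds, so the fit should recover it.
x_rows = [(1.5, 3), (1.8, 3), (2.1, 4), (2.4, 3), (3.0, 5), (2.0, 2)]
y = [20 + 80 * sqft + 10 * beds for sqft, beds in x_rows]

coefficients = fit_ols(x_rows, y)
print([round(c, 6) for c in coefficients])
```

With real data the fit is not exact, and the standard errors, t statistics and p-values in Excel's output are needed to judge each coefficient.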

Page 11

Multiple Regression – First City Real Estate

To determine whether multicollinearity is a problem:
• Select the Add-Ins tab
• Select PHStat
• Select Regression
• Select Multiple Regression

Page 12

Multiple Regression – First City Real Estate

• Y variable … : A1:A320
• X variable … : B1:F320
• Select First Cells in both …
• Select Regression Statistics …
• Select ANOVA and …
• Select Variance Inflation …
• OK

Page 13

Multiple Regression – First City Real Estate

PHStat will find Variance Inflation Factors for all independent variables. This slide shows the VIF for Garage # and all other X variables. Click on the sheet tabs X4, X3, X2, X1 to see the VIF for the other variables.
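The usual definition of a variance inflation factor is VIF_j = 1 / (1 - R_j^2), where R_j^2 comes from regressing the j-th independent variable on all of the other independent variables. The sketch below illustrates that definition only; it is not PHStat's code, and the helper names and the small data set are hypothetical.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x

def r_squared_of_fit(x_rows, y):
    """R-squared of a least-squares regression of y on x_rows (with intercept)."""
    design = [[1.0] + list(row) for row in x_rows]
    k = len(design[0])
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(design, y)) for i in range(k)]
    coef = solve_linear_system(xtx, xty)
    fitted = [sum(c * v for c, v in zip(coef, row)) for row in design]
    mean_y = sum(y) / len(y)
    sse = sum((t - f) ** 2 for t, f in zip(y, fitted))
    sst = sum((t - mean_y) ** 2 for t in y)
    return 1.0 - sse / sst

def variance_inflation_factors(x_rows):
    """One VIF per column: regress that column on all the other columns."""
    vifs = []
    for j in range(len(x_rows[0])):
        target = [row[j] for row in x_rows]
        others = [[v for i, v in enumerate(row) if i != j] for row in x_rows]
        vifs.append(1.0 / (1.0 - r_squared_of_fit(others, target)))
    return vifs

# Made-up rows: (square feet in thousands, bedrooms, garage spaces).
x_rows = [(1.5, 3, 1), (1.8, 3, 2), (2.1, 4, 2), (2.4, 3, 1), (3.0, 5, 3), (2.0, 2, 0)]
vifs = variance_inflation_factors(x_rows)
print([round(v, 3) for v in vifs])
```

A VIF of 1 means the variable is uncorrelated with the other independent variables; larger values indicate stronger multicollinearity.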

Page 14

Multiple Regression – Dummy Variable – First City Real Estate

Issue: First City managers wish to improve the model by adding a location variable for the area.

Objective: Use Excel 2007 to improve a regression model by adding a dummy variable for the area (either foothills or flatland).

Data file is FirstCity.xls

Page 15

Multiple Regression – Dummy Variable – First City Real Estate

• Select the Home-Sample 2 sheet
• Select the Add-Ins tab
• Select PHStat
• Select Regression
• Select Multiple Regression

Page 16

Multiple Regression – Dummy Variable – First City Real Estate

• Cut the Bedrooms and Bathrooms columns and paste them starting in cell G1. Delete columns C and D so that all of the remaining data are in adjacent columns

• Select Data tab
• Select Regression
• Input Y Range: A1:A320
• Input X Range: B1:E320
• Select Labels
• New Worksheet: Regress-2
• OK

Page 17

Multiple Regression – Dummy Variable – First City Real Estate

All variables are significant and have the expected sign
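The area dummy variable used in this model is simply a 0/1 column. As a small sketch, assuming the area is recorded as text and that foothills is coded 1 (the guide does not say which category is coded 1, so that choice and the function name encode_dummy are assumptions):

```python
def encode_dummy(labels, category_coded_one):
    """Return 1 for rows in the chosen category and 0 for all other rows."""
    return [1 if label == category_coded_one else 0 for label in labels]

# Made-up area labels for illustration only.
areas = ["foothills", "flatland", "flatland", "foothills"]
area_dummy = encode_dummy(areas, "foothills")
print(area_dummy)  # [1, 0, 0, 1]
```

With this coding, the dummy's regression coefficient estimates the average price difference between a foothills home and a flatland home, holding the other variables constant.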

Page 18

Curvilinear Relationships – Ashley Investment Services

Issue: The director of personnel is trying to determine whether there is a relationship between employee burnout and time spent socializing with co-workers.

Objective: Use Excel 2007 to determine whether the relationship between the two measures is statistically significant.

Data file is Ashley.xls

Page 19

Curvilinear Relationships – Ashley Investment Services

Open the file Ashley.xls

Page 20

Curvilinear Relationships – Ashley Investment Services

• Select A1:B21
• Select the Insert tab
• Select Scatter Plot
• Select the Scatter Plot desired

Page 21

Curvilinear Relationships – Ashley Investment Services

Page 22

Curvilinear Relationships – Ashley Investment Services

• Select the Data tab
• Select Data Analysis
• Select Regression

Page 23

Curvilinear Relationships – Ashley Investment Services

• Y Range: A1:A21
• X Range: B1:B21
• Labels
• New Worksheet Ply: Ashley-1
• OK

Page 24

Curvilinear Relationships – Ashley Investment Services

Page 25

Curvilinear Relationships – Ashley Investment Services

To develop a nonlinear model, return to the data file. Create a new variable, Socialization Squared.
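The new column holds each socialization value multiplied by itself. A minimal sketch with made-up values (the variable names are hypothetical):

```python
# Made-up socialization measurements for illustration only.
socialization = [2.0, 3.5, 5.0]

# Socialization Squared: each value multiplied by itself.
socialization_squared = [value ** 2 for value in socialization]
print(socialization_squared)  # [4.0, 12.25, 25.0]
```

Regressing burnout on both columns fits a second-order model of the form y = b0 + b1*x + b2*x^2, which lets the fitted line curve.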

Page 26

Curvilinear Relationships – Ashley Investment Services

• Select the Data tab
• Select Data Analysis
• Select Regression
• OK

Page 27

Curvilinear Relationships – Ashley Investment Services

• Input Y Range: A1:A21
• Input X Range: B1:C21
• Select Labels
• New Worksheet Ply: Ashley-2

Page 28

Curvilinear Relationships – Ashley Investment Services

The output shows the R Square value and the Regression Coefficients.
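R Square is the share of the variation in the dependent variable that the fitted model explains: R^2 = 1 - SSE/SST. As a small sketch of that calculation, assuming the fitted values come from a least-squares model with an intercept (the function name and the numbers below are made up):

```python
def r_squared(observed, fitted):
    """Coefficient of determination: 1 - SSE / SST."""
    mean_observed = sum(observed) / len(observed)
    sse = sum((o - f) ** 2 for o, f in zip(observed, fitted))
    sst = sum((o - mean_observed) ** 2 for o in observed)
    return 1.0 - sse / sst

# Made-up observed and fitted values for illustration only.
observed = [10.0, 12.0, 15.0, 19.0]
fitted = [10.5, 11.5, 15.5, 18.5]
print(round(r_squared(observed, fitted), 4))
```

Comparing this value between the linear model and the model with the squared term shows how much the curvilinear term improves the fit.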

Page 29

Interaction Effects – Ashley Investment Services

Issue: The director of personnel is trying to determine whether the model can be improved by separating observations between those taken from men and women.

Objective: Use Excel 2007 to determine whether the relationship between the measures can be improved.

Data file is Ashley-2.xls

Page 30

Interaction Effects – Ashley Investment Services

• Open file Ashley-2.xls
• Insert a new column C (Socialization Squared), which is column B squared
• Add column E as column B * column D
• Add column F as column C * column D
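The column arithmetic above can be sketched as follows. This assumes column B holds the socialization measure and column D holds a 0/1 gender dummy; the guide does not show the column contents, so that layout, the variable names and the three rows of data are assumptions.

```python
# Made-up rows: (socialization, gender dummy), standing in for columns B and D.
rows = [(2.0, 1), (3.0, 0), (4.0, 1)]

table = []
for socialization, gender in rows:
    squared = socialization ** 2               # column C = B squared
    soc_by_gender = socialization * gender     # column E = B * D
    squared_by_gender = squared * gender       # column F = C * D
    table.append((socialization, squared, gender, soc_by_gender, squared_by_gender))

for row in table:
    print(row)
```

Columns E and F are the interaction terms: they are zero for one gender and equal to columns B and C for the other, so their coefficients measure how the curve differs between the two groups.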

Page 31

Interaction Effects – Ashley Investment Services

Using the Insert tab and Chart Tools, set up a scatter plot for one gender.

Page 32

Interaction Effects – Ashley Investment Services

Add the second gender.

Add an exponential trend line for each gender (male and female).
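An exponential trend line has the form y = a * e^(b*x). One common way to fit it, and as far as I know the approach Excel's trendline takes, is ordinary least squares on ln(y), which requires every y value to be positive. The sketch below illustrates that approach; the function name is hypothetical and the data are generated from a known curve rather than taken from Ashley-2.xls.

```python
import math

def fit_exponential_trend(x, y):
    """Fit y = a * exp(b * x) by least squares on ln(y); every y must be > 0."""
    log_y = [math.log(value) for value in y]
    n = len(x)
    mean_x = sum(x) / n
    mean_log_y = sum(log_y) / n
    sxx = sum((v - mean_x) ** 2 for v in x)
    sxy = sum((v - mean_x) * (w - mean_log_y) for v, w in zip(x, log_y))
    b = sxy / sxx
    a = math.exp(mean_log_y - b * mean_x)
    return a, b

# Points generated from y = 2 * exp(0.5 * x), so the fit should recover a=2, b=0.5.
x = [0.0, 1.0, 2.0, 3.0]
y = [2.0 * math.exp(0.5 * value) for value in x]

a, b = fit_exponential_trend(x, y)
print(round(a, 6), round(b, 6))
```

Fitting one curve per gender and comparing the two sets of (a, b) values gives a quick visual and numeric sense of whether the relationship differs between the groups.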

Page 33

Interaction Effects – Ashley Investment Services

The regression for the curvilinear model.

Page 34

Residual Analysis – First City Real Estate

Issue: The company is interested in analyzing the residuals of the regression model to determine whether the assumptions of multiple regression are satisfied.

Objective: Use Excel 2007 to analyze residuals from a regression model.

Data file is First City-3.xls

Page 35

Residual Analysis – First City Real Estate

Open the file FirstCity3.xls

Because Excel requires the independent variables to be in adjacent columns, cut and paste these columns so that they sit side by side.

We will be using:
• Price
• Sq. Feet
• Bedrooms
• Garage #
• Log of Lot Size

• Note: I will swap Lot Size and Log Lot Size to simplify the operation

Page 36

Residual Analysis – First City Real Estate

• Select the Data tab

• Select Data Analysis

• Select Regression

• OK

Page 37

Residual Analysis – First City Real Estate

Define the ranges for the X and Y variables. Excel gives several options for residual analysis but does not have as complete a set as Minitab.

Page 38

Residual Analysis – First City Real Estate

This is the Residual Plot for Square Feet

Page 39

Residual Analysis – First City Real Estate

While Excel will not automatically generate a histogram of the standardized residuals, one can be created.

Page 40

Residual Analysis – First City Real Estate

Define Bin values for the Histogram

Page 41

Residual Analysis – First City Real Estate

• Select the Data tab

• Select Data Analysis

• Select Histogram

• On the Histogram Chart identify the data and bins

• Select Chart output
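The histogram steps above amount to standardizing the residuals and counting how many fall in each bin. The sketch below assumes one common convention, dividing each residual by the sample standard deviation of the residuals, and assumes each bin counts values up to and including its upper bound with a final "More" slot for anything larger. Excel's exact formulas are not stated in this guide, and the function names and numbers are made up.

```python
import bisect
import statistics

def standardize(residuals):
    """Divide each residual by the sample standard deviation of the residuals."""
    sd = statistics.stdev(residuals)
    return [r / sd for r in residuals]

def histogram_counts(values, bins):
    """Count values per bin (bins must be sorted in ascending order).

    A value lands in the first bin whose upper bound is >= the value.
    The extra final slot counts values larger than the last bin ("More").
    """
    counts = [0] * (len(bins) + 1)
    for value in values:
        counts[bisect.bisect_left(bins, value)] += 1
    return counts

# Made-up residuals for illustration only.
residuals = [-3.0, -1.0, 0.0, 1.0, 3.0]
standardized = standardize(residuals)

bins = [-1, 0, 1]
counts = histogram_counts(standardized, bins)
print(counts)  # [1, 2, 1, 1]
```

A roughly bell-shaped set of counts centered on zero is consistent with the assumption that the regression errors are normally distributed.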

Page 42

Residual Analysis – First City Real Estate