Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

24
Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha

description

What is Multiple Regression? Linear Regression is a model to predict the value of one variable from another. Multiple Regression is a natural extension of this model: We use it to predict values of an outcome from several predictors. It is a hypothetical model of the relationship between several variables.

Transcript of Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Page 1: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Applied Quantitative Analysis and Practices

LECTURE#28

By Dr. Osman Sadiq Paracha

Page 2: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Previous Lecture Summary Further SPSS Application of Simple Linear

regression Multiple Regression Introduction Multiple Regression Equation Sample Size considerations

Page 3: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

What is Multiple Regression? Linear Regression is a model to predict the

value of one variable from another. Multiple Regression is a natural extension

of this model: We use it to predict values of an outcome from

several predictors. It is a hypothetical model of the relationship

between several variables.

Page 4: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Regression: An Example

A record company boss was interested in predicting album sales from advertising.

Data 200 different album releases

Outcome variable: Sales (CDs and Downloads) in the week after release

Predictor variables The amount spent promoting the album before release Number of plays on the radio

Page 5: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Multiple Regression as an Equation

With multiple regression the relationship is described using a variation of the equation of a straight line.

110 Xbby innXbXb 22

Page 6: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

b0

b0 is the intercept. The intercept is the value of the

Y variable when all Xs = 0.

This is the point at which the regression plane crosses the Y-axis (vertical).

Page 7: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Beta Values

b1 is the regression coefficient for variable 1.

b2 is the regression coefficient for variable 2.

bn is the regression coefficient for nth variable.

Page 8: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

The Model with Two Predictors

Page 9: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Sample Size Considerations• Simple regression can be effective with a sample size

of 20, but maintaining power at .80 in multiple regression requires a minimum sample of 50 and preferably 100 observations for most research situations.

• The minimum ratio of observations to variables is 5 to 1, but the preferred ratio is 15 or 20 to 1, and this should increase when stepwise estimation is used.

• Maximizing the degrees of freedom improves generalizability and addresses both model parsimony and sample size concerns.

Page 10: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Sample Size Considerations

Page 11: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Estimating the Regression Model and Assessing Overall Model Fit

The analyst must accomplish three basic tasks:

1. Select a method for specifying the regression model to be estimated,

2. Assess the statistical significance of the overall model in predicting the dependent variable, and

3. Determine whether any of the observations exert an undue influence on the results.

Page 12: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Methods of Regression Hierarchical:

Experimenter decides the order in which variables are entered into the model.

Forced Entry: All predictors are entered simultaneously.

Stepwise: Predictors are selected using their semi-partial

correlation with the outcome.

Page 13: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Hierarchical Regression

Known predictors (based on past research) are entered into the regression model first.

New predictors are then entered in a separate step/block.

Experimenter makes the decisions.

Page 14: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Hierarchical Regression It is the best method:

Based on theory testing. You can see the unique predictive

influence of a new variable on the outcome because known predictors are held constant in the model.

Bad Point: Relies on the experimenter knowing

what they’re doing!

Page 15: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Forced Entry Regression All variables are entered into the

model simultaneously. The results obtained depend on

the variables entered into the model. It is important, therefore, to have

good theoretical reasons for including a particular variable.

Page 16: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Stepwise Regression I

Variables are entered into the model based on mathematical criteria.

Computer selects variables in steps. Step 1

SPSS looks for the predictor that can explain the most variance in the outcome variable.

Page 17: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Stepwise Regression II Step 2:

Having selected the 1st predictor, a second one is chosen from the remaining predictors.

The semi-partial correlation is used as a criterion for selection.

Page 18: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Semi-Partial Correlation Partial correlation:

measures the relationship between two variables, controlling for the effect that a third variable has on them both.

A semi-partial correlation: Measures the relationship between two

variables controlling for the effect that a third variable has on only one of the others.

Page 19: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Semi-Partial Correlation in Regression

The semi-partial correlation Measures the relationship between a

predictor and the outcome, controlling for the relationship between that predictor and any others already in the model.

It measures the unique contribution of a predictor to explaining the variance of the outcome.

Page 20: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Problems with Stepwise Methods

Rely on a mathematical criterion. Variable selection may depend upon only slight

differences in the Semi-partial correlation. These slight numerical differences can lead to

major theoretical differences. Should be used only for exploration

Page 21: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

R and R2

R The correlation between the observed values of

the outcome, and the values predicted by the model.

R2

The proportion of variance accounted for by the model.

Adj. R2

reduces the R2 by taking into account the sample size and the number of independent variables in the regression model (It becomes smaller as we have fewer observations per independent variable)..

Page 22: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Analysis of Variance: ANOVA The F-test

looks at whether the variance explained by the model (SSR) is significantly greater than the error within the model (SSE).

It tells us whether using the regression model is significantly better at predicting values of the outcome than using the mean.

Page 23: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

How to Interpret Beta Values

Beta values: the change in the outcome

associated with a unit change in the predictor.

Standardised beta values: tell us the same but expressed as

standard deviations.

Page 24: Applied Quantitative Analysis and Practices LECTURE#28 By Dr. Osman Sadiq Paracha.

Lecture Summary Methods of Multiple Regression Hierarchical Regression Forced Entry Method Stepwise Method Problems with multiple regression R and R2

Beta values