Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n...

24
Chapter 12 Chapter 12 Simple Linear Regression Simple Linear Regression Simple Linear Regression Model Simple Linear Regression Model Least Squares Method Least Squares Method Coefficient of Determination Coefficient of Determination Model Assumptions Model Assumptions Testing for Significance Testing for Significance

Transcript of Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n...

Page 1: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Chapter 12Chapter 12 Simple Linear Regression Simple Linear Regression

Simple Linear Regression ModelSimple Linear Regression Model Least Squares MethodLeast Squares Method Coefficient of DeterminationCoefficient of Determination Model AssumptionsModel Assumptions Testing for SignificanceTesting for Significance

Page 2: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Simple Linear Regression ModelSimple Linear Regression Model

yy = = 00 + + 11xx + +

where:where:

00 and and 11 are called are called parameters of the modelparameters of the model,,

is a random variable called theis a random variable called the error term error term..

The The simple linear regression modelsimple linear regression model is: is:

The equation that describes how The equation that describes how yy is related to is related to xx and and an error term is called the an error term is called the regression modelregression model..

Page 3: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Simple Linear Regression EquationSimple Linear Regression Equation

The The simple linear regression equationsimple linear regression equation is: is:

• EE((yy) is the expected value of ) is the expected value of yy for a given for a given xx value. value.• 11 is the slope of the regression line. is the slope of the regression line.• 00 is the is the yy intercept of the regression line. intercept of the regression line.• Graph of the regression equation is a straight line.Graph of the regression equation is a straight line.

EE((yy) = ) = 00 + + 11xx

Page 4: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Simple Linear Regression EquationSimple Linear Regression Equation

Positive Linear RelationshipPositive Linear Relationship

EE((yy))EE((yy))

xxxx

Slope Slope 11

is positiveis positive

Regression lineRegression line

InterceptIntercept00

Page 5: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Simple Linear Regression EquationSimple Linear Regression Equation

Negative Linear RelationshipNegative Linear Relationship

EE((yy))EE((yy))

xxxx

Slope Slope 11

is negativeis negative

Regression lineRegression lineInterceptIntercept00

Page 6: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Simple Linear Regression EquationSimple Linear Regression Equation

No RelationshipNo Relationship

EE((yy))EE((yy))

xxxx

Slope Slope 11

is 0is 0

Regression lineRegression lineInterceptIntercept

00

Page 7: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Estimated Simple Linear Regression Estimated Simple Linear Regression EquationEquation

The The estimated simple linear regression estimated simple linear regression equationequation

0 1y b b x 0 1y b b x

• is the estimated value of is the estimated value of yy for a given for a given xx value. value.yy• bb11 is the slope of the line. is the slope of the line.• bb00 is the is the yy intercept of the line. intercept of the line.

• The graph is called the estimated regression line.The graph is called the estimated regression line.

Page 8: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Estimation ProcessEstimation Process

Regression ModelRegression Modelyy = = 00 + + 11xx + +

Regression EquationRegression EquationEE((yy) = ) = 00 + + 11xx

Unknown ParametersUnknown Parameters00, , 11

Sample Data:Sample Data:x yx y

xx11 y y11

. .. . . .. . xxnn yynn

bb00 and and bb11

provide estimates ofprovide estimates of00 and and 11

EstimatedEstimatedRegression EquationRegression Equation

Sample StatisticsSample Statistics

bb00, , bb11

0 1y b b x 0 1y b b x

Page 9: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Least Squares MethodLeast Squares Method

Least Squares CriterionLeast Squares Criterion

min (y yi i )2min (y yi i )2

where:where:

yyii = = observedobserved value of the dependent variable value of the dependent variable

for the for the iith observationth observation^yyii = = estimatedestimated value of the dependent variable value of the dependent variable

for the for the iith observationth observation

Page 10: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Least Squares MethodLeast Squares Method

Slope for the Estimated Slope for the Estimated Regression EquationRegression Equation

n

xx

n

yxyx

bi

i

iiii

2

2

1

Page 11: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

yy-Intercept for the Estimated Regression -Intercept for the Estimated Regression EquationEquation

Least Squares MethodLeast Squares Method

0 1b y b x 0 1b y b x

where:where:xxii = value of independent variable for = value of independent variable for iithth observationobservation

nn = total number of observations = total number of observations

__yy = mean value for dependent variable = mean value for dependent variable

__xx = mean value for independent variable = mean value for independent variable

yyii = value of dependent variable for = value of dependent variable for iithth observationobservation

Page 12: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Reed Auto periodically hasReed Auto periodically has

a special week-long sale. a special week-long sale.

As part of the advertisingAs part of the advertising

campaign Reed runs one orcampaign Reed runs one or

more television commercialsmore television commercials

during the weekend preceding the sale. Data from aduring the weekend preceding the sale. Data from a

sample of 5 previous sales are shown on the next sample of 5 previous sales are shown on the next slide.slide.

Simple Linear RegressionSimple Linear Regression

Example: Reed Auto SalesExample: Reed Auto Sales

Page 13: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Simple Linear RegressionSimple Linear Regression

Example: Reed Auto SalesExample: Reed Auto Sales

Number ofNumber of TV AdsTV Ads

Number ofNumber ofCars SoldCars Sold

1133221133

14142424181817172727

Page 14: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Estimated Regression EquationEstimated Regression Equation

ˆ 10 5y x ˆ 10 5y x

1 2

( )( ) 205

( ) 4i i

i

x x y yb

x x

1 2

( )( ) 205

( ) 4i i

i

x x y yb

x x

0 1 20 5(2) 10b y b x 0 1 20 5(2) 10b y b x

Slope for the Estimated Regression EquationSlope for the Estimated Regression Equation

yy-Intercept for the Estimated Regression Equation-Intercept for the Estimated Regression Equation

Estimated Regression EquationEstimated Regression Equation

Page 15: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Scatter Diagram and Trend LineScatter Diagram and Trend Line

y = 5x + 10

0

5

10

15

20

25

30

0 1 2 3 4TV Ads

Ca

rs S

old

Page 16: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Coefficient of DeterminationCoefficient of Determination

Relationship Among SST, SSR, SSERelationship Among SST, SSR, SSE

where:where: SST = total sum of squaresSST = total sum of squares SSR = sum of squares due to regressionSSR = sum of squares due to regression SSE = sum of squares due to errorSSE = sum of squares due to error

SST = SSR + SST = SSR + SSE SSE

2( )iy y 2( )iy y 2ˆ( )iy y 2ˆ( )iy y 2ˆ( )i iy y 2ˆ( )i iy y

Page 17: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

The The coefficient of determinationcoefficient of determination is: is:

Coefficient of DeterminationCoefficient of Determination

where:where:

SSR = sum of squares due to regressionSSR = sum of squares due to regression

SST = total sum of squaresSST = total sum of squares

rr22 = SSR/SST = SSR/SST

Page 18: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Coefficient of DeterminationCoefficient of Determination

rr22 = SSR/SST = 100/114 = .8772 = SSR/SST = 100/114 = .8772

The regression relationship is very strong; 88%The regression relationship is very strong; 88%of the variability in the number of cars sold can beof the variability in the number of cars sold can beexplained by the linear relationship between theexplained by the linear relationship between thenumber of TV ads and the number of cars sold.number of TV ads and the number of cars sold.

Page 19: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Sample Correlation CoefficientSample Correlation Coefficient

21 ) of(sign rbrxy 21 ) of(sign rbrxy

ionDeterminat oft Coefficien ) of(sign 1brxy ionDeterminat oft Coefficien ) of(sign 1brxy

where:where:

bb11 = the slope of the estimated regression = the slope of the estimated regression

equationequation xbby 10ˆ xbby 10ˆ

Page 20: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

21 ) of(sign rbrxy 21 ) of(sign rbrxy

The sign of The sign of bb11 in the equation in the equation is “+”. is “+”.ˆ 10 5y x ˆ 10 5y x

=+ .8772xyr =+ .8772xyr

Sample Correlation CoefficientSample Correlation Coefficient

rrxyxy = +.9366 = +.9366

Page 21: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Assumptions About the Error Term Assumptions About the Error Term

1. The error 1. The error is a random variable with mean of zero. is a random variable with mean of zero.1. The error 1. The error is a random variable with mean of zero. is a random variable with mean of zero.

2. The variance of 2. The variance of , denoted by , denoted by 22, is the same for, is the same for all values of the independent variable.all values of the independent variable.2. The variance of 2. The variance of , denoted by , denoted by 22, is the same for, is the same for all values of the independent variable.all values of the independent variable.

3. The values of 3. The values of are independent. are independent.3. The values of 3. The values of are independent. are independent.

4. The error 4. The error is a normally distributed random is a normally distributed random variable.variable.4. The error 4. The error is a normally distributed random is a normally distributed random variable.variable.

Page 22: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Testing for SignificanceTesting for Significance

To test for a significant regression relationship, weTo test for a significant regression relationship, we must conduct a hypothesis test to determine whethermust conduct a hypothesis test to determine whether the value of the value of 11 is zero. is zero.

To test for a significant regression relationship, weTo test for a significant regression relationship, we must conduct a hypothesis test to determine whethermust conduct a hypothesis test to determine whether the value of the value of 11 is zero. is zero.

Two tests are commonly used:Two tests are commonly used: Two tests are commonly used:Two tests are commonly used:

tt Test Test andand FF Test Test

Both the Both the tt test and test and FF test require an estimate of test require an estimate of 22,, the variance of the variance of in the regression model. in the regression model. Both the Both the tt test and test and FF test require an estimate of test require an estimate of 22,, the variance of the variance of in the regression model. in the regression model.

Page 23: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

An Estimate of An Estimate of

Testing for SignificanceTesting for Significance

210

2 )()ˆ(SSE iiii xbbyyy 210

2 )()ˆ(SSE iiii xbbyyy

where:where:

ss 22 = MSE = SSE/( = MSE = SSE/(n n 2) 2)

The mean square error (MSE) provides the estimateThe mean square error (MSE) provides the estimate

of of 22, and the notation , and the notation ss22 is also used. is also used.

Page 24: Chapter 12 Simple Linear Regression n Simple Linear Regression Model n Least Squares Method n Coefficient of Determination n Model Assumptions n Testing.

Testing for SignificanceTesting for Significance

An Estimate of An Estimate of

2

SSEMSE

n

s2

SSEMSE

n

s

• To estimate To estimate we take the square root of we take the square root of 22..

• The resulting The resulting ss is called the is called the standard error ofstandard error of the estimatethe estimate..