Regression Analysis Simple Regression. y = mx + b y = a + bx.
-
Upload
elmer-powell -
Category
Documents
-
view
224 -
download
2
Transcript of Regression Analysis Simple Regression. y = mx + b y = a + bx.
![Page 1: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/1.jpg)
Regression Analysis
Simple Regression
![Page 2: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/2.jpg)
y = mx + b
y = a + bx
![Page 3: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/3.jpg)
y = a + bxwhere:
y dependent variable (value depends on x)
a y-intercept (value of y when x = 0)
b slope (rate of change in ratio of delta y divided by delta x)
x independent variable
![Page 4: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/4.jpg)
Assumptions
Linearity
Independence of Error
Homoscedasticity
Normality
![Page 5: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/5.jpg)
Linearity
The most fundamental assumption is that the model fits the situation [i.e.: the Y
variable is linearly related to the value of the X variable].
![Page 6: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/6.jpg)
Independence of Error
The error (residual) is independent
for each value of X.
[Residual = observed - predicted]
![Page 7: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/7.jpg)
Homoscedasticity
The variation around
the line of regression
constant
for all values of X.
![Page 8: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/8.jpg)
Normality
The values of Y be normally distributed at
each value of X.
![Page 9: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/9.jpg)
Diagnostic Checking Linearity
Independence
Examine scatter plot of residuals versus fitted [Yhat] for evidence of nonlinearity
Plot residuals in time order and look for patterns
![Page 10: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/10.jpg)
Diagnostic Checking Homoscedasticity
Normality
Examine scatter plots of residuals versus fitted [Yhat] and residuals vs time order and look for changing scatter.
Examine histogram of residuals. Look for departures from normal curve.
![Page 11: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/11.jpg)
Goal
Develop a statistical model that can predict the values of a dependent (response) variable based upon the values of the independent (explanatory) variable(s).
![Page 12: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/12.jpg)
Goal
![Page 13: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/13.jpg)
Simple Regression
A statistical model that utilizes one quantitativequantitative independent variable “X” to predict the quantitativequantitative dependent variable “Y.”
![Page 14: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/14.jpg)
Mini-Case
Since a new housing complex is being developed in Carmichael, management is under pressure to open a new pie restaurant. Assuming that population and annual sales are related, a study was conducted to predict expected sales.
![Page 15: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/15.jpg)
Mini-Case(Descartes Pie Restaurants)
RestaurantPopulation
(1000)Annual Sales
($1000)1 2 58
2 6 105
3 8 88
::: ::: :::
9 22 149
10 26 202
![Page 16: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/16.jpg)
Mini-Case What preliminary conclusions
can management draw from the data?
What could management expect sales to be if population of the new complex is approximately 18,000 people?
![Page 17: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/17.jpg)
Scatter Diagrams The values are
plotted on a two-dimensional graph called a “scatter diagram.”
Each value is plotted at its X and Y coordinates.
![Page 18: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/18.jpg)
Scatter Plot of Pieshop
0 5 10 15 20 25 30
Population (1000’s)
0
40
80
120
160
200
240
Sales ($1000’s_sal
ScatterPlot of PIESHOP
![Page 19: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/19.jpg)
Types of Models
No relationship between X and Y
Positive linear relationship
Negative linear relationship
![Page 20: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/20.jpg)
Method of Least Squares The straight line that best fits the data.
Determine the straight line for which the differences between the actual values (Y) and the values that would be predicted from the fitted line of regression (Y-hat) are as small as possible.
![Page 21: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/21.jpg)
Measures of Variation
Explained
Unexplained
Total
![Page 22: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/22.jpg)
Explained Variation
Sum of Squares(Yhat - Ybar)
2
due to Regression
[SSR]
![Page 23: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/23.jpg)
Unexplained Variation
Sum of Squares(Yobs - Yhat)2
Error
[SSE]
![Page 24: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/24.jpg)
Total Variation
Sum of Squares(Yobs - Ybar)2
Total
[SST]
![Page 25: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/25.jpg)
H0:
There is no linear relationship between the
dependent variable and the explanatory variable
![Page 26: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/26.jpg)
Hypotheses
H0: = 0
H1: 0
or
H0: No relationship exists
H1: A relationship exists
![Page 27: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/27.jpg)
Analysis of Variance for Regression
Sourceof
VariationSum ofSquares d.f. Mean Square
[Regression]Model SSR k - 1 SSR/dfn
[Residual]Error SSE n - k SSE/dfd
Total SST n - 1test:
p 0.05SST
![Page 28: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/28.jpg)
Standard Error of the Estimate
sy.x
- the measure of variability around the line of regression
![Page 29: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/29.jpg)
Relationship
When null hypothesis is rejected, a
relationship between Y and X variables exists.
![Page 30: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/30.jpg)
Coefficient of Determination
R2 measures the proportion of variation that is explained
by the independent variable
in the regression model.
R2 = SSR / SST
![Page 31: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/31.jpg)
Confidence interval estimates
»True mean
YX
»Individual
Y-hat
![Page 32: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/32.jpg)
Pieshop Forecasting
0 5 10 15 20 25 30
Population (1000’s)
0
40
80
120
160
200
240
Sales ($1000’s)
PIESHOP Forecasts
![Page 33: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/33.jpg)
Coefficient of Sanity
![Page 34: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/34.jpg)
Diagnostic Checking
H0 retain or reject
{Reject if p-value 0.05}
R2 (larger is “better”)
sy.x (smaller is “better”)
![Page 35: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/35.jpg)
Analysis of Variance for Regression for Pieshop
SourceSum ofSquares d.f. Mean Square
Model 14,200.0 1 14,200.0
Error 1,530.0 8 191.25
Total 15,730.0 9test:
p = 0.00003SST
![Page 36: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/36.jpg)
Coefficient of Determination
R2 = SSR / SST
= 90.27 %
thus, 90.27 percent of the variation in annual sales is
explained by the population.
![Page 37: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/37.jpg)
Standard Error of the Estimate
sy.x = 13.8293
with
SSE = 1,530.0
![Page 38: Regression Analysis Simple Regression. y = mx + b y = a + bx.](https://reader036.fdocuments.us/reader036/viewer/2022062300/56649cd75503460f9499e750/html5/thumbnails/38.jpg)
Regression Analysis[Simple Regression]
*** End of Presentation ***
Questions?