Simplifying stats
-
Upload
arvind-maharajan -
Category
Technology
-
view
48 -
download
0
Transcript of Simplifying stats
![Page 1: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/1.jpg)
Analytics using SAS © Beacon Learning
Regression Models
![Page 2: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/2.jpg)
Analytics using SAS © Beacon Learning
Regression
Predictive Modeling
Which Factors Explain?
Regressive vs. Correlation
Examples:
What will be India’s Energy Consumption as GDP grows by 6.5%?
What is the probability that a customer will default on housing loan
How many fatal road accidents will you have in Delhi if the traffic
volume increases by 10%?
![Page 3: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/3.jpg)
Analytics using SAS © Beacon Learning
Simple vs multiple Regression
Define Y
Identify X
Estimate
Interpret
uxxxy kk ....22110
![Page 4: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/4.jpg)
Analytics using SAS © Beacon Learning
Non Linear Probability Models
ktiit Xfp ,
![Page 5: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/5.jpg)
Analytics using SAS © Beacon Learning
Linear Probability Model (LPM)
uxxxp kk ....22110 …….
where, p
kxxx ,...,, 10
is the probability of default
kare the explanatory variables
.
yRegress an indicator variable on kxxx ,...,, 10
y is a dichotomous variable with possible values
defaultednot has firm theif 0
defaulted has firm theif 1y
![Page 6: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/6.jpg)
Analytics using SAS © Beacon Learning
Problems with LPM
Goodness of Fit
Improbable Probability Estimates
Linear Incremental Effect of variables on Default Probability
![Page 7: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/7.jpg)
Analytics using SAS © Beacon Learning
Goodness of Fit
![Page 8: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/8.jpg)
Analytics using SAS © Beacon Learning
Improbable Probability Estimates
Linear Incremental Effect of variables on Default
Probability
Other Problems with LPM
![Page 9: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/9.jpg)
Analytics using SAS © Beacon Learning
How should it look like?
![Page 10: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/10.jpg)
Analytics using SAS © Beacon Learning
Non Linear Probability Models
Linear vs Non Linear Regression
Logit Model
Probit Model
![Page 11: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/11.jpg)
Analytics using SAS © Beacon Learning
Logistic Model (Logit Models)
k
jij
k
jij
e
eP
10
10
1
![Page 12: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/12.jpg)
Analytics using SAS © Beacon Learning
Linear Transformation
k
j
ij
i
i
iP
PL
1
01 ln
Log-Odds Ratio
![Page 13: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/13.jpg)
Analytics using SAS © Beacon Learning
How does the probability change?
)1( PPdx
dPj
j
![Page 14: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/14.jpg)
Analytics using SAS © Beacon Learning
Estimation and Interpretation
01
1
ii y
i
y
i PP
Maximum Likelihood Technique
Likelihood function
.Choose j to maximize
![Page 15: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/15.jpg)
Analytics using SAS © Beacon Learning
Goodness of Fit: Concordant Analysis/Specificity vs Sensitivity
Estimated
Equation
Actual Won Lost Total
Predicted
Won 160 61 221
Lost 133 1345 1478
Total 293 1406 1699
Correct 160 1345 1505
% Correct 54.6 95.7 88.6
% Incorrect 45.4 4.3 11.4
Constant
Probability
Won Lost Total
0 0 0
293 1406 1699
293 1406 1699
0 1406 1406
0.0 100.0 82.8
100.0 0.0 17.2
Sensitivity 54.61%
Specificity 95.66%
Positive predictive value 72.40%
Negative predictive value 91.00%
![Page 16: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/16.jpg)
Analytics using SAS © Beacon Learning
Probit Model
![Page 17: Simplifying stats](https://reader033.fdocuments.us/reader033/viewer/2022052301/55900a751a28ab31218b4606/html5/thumbnails/17.jpg)
Analytics using SAS © Beacon Learning
Logit versus Probit