Discriminant analysis
-
Upload
amritashish-bagchi -
Category
Education
-
view
1.053 -
download
2
description
Transcript of Discriminant analysis
![Page 1: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/1.jpg)
Discrim
inan
t anal
ysis
Presented byAmritashish Bagchi, Anshuman Mishra & Sukanta Goswami
![Page 2: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/2.jpg)
Definition
To understand group differences and to predict the likelihood that a particular entity will belong to a particular class or group based on independent variables.
OBJECTIVE
Discriminant analysis is a multivariate statistical technique used for classifying a set of observations into pre defined groups.
![Page 3: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/3.jpg)
Purpose
1)The main purpose is to classify a subject into one of the two groups on the basis of some independent traits.
2)A second purpose of the discriminant analysis is to study the relationship between group membership and the variables used to predict the group membership.
![Page 4: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/4.jpg)
Situations for its use When the dependent variable is
dichotomous or multichotomous.
Independent variables are metric, i.e. interval or ratio.
![Page 5: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/5.jpg)
Application of discriminant analysis To identify the characteristics on the
basis of which one can classify an individual as-
1. basketballer or volleyballer on the basis of anthropometric variables.
2. High or low performer on the basis of skill.
3. Juniors or seniors category on the basis of the maturity parameters.
![Page 6: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/6.jpg)
What we do in discriminant analysis It is also known as discriminant function analysis. In, discriminant analysis, the dependent variable is a
categorical variable, whereas independent variables are metric.
after developing the discriminant model, for a given set of new observation the discriminant function Z is computed, and the subject/ object is assigned to first group if the value of Z is less than 0 and to second group if more than 0. This criterion holds true if an equal number of observations are taken in both the groups for developing a discriminant function.
![Page 7: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/7.jpg)
Assumptions
1. Sample size group sizes of the dependent should not be
grossly different i.e. 80:20, here logistic regression may be prefer.
should be at least five times the number of independent variables.
2. Normal distribution Each of the independent variable is normally
distributed.
![Page 8: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/8.jpg)
3. Homogeneity of variances / covariances All variables have linear and
homoscedastic relationships.
4. Outliers Outliers should not be present in the data.
DA is highly sensitive to the inclusion of outliers.
5. Non-multicollinearity There should be any correlation among
the independent variables.
![Page 9: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/9.jpg)
6. Mutually exclusive The groups must be mutually exclusive,
with every subject or case belonging to only one group.
7. Classification Each of the allocations for the dependent
categories in the initial classification are correctly classified.
8. Variability No independent variables should have a
zero variability in either of the groups formed by the dependent variable.
![Page 10: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/10.jpg)
Terminology 1) Variables in the analysis2) Discriminant function A discriminant function is a latent variable which
is constructed as a linear combination of independent variables, such that
Z= c+b1X1+ b2X2+…+bnXn The discriminant function is also known as
canonical root. This discriminant function is used to classify the subject/cases into one of the two groups on the basis of the observed values of the predictor variables
![Page 11: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/11.jpg)
3) Classification matrix In DA, it serves as a yardstick in measuring the
accuracy of a model in classifying an individual /case into one of the two groups. It is also known as confusion matrix, assignment matrix,or prediction matrix. It tells us as to what percentage of the existing data points are correctly classified by the model developed in DA.
4) Stepwise method of discriminant analysis Discriminant function can be developed either by
entering all independent variables together or in stepwise depending upon whether the study is confirmatory or exploratory.
![Page 12: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/12.jpg)
5) Power of discriminatory variables After developing the model in the discriminant analysis
based on the selected independent variables, it is important to know the relative importance of the variables so selected.
6) Box’s M Test By using Box’s M Tests, we test a null hypothesis that the
covariance matrices do not differ between groups formed by the dependent variable. If the Box’s M Test is insignificant, it indicates that the assumptions required for DA holds true.
7) Eigen values Eigen value is the index of overall fit.
![Page 13: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/13.jpg)
8) WILKS lambda It measures the efficiency of discriminant
function in the model. Its value shows, how much percentage of
variability in dependent variable is not explained by the independent variables.
9) Cannonial correlation The canonical correlation is the multiple
correlation between the predictors and the discriminant function. With only one function it provides an index of overall model fit which is interpreted as being the proportion of variance explained (R2).
![Page 14: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/14.jpg)
Detailed procedure
![Page 15: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/15.jpg)
STEPS IN ANALYSIS :
STEP 1.
In step one the independent variables which have the discriminating power are being chosen.
STEP 2.A discriminant function model is developed by using the coefficients of independent variables
![Page 16: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/16.jpg)
STEPS IN ANALYSIS Contd…
STEP 3.
In step three Wilk’s lambda is computed for testing the significance of discriminant function.
STEP 4.
In step four the independent variables which possess importance in discriminating the groups are being found.
![Page 17: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/17.jpg)
STEPS IN ANALYSIS Contd…
STEP 5.
In step five classification of subjects to their respective group is being made.
![Page 18: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/18.jpg)
APPLICATION OF SPSSEg. To identify the players into different
categories during selection process.
![Page 19: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/19.jpg)
![Page 20: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/20.jpg)
![Page 21: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/21.jpg)
![Page 22: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/22.jpg)
![Page 23: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/23.jpg)
![Page 24: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/24.jpg)
![Page 25: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/25.jpg)
Group statistics
![Page 26: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/26.jpg)
Box's Test of Equality of Covariance Matrices
![Page 27: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/27.jpg)
Z= -24.880 + .169 × + .466 × - .423 × - .204 ×
Where,
Height
Back explosive power
Judgement
Patience
![Page 28: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/28.jpg)
![Page 29: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/29.jpg)
![Page 30: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/30.jpg)
![Page 31: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/31.jpg)
![Page 32: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/32.jpg)
Means of the Transformed Groups Centroids
Mean of group 1Mean of group 2
(Batsmen )(Bowler)
-4.390 04.390
![Page 33: Discriminant analysis](https://reader036.fdocuments.us/reader036/viewer/2022081414/54c700c84a79591c038b475f/html5/thumbnails/33.jpg)
Thank
you