Statistical analysis by iswar
-
Upload
iswar-hazarika -
Category
Education
-
view
636 -
download
1
description
Transcript of Statistical analysis by iswar
STATISTICAL ANALYSISPresented by:
Iswar Hazarika1st yr M. Pharm Pharmacology
The Oxford college of Pharmacy
Contents:1. Statistics - Introduction 2. Scope of statistic3. Normal Distribution4. Central Tendency
1. Arithmetic mean2. Median 3. Mode
5. Dispersion1. Standard deviation (SD)
6. Standard error of mean (SEM)7. Probability8. Test for significance
1. Student ‘t’ test 2. Chi square test
Statistics:“Statistics is a science which deals with the collection, classification and tabulation of numerical facts as the basis for explanation, description and comparison of phenomena.”
Here, the data are numbers which contain information.
Scope: Industries Medical Science Agricultural biology Social Science Planning and economics Space research
Normal Distribution When many independent random
factors act in an additive manner to create variability, the data set follows a bell shaped distribution called as normal distribution.
Mathematicians De Moivre and Laplace used this distribution in the 1700's.
In the early 1800's, German mathematician and physicist Karl Gauss used it to analyze astronomical data, and known as the Gaussian distribution.
Normal Distribution
Normal Distribution
When maximum frequency of distribution occurring at the centre of the curve and the remaining evenly distributed around it, it follows normal distribution.
Normal distribution is described by its mean (µ) and standard deviation (σ).
Central tendency Arithmetic mean Geometric mean Median Mode
Arithmetic Mean: It is defined as the sum of the all
variates of a variable divided by the total number of item in a sample.
It is expressed by the symbol
Where, = Arithmetic mean n = frequency Xi = all the varietes of Variable
Arithmetic Mean:
Example:
Geometric Mean: It is defined as the nth root of the
product of the n items in an ungrouped data.
When percentage increase or decrease is expressed over a period of time, the mean percentage is find out by using geometric mean.
If X1, X2, X3,…. Xn are the n variates of the variable X then,
Geometric Mean =
Geometric Mean: Example:
Following administration of a drug in a laboratory mammal, the blood glucose level increased by 5% in the first hour, by 8% in the second hour and 77% in the third hour. What is the mean percentage increase during the observation period?
Here, we assume that the glucose level at the beginning of every hour as 100mg%
Then the level of blood sugar
Geometric Mean: At the end of 1 hour= 100+5 =105mg
% At the end of 2 hour=100+8 = 108mg
% At the end of 3 hour=100+77 =
177mg%
So, geometric mean=
= 126.14 So the mean percentage increase
= 126.14 – 100 = 26.14
Median: It is the central value of all observations
arranged from the lowest to the highest.
Example: (1) For Odd number of variates
Weight of frog in gram. n = 7 75, 66, 55, 68, 71, 78, 72.
Data in ascending order of value: 55, 66, 68, 71, 72, 75, 78.
Here, Median is 71.
Median:
Example: (1) For Even number of variates
Height of Students in cm, n = 8
165, 175, 161, 155, 169, 171, 152, 166.
Data in ascending order of value: 152, 155, 161, 165, 166, 169, 171, 175.
Here, Median is = 165.5
Mode: It is defined as the value which
occurs most frequently in the sample.
ExampleWeight of tablet in mg:52, 48, 50, 51, 50, 51, 50, 49.
In the above data, 50 occurs 3 timesSo mode of above data = 50 mg
Dispersion:
Range Mean deviation Standard deviation Variance (σ2) Standard Error Mean (SEM)
Standard Deviation: It is defined as the square root of
the arithmetic mean of the squared deviations of the various items from arithmetic mean.
It is expressed as SD
It is calculated by the following formula
Standard Deviation: Weight in gram of 6 Frogs.
30, 90, 20, 10, 80, 70. For the above data: = 50.
X weight in gram
10 10 – 50 = - 40 +1600
20 20 – 50 = - 30 +900
30 30 – 50 = - 20 +400
70 70 – 50 = 20 +400
80 80 – 50 = 30 +900
90 90 – 50 = 40 +1600
= 300 = 5800
Standard Deviation:
SD =
=
=
= 34.05
Text Book : Basic Concepts and Methodology for the Health Sciences 21
Variance:It measure dispersion relative to the scatter of
the values about there mean. a) Sample Variance ( ) : ,where is sample meanx
2S
1
)(1
2
2
n
xxS
n
ii
Text Book : Basic Concepts and Methodology for the Health Sciences 22
b)Population Variance ( ) :
where , is Population meanExample: slide no:20
Varience=( )2
= 1160
2
N
xN
ii
1
2
2
)(
Standard Error Mean: In a small sample size the arithmetic
mean would be an approximation of the true mean of the whole population, and therefore subject to error.
In such cases the error of the observed mean is calculated.
The SE allows to find out the range in which the true mean would lie.
It gives an estimate of the extent to which the mean will vary if the experiment is repeated.
Standard Error of the Mean:
SE=
SE of the previous example.
SE=
= 13.05
Probability: The term probability means
“chance” or “likelihood” of the occurrence of the event.
It is defined as the symbol ‘P’.
Where, m= Number of favorable events
N= Total number of events
Test of Significance
In scientific research, a sample investigation produces results which are helpful in making decisions about a population
We are interested in comparing the characteristics of two or more groups.
The two samples drawn from the same population will show some difference
Difference can be controlled by “Test of significance”
Procedure for Test of Significance1. Laying down Hypothesis:a) Null hypothesis: Hypothesis which is to be actually
tested for acceptance. b) Alternative hypothesis: Hypothesis which is
complementary to the to the null hypothesis. Eg. avg of gene length is 170 kbpHo:µ=170
H1:µ=170
i.e, µ>170 or µ<170
2. Two types of error in testing of hypothesisa) Type I error: Rejection of null hypothesis which is trueb) Type II error: Acceptance of null hypothesis which is
false
3. Level of significance Minimize Type I & II error Level of significance is denoted by α α is conventionally chosen as 0.05 (moderate
precision) or 0.01 (high precision) In most biostatistical test α is fixed at 5%, means
probability of accepting a true hypothesis is 95%
4. One & two tailed tests of hypothesis In a test the area under probability curve is divided
into Acceptance region Critical/ rejection region
Types of test of Significance Two types of test used in
interpretation of results.
(1)Parametric test:- It involves normal distribution.
It includes: Student’s t-test Analysis of variance(ANOVA)
Regression Correlation
Z- test
Test of Significance
(2)Non-Parametric test:- It involves when the sample
data does not follow normal distribution.
It includes: Chi-squared test Wilcoxon Signed-
rank test Kruskal-Wallis test
Student ‘t’ test:
This test is applied to assess the statistical significance of difference between two independently drawn sample means obtained from two series of data with an assumption that the two mean are from normal distribution population, with no significant variation
t= (difference of means of two samples)/(std error of difference)
Standard error of difference(Sd) = √{(S12/n1)+
(S22/n2)}
t= |X1 – X2|/ √{(S12/n1)+(S2
2/n2)}
Degrees of freedom = (n1+n2-2)
Ex. Following data related to disintegration time(DT) of Chloroquine tablets using diluent, Lactose monohydrate(LM), dibasic calcium phosphate (DCP).Determine whether the two means are significantly different.
Lactose Monohydrate
DCP
n 3o 35
mean 32 38
variance 9.62 14.23
Null hypothesis: Ho: There is no significant difference between the mean DT in choroquine tablets between LM & DCP
Sd = √{(S12/n1)+(S2
2/n2)} = √(9.62/30)+(14.23/35)=√0.73 = 0.85
Difference between mean = 38-32 = 6
t= |X1 – X2|/ √{(S12/n1)+(S2
2/n2)}
= |32-38|/ √{(9.62/30)+(14.23/35)}= 6/√o.73 = 7.06
Degrees of freedom = (n1+n2-2)= (30+35-2)=63
Conclusion: Calculated value of t(7.06)> tabulated value of t for
63(at 1%=2.66)So the two mean are very much differentSo the null hypothesis is rejected at p=0.01The difference between the two sample means is a
real difference because the level of significance is very high
Chi-square test:-In biological research apart from quantitative
characters one has to deal with qualitative data like flower color or seed color
Results of breeding experiments and genetical analysis comes under chi-square test
The quantity x2 describes the magnitude of difference between the observed & the expected frequency
x2 = ∑(fo - fe)2/fefo – observed frequencyfe – effective frequency
Determination of value of x2
1. Calculate the expected frequency(fe)2. Find out the difference between the observed
frequency(fo) and expected frequency(fe)3. Square the value of (fo-fe) i.e (fo-fe)2
4. Divide each value of fe & obtain the total ∑(fo - fe)2/fe value
5. The calculated value of x2 is compared with the table value for the given degrees of freedom(d.f)
d.f= (r-1) (c-1)where, r- no. of rows in table
c- no. of columns in table
Examples of x2 testIn F2 generation, Mendel obtained 621 tall
plants & 187 dwarf plants out of the total of 808. test whether these two types of plants are in accordance with the Mendelian monohybrid ratio of 3:1 or they deviate from ratio
Solution:Tall plants
Dwarf plants
Total
Observed frequency(fo)
621 187 808
Expected frequency(fe)
606 202 808
Deviation(fo-fe) 15 -15
Formula appliedx2 = ∑(fo - fe)2/fe
=(15)2/606+(-15)2/202= 225/606+ 225/202= 0.3713+ 1.1139= 1.4852
Tabulated value is 3.84 at 5% level of probability for d.f= 2-1 =1Therefore the difference between the observed
& expected frequencies is not significantHence the null hypothesis is true
Application of x2 test1. To test the goodness of fit2. To test the independence of attributes3. To test the homogeneity of independent
estimates of the population varience4. To test the detection of linkage
References Khan IA, Khatum A. Fundamentals of
Biostatistics. 3rd revised edition. Ukazz publication, Hyderabad
Brahmankar DM, Jaiswal SB. Biopharmaceutics & Pharmacokinetics.
Kulkarni SK. Textbook of Experimental pharmacology. Khan IA, Khatum A. Biostatistics in Pharmacy. 3rd edition.
Ukazz publikation, HydrabadJeffery GH, Bassett J,Mendham J, Denney RC. Textbook of
quantitative chemical analysis. Fifth edition. Vogel’s publication.