lec-1 final

download lec-1 final

of 48

Transcript of lec-1 final

  • 8/2/2019 lec-1 final

    1/48

    M.B.A. SEMM.B.A. SEM--11

    QuantitativeQuantitative AnalysisAnalysis

    Devina UpadhyayDevina Upadhyay

    M.Sc. , M.phil., PhD (pursuing)M.Sc. , M.phil., PhD (pursuing)

  • 8/2/2019 lec-1 final

    2/48

    Statistics

    Descriptive theory Inferential theory Decision theory

  • 8/2/2019 lec-1 final

    3/48

    Introduction to DescriptiveIntroduction to Descriptivestatisticsstatistics

  • 8/2/2019 lec-1 final

    4/48

    Descriptive theory

    Measures of

    central tendency

    Measures of

    Dispersion

    Measures of

    Variation

  • 8/2/2019 lec-1 final

    5/48

    Ungrouped VersusUngrouped Versus

    Grouped DataGrouped Data

    Ungrouped dataUngrouped data

    have not been summarized in anyhave not been summarized in anywayway

    are also calledare also called raw dataraw data

    Grouped dataGrouped data

    have been organized into ahave been organized into a

    frequency distributionfrequency distribution

  • 8/2/2019 lec-1 final

    6/48

    Example ofUngroupedExample ofUngrouped

    DataData42

    30

    53

    50

    52

    30

    55

    49

    61

    74

    26

    58

    40

    40

    28

    36

    30

    33

    31

    37

    32

    37

    30

    32

    23

    32

    58

    43

    30

    29

    34

    50

    47

    31

    35

    26

    64

    46

    40

    43

    57

    30

    49

    40

    25

    50

    52

    32

    60

    54

    Ages of a Sample of

    Managers from

    Urban Child CareCenters in the

    United States

  • 8/2/2019 lec-1 final

    7/48

    Frequency Distribution ofFrequency Distribution of

    Child Care Managers AgesChild Care Managers Ages

    Class IntervalClass Interval FrequencyFrequency

    2020--under30under30 66

    3030--under40under40 1818

    4040--under50under50 1111

    5050--under60under60 1111

    6060--under70under70 33

    7070--under80under80 11

  • 8/2/2019 lec-1 final

    8/48

    Data RangeData Range

    4

    230

    53

    50

    52

    30

    55

    49

    61

    74

    26

    58

    40

    40

    28

    36

    30

    33

    31

    37

    32

    37

    30

    32

    23

    32

    58

    43

    30

    29

    34

    50

    47

    31

    35

    26

    64

    46

    40

    43

    57

    30

    49

    40

    25

    50

    52

    32

    60

    54

    Smallest

    Largest

    51=

    23-74=

    Smallest-Largest=Range

  • 8/2/2019 lec-1 final

    9/48

    Relative FrequencyRelative Frequency

    RelativeRelativeClass IntervalClass Interval FrequencyFrequency FrequencyFrequency

    2020--under30under30 66 .12.12

    3030--under40under40 1818 .36.36

    4040--under50under50 1111 .22.22

    5050--under60under60 1111 .22.22

    6060--under70under70 33 .06.06

    7070--under80under80 11 .02.02

    TotalTotal 5050 1.001.00

    6

    50!

    18

    50!

  • 8/2/2019 lec-1 final

    10/48

    Cumulative FrequencyCumulative Frequency

    CumulativeCumulativeClass IntervalClass Interval FrequencyFrequency FrequencyFrequency

    2020--under30under30 66 66

    3030--under40under40 1818 2424

    4040--under50under50 1111 3535

    5050--under60under60 1111 4646

    6060--under70under70 33 4949

    7070--under80under80 11 5050

    TotalTotal 5050

    18 + 6

    11 + 24

  • 8/2/2019 lec-1 final

    11/48

    Class Midpoints, RelativeClass Midpoints, Relative

    Frequencies, and CumulativeFrequencies, and Cumulative

    FrequenciesFrequencies

    Relative CumulativeRelative Cumulative

    Class IntervalClass IntervalFrequencyFrequency MidpointMidpoint FrequencyFrequency FrequencyFrequency

    2020--under30under30 66 2525 .12.12 663030--under40under40 1818 3535 .36.36 2424

    4040--under50under50 1111 4545 .22.22 3535

    5050--under60under60 1111 5555 .22.22 4646

    6060--under70under70 33 6565 .06.06 4949

    7070--under80under80 11 7575 .02.02 5050

    TotalTotal 5050 1.001.00

  • 8/2/2019 lec-1 final

    12/48

    Common Statistical GraphsCommon Statistical Graphs

    HistogramHistogram ---- vertical bar chart ofvertical bar chart of

    frequenciesfrequencies

    Frequency PolygonFrequency Polygon ---- line graph ofline graph of

    frequenciesfrequencies

    Pie ChartPie Chart ---- proportionalproportional

    representation for categories of arepresentation for categories of a

    whole.whole.

  • 8/2/2019 lec-1 final

    13/48

    HistogramHistogram ---- vertical bar chart ofvertical bar chart of

    frequenciesfrequencies

    Class Interval Frequency

    20-under 30 6

    30-under 40 18

    40-under 50 11

    50-under 60 11

    60-under 70 3

    70-under 80 1 0

    10

    20

    0 10 20 30 40 50 60 70 80

    Years

    Freq

    uen

    cy

  • 8/2/2019 lec-1 final

    14/48

    Histogram ConstructionHistogram Construction

    Class IntervalClass Interval FrequencyFrequency

    2020--under 30under 30 66

    3030--under 40under 40 1818

    4040--under 50under 50 1111

    5050--under 60under 60 1111

    6060--under 70under 70 33

    7070--under 80under 80 110

    10

    20

    0 10 20 30 40 50 60 70 80

    Years

    Freq

    u

    en

    cy

  • 8/2/2019 lec-1 final

    15/48

    Frequency PolygonFrequency Polygon ---- line graphline graph

    of frequenciesof frequencies

    Class Interval Frequency

    20-under 30 6

    30-under 40 18

    40-under 50 11

    50-under 60 11

    60-under 70 3

    70-under 80 10

    10

    20

    0 10 20 30 40 50 60 70 80

    Years

    Frequency

  • 8/2/2019 lec-1 final

    16/48

    TruckTruckProduction inProduction in

    the U.S. inthe U.S. in

    last yearlast year

    (Hypothetical(Hypothetical

    values)values)

    TruckProduction

    Company

    A

    B

    C

    D

    E

    Totals

    357,411

    354,936

    160,997

    34,099

    12,747

    920,190

  • 8/2/2019 lec-1 final

    17/48

    39%

    39%

    17%

    4%1%

    A B C D E

    . . ruc ro uc on. . ruc ro uc on---- propor onapropor ona

    representation for categories of arepresentation for categories of a

    whole.whole.

  • 8/2/2019 lec-1 final

    18/48

    Descriptive statistics, measure ofDescriptive statistics, measure of

    central tendency, Measure ofcentral tendency, Measure of

    Variability, For group andVariability, For group andUngrouped data, Measures ofUngrouped data, Measures of

    shapeshape ..

  • 8/2/2019 lec-1 final

    19/48

    Measures of Central Tendency:Measures of Central Tendency:

    Ungrouped Data

    Ungrouped DataMeasures of central tendency meansMeasures of central tendency means

    measures of location.measures of location.

    Common Measures of LocationCommon Measures of Location

    ModeMode

    MedianMedian MeanMean

    PercentilesPercentiles

    QuartilesQuartiles

  • 8/2/2019 lec-1 final

    20/48

    MeanMean

    Arithmetic mean : Simple averageArithmetic mean : Simple average

    Geometric mean: Relative percentageGeometric mean: Relative percentage

    Weighted mean: Weights associated withWeighted mean: Weights associated with

    every units.every units.

  • 8/2/2019 lec-1 final

    21/48

    ModeMode

    The most frequently occurring value in aThe most frequently occurring value in a

    data setdata set

    BimodalBimodal ---- Data sets that have two modesData sets that have two modes

    MultimodalMultimodal ---- Data sets that contain moreData sets that contain morethan two modesthan two modes

  • 8/2/2019 lec-1 final

    22/48

    The mode is 44.The mode is 44.

    There are more 44sThere are more 44s

    than any other value.than any other value.

    35

    37

    37

    39

    40

    40

    41

    41

    43

    43

    43

    43

    44

    44

    44

    44

    44

    45

    45

    46

    46

    46

    46

    48

    ModeMode ---- ExampleExample

  • 8/2/2019 lec-1 final

    23/48

    MedianMedian

    Middle value in an ordered array ofMiddle value in an ordered array of

    numbers.numbers.

    Unaffected by extremely large andUnaffected by extremely large and

    extremely small values.extremely small values.

  • 8/2/2019 lec-1 final

    24/48

    Median: ComputationalMedian: Computational

    ProcedureProcedureFirst ProcedureFirst Procedure

    Arrange the observations in an ordered array.Arrange the observations in an ordered array.

    If there is an odd number of terms, theIf there is an odd number of terms, themedian is the middle term of the orderedmedian is the middle term of the ordered

    array.array.

    If there is an even number of terms, theIf there is an even number of terms, the

    median is the average of the middle twomedian is the average of the middle twoterms.terms.

    Second ProcedureSecond Procedure

    The medians position in an ordered array isThe medians position in an ordered array is

    iven b n+1 /2.iven b n+1 /2.

  • 8/2/2019 lec-1 final

    25/48

    Median: Example

    with an Even Number of Terms

    Ordered Array

    3 4 5 7 8 9 11 14 15 16 16 17 19 19 20 21

    There are 16 terms in the ordered array. Position of median = (n+1)/2 = (16+1)/2 = 8.5

    The median is between the 8th and 9th terms,14.5.

    If the 21 is replaced by 100, the median is14.5.

    If the 3 is replaced by -88, the median is 14.5.

  • 8/2/2019 lec-1 final

    26/48

    Median: ExampleMedian: Example

    with an Odd Number ofT

    ermswith an Odd Number ofT

    ermsOrdered ArrayOrdered Array3 4 5 7 8 9 11 14 15 16 16 17 19 19 20 213 4 5 7 8 9 11 14 15 16 16 17 19 19 20 21

    2222

    There are 17 terms in the ordered array.There are 17 terms in the ordered array.Position of median = (n+1)/2 = (17+1)/2 =Position of median = (n+1)/2 = (17+1)/2 =99T

    he median is the 9th term, 15.T

    he median is the 9th term, 15.If the 22 is replaced by 100, the median isIf the 22 is replaced by 100, the median is15.15.If the 3 is replaced byIf the 3 is replaced by --103, the median is103, the median is15.15.

  • 8/2/2019 lec-1 final

    27/48

    VariabilityVariability

    No Variability

    Variability

  • 8/2/2019 lec-1 final

    28/48

    Measures of Variability:Measures of Variability:

    Ungrouped Data

    Ungrouped DataMeasures of variability describe theMeasures of variability describe the

    spread or the dispersion of a set of data.spread or the dispersion of a set of data.

    Common Measures of VariabilityCommon Measures of Variability RangeRange

    Interquartile RangeInterquartile Range

    VarianceVariance

    Standard DeviationStandard Deviation

    Coefficient of VariationCoefficient of Variation

  • 8/2/2019 lec-1 final

    29/48

    RangeRange

    The difference between the largest andThe difference between the largest and

    the smallest values in a set of datathe smallest values in a set of data

    Simple to computeSimple to compute

    Ignores all data points exceptIgnores all data points except thethe

    two extremestwo extremes

    Example:Example:

    RangeRange ==LargestLargest -- SmallestSmallest ==

    4848 -- 35 = 1335 = 13

    35

    37

    37

    39

    40

    40

    41

    41

    43

    43

    43

    43

    44

    44

    44

    44

    44

    45

    45

    46

    46

    46

    46

    48

    35

    48

  • 8/2/2019 lec-1 final

    30/48

    Interquartile RangeInterquartile Range

    Range of values between the first and thirdRange of values between the first and third

    quartilesquartiles

    Less influenced by extremesLess influenced by extremes

    Interquartile Range Q Q! 3 1

  • 8/2/2019 lec-1 final

    31/48

    Population VariancePopulation Variance

    Average of theAverage of the squaredsquared deviations fromdeviations from

    the arithmetic mean.the arithmetic mean.

    mean is 13.mean is 13.Observations:5,9,16,17,18.Observations:5,9,16,17,18.

    59

    16

    17

    18

    -8-4

    +3

    +4

    +5

    0

    6416

    9

    16

    25

    130

    XX Q 2X Q 2

    2

    130

    5

    260

    W

    Q!

    !

    !

    XN

    .

  • 8/2/2019 lec-1 final

    32/48

    Population Standard DeviationPopulation Standard Deviation

    Square root of theSquare root of thevariancevariance

    2

    2

    2

    130

    5

    26 0

    26 0

    51

    WQ

    WW

    !

    !

    !

    !

    !

    !

    X

    N

    .

    .

    .

    5

    9

    16

    17

    18

    -8

    -4

    +3

    +4

    +5

    0

    64

    16

    9

    16

    25

    130

    XX Q 2X Q

  • 8/2/2019 lec-1 final

    33/48

    Sample VarianceSample Variance

    Average of theAverage of the squaredsquared deviations fromdeviations fromthe arithmetic mean.the arithmetic mean.Mean:1773Mean:1773

    Observations:2398,1844,1539,1311.Observations:2398,1844,1539,1311.

    2,398

    1,8441,539

    1,311

    7,092

    625

    71-234

    -462

    0

    390,625

    5,04154,756

    213,444

    663,866

    X X X 2X X 2

    2

    1

    663 866

    3

    221 288 67

    SX X

    n

    !

    !

    !

    ,

    , .

  • 8/2/2019 lec-1 final

    34/48

    Sample Standard DeviationSample Standard Deviation

    Square root of theSquare root of the

    sample variancesample variance 22

    2

    1

    663866

    3

    22128867

    22128867

    47041

    SX X

    S

    n

    S

    !

    !

    !

    !

    !

    !

    ,

    , .

    , .

    .

    2,398

    1,844

    1,539

    1,311

    7,092

    625

    71

    -234

    -462

    0

    390,625

    5,041

    54,756

    213,444

    663,866

    XX X 2X X

  • 8/2/2019 lec-1 final

    35/48

    Coefficient of VariationCoefficient of Variation

    Ratio of the standard deviation to theRatio of the standard deviation to the

    mean, expressed as a percentagemean, expressed as a percentage

    Measurement ofMeasurement of relativerelative dispersiondispersion

    100..Q

    W!VC

  • 8/2/2019 lec-1 final

    36/48

    Less c.v more consistencyLess c.v more consistency

    Less c.v more uniformityLess c.v more uniformity

    Less c.v less riskLess c.v less risk

  • 8/2/2019 lec-1 final

    37/48

    Coefficient of VariationCoefficient of Variation

    A BA B

    129

    4 6

    100

    4 629

    100

    1586

    1

    1

    1

    1

    Q

    W

    WQ

    !

    !

    !

    !

    !

    .

    .

    .

    . .CV

    284

    10

    100

    1084

    100

    1190

    2

    2

    2

    2

    Q

    W

    WQ

    !

    !

    !

    !

    !

    CV. .

    .

  • 8/2/2019 lec-1 final

    38/48

    Measures of Central TendencyMeasures of Central Tendency

    and Variability: Grouped Dataand Variability: Grouped Data

    Measures of Central TendencyMeasures of Central Tendency

    MeanMean MedianMedian

    ModeMode

    Measures of VariabilityMeasures of Variability

    VarianceVariance

    Standard DeviationStandard Deviation

  • 8/2/2019 lec-1 final

    39/48

    Mean of Grouped DataMean of Grouped Data

    Weighted average of class midpointsWeighted average of class midpoints

    Class frequencies are the weightsClass frequencies are the weights

    Q !

    !

    !

    fMf

    fM

    N f M f M f M f M

    f f f f

    i i

    i

    1 1 2 2 3 3

    1 2 3

  • 8/2/2019 lec-1 final

    40/48

    Calculation of Grouped MeanCalculation of Grouped Mean

    Class Interval Frequency Class Midpoint fM

    20-under 30 6 25 150

    30-under 40 18 35 630

    40-under 50 11 45 49550-under60 11 55 605

    60-under 70 3 65 195

    70-under 80 1 75 75

    50 2150

    Q ! ! !fM

    f

    2150

    5043 0.

  • 8/2/2019 lec-1 final

    41/48

    Median of Grouped DataMedian of Grouped Data

    Median L

    Ncf

    fW

    Where

    p

    med

    !

    !

    2

    :

    L the lower limit of the median class

    cf = cumulative frequency of class preceding the median class

    f = frequency of the median classW = width of the median class

    N = total of frequencies

    p

    med

  • 8/2/2019 lec-1 final

    42/48

    Median of Grouped DataMedian of Grouped Data ----

    ExampleExampleCumulative

    Class Interval Frequency Frequency

    20-under 30 6 630-under 40 18 24

    40-under 50 11 35

    50-under 60 11 46

    60-under 70 3 49

    70-under 80 1 50

    N = 50

    Md L

    Ncf

    fW

    p

    med

    !

    !

    !

    2

    40

    50

    224

    1110

    40 909.

  • 8/2/2019 lec-1 final

    43/48

    Mode of Grouped DataMode of Grouped Data

    Midpoint of the modal classMidpoint of the modal class

    Modal class has the greatestModal class has the greatest

    frequencyfrequencyClass Interval Frequency20-under 30 6

    30-under 40 18

    40-under 50 11

    50-under 60 1160-under 70 3

    70-under 80 1

    1579.33

    10*11636

    61830

    *2012

    01mod

    352

    4030

    !

    !

    !

    !

    !

    Wfff

    ffLe

    Mode

    ar ance an tan arar ance an tan ar

  • 8/2/2019 lec-1 final

    44/48

    ar ance an tan ar ar ance an tan ar

    DeviationDeviation

    of Grouped Dataof Grouped Data

    22

    2

    WQ

    WW

    !

    !

    fN

    M

    Population

    22

    2

    1S

    M X

    S

    f

    n

    S

    !

    !

    Sample

    opu at on ar ance anopu at on ar ance an

  • 8/2/2019 lec-1 final

    45/48

    opu at on ar ance anopu at on ar ance an

    Standard Deviation of GroupedStandard Deviation of Grouped

    DataData

    1944

    115244

    1584

    1452

    1024

    7200

    20-under 30

    30-under 4040-under 50

    50-under 60

    60-under 70

    70-under 80

    Class Interval

    6

    1811

    11

    3

    1

    50

    f

    25

    3545

    55

    65

    75

    M

    150

    630495

    605

    195

    75

    2150

    fM

    -18

    -82

    12

    22

    32

    M Q f M2

    Q

    324

    644

    144

    484

    1024

    2

    MQ

    2

    2

    7200

    50144W

    Q! ! !

    fN

    M W W! ! !2

    144 12

  • 8/2/2019 lec-1 final

    46/48

    Measures of ShapeMeasures of Shape

    SkewnessSkewness Absence of symmetryAbsence of symmetry Extreme values in one side of a distributionExtreme values in one side of a distribution

    KurtosisKurtosis Peakedness of a distributionPeakedness of a distribution

    Dispersion:Dispersion: Spread of the dataSpread of the data

  • 8/2/2019 lec-1 final

    47/48

    SkewnessSkewness

    Symmetric skewedSymmetric skewed

  • 8/2/2019 lec-1 final

    48/48

    Dispersion KurtosisDispersion Kurtosis