Stevens Direct Scaling Methods and the Uniqueness Problem:

43
1 Stevens Direct Scaling Methods and the Uniqueness Problem: Empirical Evaluation of an Axiom fundamental to Interval Scale Level

description

Stevens Direct Scaling Methods and the Uniqueness Problem: Empirical Evaluation of an Axiom fundamental to Interval Scale Level. - PowerPoint PPT Presentation

Transcript of Stevens Direct Scaling Methods and the Uniqueness Problem:

Page 1: Stevens Direct Scaling Methods and the Uniqueness Problem:

1

Stevens Direct Scaling Methods and the

Uniqueness Problem:

Empirical Evaluation of an Axiom fundamental to Interval Scale Level

Page 2: Stevens Direct Scaling Methods and the Uniqueness Problem:

2

OverviewHypothesis – The Interval Scale Axiom

MethodParticipantsStimuliProcedure

ResultsTesting the Interval Scale AxiomViolations of the Interval Scale Axiom

DiscussionCommutativityMathematical knowledge

Page 3: Stevens Direct Scaling Methods and the Uniqueness Problem:

3

Simple visual task:

Ratio Production Experiment: Subjects were required to adjust the area of variable squares to a prescribed ratio p.

Is a single subject able to produce adjustments on an interval scale level?

Page 4: Stevens Direct Scaling Methods and the Uniqueness Problem:

4

The Interval Scale Axiom

The hypothesis that Φ is a subscale of an interval scale is equivalent to the assumption that the following interval scale axiom is satisfied: Let us assume real numbers p, q, r, r´ and stimuli s, t, x, y such that:

Page 5: Stevens Direct Scaling Methods and the Uniqueness Problem:

5

___________ (t, q, x) E,x q t __________________ (s, p, t) E, t p s______________________________ (s, r, x) E,x r s

______________________________________ (y, r´, x) E,x r´ y

Then and

___________________________ (y, p´, t) E t p´ y

qr

pqrprp

)1´(

´:

Page 6: Stevens Direct Scaling Methods and the Uniqueness Problem:

6

The Interval Scale Axiom can be put to an empirical test

Page 7: Stevens Direct Scaling Methods and the Uniqueness Problem:

7

Participants

- 10 students and 7 graduates- 9 female and 8 male- Age: between 21 and 35 years old ( 26.9 years)- None had prior knowledge of hypothesis- Normal vision or corrected to normal vision

Method

Page 8: Stevens Direct Scaling Methods and the Uniqueness Problem:

8

Stimuli

White squares on a black background wereprojected on a white screen by means of a video projector

Page 9: Stevens Direct Scaling Methods and the Uniqueness Problem:

9

Apparatus

White screenVideo projector (Sony VPL-X1000E)Keyboard (Cherry G83), PCSoftware Orange TA (Michael Kickmeier)

Page 10: Stevens Direct Scaling Methods and the Uniqueness Problem:

10

Procedure• Welcome...• Thank you very much for participating...

• Collection of personal data name, age, normal sight? (y/n), student (y/n)• Instruction• Practice session (4 trials)

test trial – collecting data

darkened room (Heizhaus, SR 12.31) participants distance from the screen was 3 m

Page 11: Stevens Direct Scaling Methods and the Uniqueness Problem:

11

Instruction

„... The task is to adjust the right square in such a way, that its area appers to be n times as large as the area of the standard stimulus on the left. Use the „“ and the „“ keys to adjust the right square and indicate a satisfactory match by pressing the „Return“ key. If the projector screen appears to be to small to adjust the area to the prescribed ratio n, then you have the opportunity to press the „Esc“ key and the next task will appear ...“

Page 12: Stevens Direct Scaling Methods and the Uniqueness Problem:

12

Page 13: Stevens Direct Scaling Methods and the Uniqueness Problem:

13

2 standard squares (x)- 80 Pixel (13.3 cm on the screen)- 120 Pixel side length (20 cm)

11 ratio production factors- (q = 2,3,4,....,12)

for each (x, q) combination: 10 adjustments 2 11 10 = 220 adjustments

Session 1

Page 14: Stevens Direct Scaling Methods and the Uniqueness Problem:

14

Evaluation of Session 1

Calculating the mean areas of 10 adjustments 80 2 „802“ 10 adjustments 80 3 „803“ 10 adjustments 120 2 „1202“ 10 adjustments 120 3 „1203“

(will be the standard squares in Session 2)

Page 15: Stevens Direct Scaling Methods and the Uniqueness Problem:

15

Session 2

4 standard squares (different for each participant)

802 1202

803 1203

5 ratio production factors(p = 2,3,4,5,6)

for each (xq, p) combination: 10 adjustments 4 5 10 = 200 adjustments

Page 16: Stevens Direct Scaling Methods and the Uniqueness Problem:

16

Notation: mean area for the 10 adjustments802 3: „802,3“

In both Sessions:

Standard square and ratio production factors were randomly intermixed

The 2 Sessions were conducted on 2 different days

Page 17: Stevens Direct Scaling Methods and the Uniqueness Problem:

17

Results

A few descriptives

- One participant had to be excluded after Session 1 - „Esc“ key was not used

no ceiling effects - Session 1 took an average time of 46 min - Session 2 took an average time of 30 min

Page 18: Stevens Direct Scaling Methods and the Uniqueness Problem:

18

Ratio Production Factor

12111098765432M

ean

Are

as in

Pix

el²

200000

180000

160000

140000

120000

100000

80000

60000

40000

20000

0

adjusted

"correct"Ratio Production Factor

12111098765432

Mea

n A

reas

in P

ixel

²100000

90000

80000

70000

60000

50000

40000

30000

20000

10000

0

adjusted

"correct"

Mean adjustments in Session 1. Standard = 80 Pixel side length Mean adjustments in Session 1.

Standard = 120 Pixel side length

Page 19: Stevens Direct Scaling Methods and the Uniqueness Problem:

19

Testing the interval scale axiom

Analyzable Quadruple: There exists a quadruple (x, q, p ,r) such that the q p adjustment xq,p is statistically indistinguishable from the r adjustment xr. Potentially 4 analyzable quadruples for any participant

(1) 802 p1 = 80 r1

(2) 803 p2 = 80 r2

(3) 1202 p3 = 80 r3

(4) 1203 p4 = 80 r4

Page 20: Stevens Direct Scaling Methods and the Uniqueness Problem:

20

Data Selection

Rank OrderThe interval scale axiom is based on the assumption, that the adjustments preserve the mathematical order of the ratio production factors.

n adjustments < (n+1) adjustments 2 < 3 3 < 4

Page 21: Stevens Direct Scaling Methods and the Uniqueness Problem:

21

For any participant all such tuples were analyzed:

We evaluated only those participants with less than 2 rank order violations

4 particpants were excluded

p(rank order violation) 1/36 < .03

Page 22: Stevens Direct Scaling Methods and the Uniqueness Problem:

22

After Data Selection:

33 opportunities to check for the Interval Scale Axiom

Page 23: Stevens Direct Scaling Methods and the Uniqueness Problem:

23

One single case (Participant M.A)

Standard square: 80 Pixel side length q p = r 3 2 = 5 z(U)= .23, p = .82, n.s

q = 3 p = 2 r = 5

Analyzable Quadruple: (80, 3, 2, 5)

A natural number r´ > r was fixed in such a way, that the ratio production factor p´ is an integer.

Page 24: Stevens Direct Scaling Methods and the Uniqueness Problem:

24

Adjustments generated by a single participant (M.A.) starting from the standard square having 80 Pixel side length. Tripling the standard and doubling the outcome is statistically indistinguishable from the 5 adjustment. Therefore, tripling the standard and then tripling the outcome should be statistically indistinguishable from the 7 adjustment. This is not the case. The axiom is violated.

Page 25: Stevens Direct Scaling Methods and the Uniqueness Problem:

25

qr

pqrprp

)1´(

´:

5.235

325)12(6´:

p ... r´=6

335

325)12(7´

p ... r´=7

Page 26: Stevens Direct Scaling Methods and the Uniqueness Problem:

26

If adjustments are on an interval scale level, then:

If (80 3) 2 = 80 5

then there also must hold

(80 3) 3 = 80 7

Formally:

803,2 = 805

803,3 = 807

Page 27: Stevens Direct Scaling Methods and the Uniqueness Problem:

27

The area 803,3 is different from the area 807

(z(U) = 3.41, p < .01)

The axiom is violated.

Page 28: Stevens Direct Scaling Methods and the Uniqueness Problem:

28

Violations of the interval scale axiom in the whole sample

33 Tests: 18 violations vs. 15 non-violations Only 3 of the (remaining) 12 participants showed no violation of the axiom

Page 29: Stevens Direct Scaling Methods and the Uniqueness Problem:

29

Transform the pattern of outcomes into an overall statistical statement: binomial distribution

Probability for a single violation by chance is .05

Probability for observing 18 or more violations in 33 tests is:

p = 1.91 10-15

Assuming that the interval scale axiom holds for the whole sample, then the above pattern is highly unlikely.

Page 30: Stevens Direct Scaling Methods and the Uniqueness Problem:

30

ConclusionFor area production of squares:

Single subject is not able to produce adjustments on an

interval scale level

Page 31: Stevens Direct Scaling Methods and the Uniqueness Problem:

31

Discussion

The study generalizes earlier findings by Ellermeier and Faulhammer (2000) indicating that multiplicativity fails to hold.

Loudness Production: Data are not on a Ratio Scale Level

Page 32: Stevens Direct Scaling Methods and the Uniqueness Problem:

32

CommutativityDoubling a standard and tripling the outcome converges on the same final outcome as first tripling the (same) standard and then doubling the outcome

2 3 = 3 2 (?)Ellermeier & Faulhammer (2000), Zimmer (in press) found Commutativity to hold.

32 tests (16 participants, 2 standard squares)Commutativity was violated in 15 cases.

Commutativity is not a general law that holds across all sensory modalities.

Page 33: Stevens Direct Scaling Methods and the Uniqueness Problem:

33

Mathematical knowledge

4 and 9 ... play a special role

Phyiscal area of a square can be quadrupled by doubling the side length

Coefficient of Variation separately for the 11 ratio production factors

Page 34: Stevens Direct Scaling Methods and the Uniqueness Problem:

34

Mean coefficients of variation calculated separately for each ratio production factor p. The 4 and 9 adjustments show less variability than their respective neighbors.

x

sV Mathematical knowledge:

Page 35: Stevens Direct Scaling Methods and the Uniqueness Problem:

35

Participants applied mathematical knowledge

Teghtsoonian (1965)„ ...ability to make accurate area-judgements depends on (1) his ability accurately to

estimate length, and (2) his knowledge that the area of a two-dimensional figure is proportional to the square of a linear dimension“

Page 36: Stevens Direct Scaling Methods and the Uniqueness Problem:

36

Stevens psychophysical functions are based on mean values over the whole sample

The aim of the present study was to investigate, whether a single subject is able to produce adjustments on an interval scale level.

area production of squares: no (too many violations of the interval scale axiom)

Consequences: Parametric statistics have to be replaced by nonparametric methods.

Page 37: Stevens Direct Scaling Methods and the Uniqueness Problem:

37

Further studies:

- different sensory modalities (loudness, brightness,...)- other direct scaling methods (ratio estimation)- are different scale-types more appropriate

Page 38: Stevens Direct Scaling Methods and the Uniqueness Problem:

38

Thank you very much

Page 39: Stevens Direct Scaling Methods and the Uniqueness Problem:

39

Anderson, N. H. (1970). Functional Measurement and Psychophysical Judgment.Psychological Review, 77, 153-170.

Anderson, N. H. (1976). Integration Theory, Functional Measurement andthe Psychophysical Law. In H. G. Geissler, & Y. M. Zabrodin (Eds.),Advances in Psychophysics. Berlin: VEB Deutscher Verlag der Wissenschaften.

Ellermeier, W., & Faulhammer, G. (2000). Empirical Evaluation of AxiomsFundamental to Stevens's Ratio-Scaling Approach: I. Loudness Production.Perception & Psychophysics, 62, 1505-1511.

Graham, C. H. (1958). Sensation and Perception in an Objective Psychology.Psychological Review, 65, 65-76.

Krider, R. E, Raghubir, P., & Krishna, A. (2001) "Pizzas: or Square? PsychophysicalBiases in Area Comparisons," Marketing Science, 20(4), Fall, 405-425.

Luce, R. D. (2002). A Psychophysical Theory of Intensity Proportions, JointPresentations, and Matches. Psychological Review, 109, 520-532.

Luce, R. D. (2004). Symmetric and Asymmetric Matching of Joint Presentations.Psychological Review, 111, 446-454.

McKenna, F. P. (1985). Another Look at the „New Psychophysics“. BritishJournal of Psychology, 76, 97-109.

References

Page 40: Stevens Direct Scaling Methods and the Uniqueness Problem:

40

Narens, L. (1996) A Theory of Ratio Magnitude Estimation. Journal ofMathematical Psychology, 40, 109-129.

Narens, L. (2002) The Irony of Measurement by Subjective Estimations.Journal of Mathematical Psychology, 46, 769-788.

Orth, B. (1982). Zur Bestimmung der Skalenqualität bei ,direkten´ Skalierungsverfahren. Zeitschrift für experimentelle und

angewandte Psychologie, Band XXIX, Heft 1, S. 160-178. Peißner, M. (1999) Experimente zur direkten Skalierbarkeit von gesehenen

Helligkeiten [Experiments on the direct scalability of perceived brightness].Unpublished master's thesis, Universität Regensburg.

Shepard, R. N. (1978). On the Status of „Direct“ Psychological Measurement.In C. W. Savage (Ed.), Minnesota Studies in the Philosophy ofScience (Vol. 9, pp. 441-490). Minneapolis: University of Minnesota Press.

Shepard, R. N. (1981). Psychological Relations and Psychophysical Scales:On the Status of „Direct" Psychological Measurement. Journal of Math-ematical Psychology, 24, 21-57.

Stevens, S. S. (1936). A Scale for the Measurement of a Psychological Magnitude:Loudness. Psychological Review, 43, 405-416.

Page 41: Stevens Direct Scaling Methods and the Uniqueness Problem:

41

Stevens, S. S., & Guirao, M. (1963). Subjective Scaling of Length and Areaand the Matching of Length to Loudness and Brightness. Journal ofExperimental Psychology, 66, 177-186.

Teghtsoonian, M. (1965). The Judgment of Size. American Journal of Psycholoy,78, 392-402.

Zimmer, K. (in press). Examining the Validity of Numerical Ratios in LoudnessFractionation. Perception & Psychophysics.

Allgemein zur Methodik

Huber, O. (2000) Das psychologische Experiment: Eine Einführung. Hans Huber Verlag, Bern.

Bortz, Lienert, Boehnke: (2000) Verteilungsfreie Methoden in der Biostatistik. Springer Verlag, Berlin.Vorberg, D. & Blankenberger, S. (1999). Die Auswahl statistischer Tests und Maße. Psychologische Rundschau, 50, 157-164

Page 42: Stevens Direct Scaling Methods and the Uniqueness Problem:

42

Erg

ebni

stab

elle

. E

in |z

(U)|

Wer

t >

1.96

bed

eute

t, da

ß di

e be

iden

Fl

äche

n di

e lt

. A

xiom

gle

ich

sein

sol

lten

, ve

rsch

iede

n gr

sind

. In

dies

en

Fäll

en w

urde

das

A

xiom

ver

letz

t (f

ettg

edru

ckt)

. D

as V

orze

iche

n gi

bt d

ie R

icht

ung

der

Ver

letz

ung

an. E

in n

egat

ives

V

orze

iche

n be

deut

et, d

aß d

ie

q

Schä

tzun

g kl

eine

r is

t als

die

Sc

hätz

ung.

Page 43: Stevens Direct Scaling Methods and the Uniqueness Problem:

43

Es geht in dieser Untersuchung um das Schätzen von Flächen. 

Du wirst an der Wand eine Fläche sehen. Darüber steht eine Aufforderung, zb. „2“. Das bedeutet, daß es Deine Aufgabe ist, eine 2 mal so große Fläche

herzustellen. 

Es erscheint eine weitere Fläche, dessen Größe Du mit der „Pfeil aufwärts“ (↑) - Taste oder mit der „Pfeil abwärts“ (↓) - Taste entsprechend verändern kannst. Stelle also einfach eine Fläche her, welche Dir entsprechend der Aufforderung zb. 2 mal, 3

mal,... so groß erscheint.  

Wenn Du denkst, die von Dir hergestellte Fläche ist entsprechend der Aufforderung 2 mal, 3 mal,... so groß, dann drücke bitte die Return-Taste.

Wenn Du denkst, die Leinwand ist zu klein für Deine Schätzung, dann drücke einfach die „Esc“-Taste und die nächste Aufgabe erscheint.

 Es gibt keinen Zeitdruck. Trotzdem solltest du nicht allzu lange an einer Aufgabe

„herumgrübeln“. Die heutige Sitzung besteht aus ca. 200 Schätzungen. Nach 20 min gibt´s eine kurze Pause. Du kannst auch jederzeit selber mal durchatmen, die Augen

fest schließen,... . Bitte versuche, dich so gut wie möglich zu konzentrieren.

Instruktion