Main Point This study investigates how voice onset time (VOT) and word duration are affected by...

1
Main Point This study investigates how voice onset time (VOT) and word duration are affected by lexical frequency for words read in isolation and in phrasal context. The VOT is shorter for hi frequency words in phrasal context, and the word duration is shorter for both hi frequency words and for words in phrasal context. Summary of results: Results can be accounted for in a usage- based model (e.g., exemplar, prototype), but pose substantial theoretical and implementation problems for a traditional linguistic model. Background 1. The traditional linguistic model depends on a. discrete, invariant features b. economy of linguistic/phonetic features c. lexical representation similar to orthography d. a 'competence' versus 'performance' dichotomy 2. The traditional model does not systematically account for variation due to any of the following: a. non-Neogrammarian diachronic sound or lexical change (Labov 1981, Phillips 1984) b. sociolinguistic factors such as language contact (Meyers-Scotton 2002), ethnography (Eckert 2000) c. timing, rhythm, prosody (Browman and Goldstein 1992, Hayes 1995, Goldinger and Azuma 2003, Port 2003) d. tone and related prosodic phenomena (Goldsmith 1976) e. linguistic context: phonological (Luce 1985), syntactic (Gahl and Garnsey 2004), metrical (Hayes 1995), syllabic (Davis 1984) f. effects of type and token usage frequency (Bloomfield 1884, Francis and Kučera 1982, Bybee 2001) Conclusions 1. The duration of hi-frequency words is slightly shorter than lo- frequency words (about 5%), and much shorter in phrasal context than in isolation (about 30%). 2. The VOT of hi-frequency words is not systematically shorter, but was shorter by 18%, or about 12 ms, in phrasal context. We do not know why. 3. Since frequency information is apparently stored with each lexical item, these effects support a "usage-based model" that records frequency information in memory. References Bloomfield, M. 1884. On the probability of the existence of phonetic law. American Journal of Philology V. Browman, C. P., and Goldstein, L. M. 1992. "Articulatory phonology: an overview". Phonetica 49:159-180. Bybee, J. 2001. Phonology and Language Use. Cambridge Studies in Linguistics 94. Cambridge: Cambridge University Press. Davis, S. 1984. "Some Implications of Onset-Coda Constraints For Syllable Phonology." CLS 20 (Part One): 46-51. Eckert, P. 2000. Linguistic Variation as Social Practice. Malden, Massachusetts: Blackwell. Francis, W. N., and Kučera, H. 1982. Frequency Analysis of English Usage. Boston: Houghton Mifflin. Gahl, S., and Garnsey, S. M. 2004. "Knowledge of grammar, knowledge of usage: syntactic probabilities affect pronunciation variation". Language 80:748-775. Goldinger, S. D., and Azuma, T. 2003. "Puzzle-solving science: the quixotic quest for units in speech perception". Journal of Phonetics 31:305-320. Hayes, B. 1995. Metrical stress theory: Principles and case studies . Chicago: University of Chicago Press. Huckvale, M. 2005. ProRec 1.01: Speech Prompt and Record System: University College London. Labov, W. 1981. "Resolving the Neogrammarian controversy". Language 57:267-308. Luce, P.A. 1985. Structural distinctions between high and low frequency words in auditory word recognition. unpublished doctoral dissertation, Indiana University. Meyers-Scotton, C. 2002. Contact Linguistics: Bilingual Encounters and Grammatical Outcomes. Oxford: Oxford University Press. Phillips, B. 1984. "Word frequency and the actuation of sound change". Research Questions: 1. Is VOT shorter in high frequency words? 2. Is overall word duration shorter in high frequency words? 3. Is the frequency effect the same for words read in phrasal context as in words in isolation? Methods MATERIALS: SAMPLE PHRASAL CONTEXT: target words medially in partially redundant phrases: knowing test materials will help you pass any town in that country is small every tome by that author is difficult reading oily teak is highly prized in Washington PROCEDURE: materials randomly presented in mixed blocks. Trials consisted of a word or phrase, a 1600 ms of silence, a tone, and the talker's production. TOTAL CORPUS: 20 lo-frequency tokens (×8 repetitions) + 10 hi-frequency tokens (×8 repetitions) × 4 talkers 960 total tokens TALKERS: -4 talkers: 2 male (age 24, 54), 2 female (age 19, 55) -native English with no obvious regional dialect -all report normal hearing and speech MEASUREMENTS: based on waveform, spectral representations, and audio playback. Accuracy was confirmed via random subset sampling. VOT was measured from release of the stop consonant to onset of voicing. Results Statistical Reports items in red are significant, p<.05 VOT and word duration: effects of frequency Mark VanDam & Robert Port [email protected], [email protected] Department of Linguistics, Indiana University Acoustical Society of America 5pSC33 Vancouver, BC, May 2005 66.6 76.7 80.9 78.9 50 55 60 65 70 75 80 85 90 95 100 hi-frequency lo-frequency VO T (m s) phrasal isolation Talker-3 63.5 70.5 69.7 68.5 50 55 60 65 70 75 80 85 90 95 100 hi-frequency lo-frequency VOT (ms) Talker-4 70.1 79.8 79.7 83.2 50 55 60 65 70 75 80 85 90 95 100 hi-frequency lo-frequency VOT (ms) Talker-2 77.8 92.3 98.3 92.6 50 55 60 65 70 75 80 85 90 95 100 hi-frequency lo-frequency VOT (ms) Talker-1 54.9 64.5 75.3 71.5 50 55 60 65 70 75 80 85 90 95 100 hi-frequency lo-frequency VOT (ms) hi-frequency lo-frequency isolation table taint phrasal context a fancy table is made of oak venom can taint the blood supply hi-frequency (>100) lo-frequency (≤1) times tell town talk test ten table take teeth too tusk toque tab tort tuft tier taint tights taupe tint tyke toil talc tote tiff tinge teak tongs tome tine ANOVA VOT all Talkers d.f. F-value p-value frequency 1, 949 .355 .552 context 1, 949 57.74 <.001 context * frequency 3, 949 31.40 <.001 295.2 327.4 461.4 474.2 270 300 330 360 390 420 450 480 510 hi-frequency lo-frequency W ord Length (m s) phrasal isolation Talker-1 277.9 303.8 506.8 505.7 270 300 330 360 390 420 450 480 510 hi-frequency lo-frequency W ordLength(ms) Talker-2 306.3 360.4 497.3 524.4 270 300 330 360 390 420 450 480 510 hi-frequency lo-frequency W ordLength(ms) Talker-3 278.1 295.3 380.4 377.5 270 300 330 360 390 420 450 480 510 hi-frequency lo-frequency W ordLength(ms) Talker-4 318.5 351.0 461.0 489.7 270 300 330 360 390 420 450 480 510 hi-frequency lo-frequency W ordLength(ms) ANOVA Word duration all Talkers d.f. F-value p-value frequency 1, 949 14.87 .001 context 1, 949 706.85 <.001 context * frequency 3, 949 2.63 .105 VOT word duration VOT average duration (ms) change in duration (ms) change in duration (%) hi- frequency 73.8 −4.0 ms −5.2 % lo- frequency 77.8 phrasal context 71.6 −8.3 ms −10.4 % isolated context 79.9 Word Duratio n average duration (ms) change in duration (ms) change in duration (%) hi- frequency 378.3 −21.6 ms −5.4 % lo- frequency 399.9 phrasal context 311.3 −155.6 ms −33.3 % isolated context 466.9

Transcript of Main Point This study investigates how voice onset time (VOT) and word duration are affected by...

Page 1: Main Point This study investigates how voice onset time (VOT) and word duration are affected by lexical frequency for words read in isolation and in phrasal.

Main PointThis study investigates how voice onset time (VOT) and word duration are affected by lexical frequency for words read in isolation and in phrasal context. The VOT is shorter for hi frequency words in phrasal context, and the word duration is shorter for both hi frequency words and for words in phrasal context.

Summary of results:

Results can be accounted for in a usage-based model (e.g., exemplar, prototype), but pose substantial theoretical and implementation problems for a traditional linguistic model.

Background

1. The traditional linguistic model depends ona. discrete, invariant featuresb. economy of linguistic/phonetic featuresc. lexical representation similar to orthographyd. a 'competence' versus 'performance' dichotomy

2. The traditional model does not systematically account for variation due to any of the following:a. non-Neogrammarian diachronic sound or lexical change

(Labov 1981, Phillips 1984)b. sociolinguistic factors such as language contact (Meyers-

Scotton 2002), ethnography (Eckert 2000)c. timing, rhythm, prosody (Browman and Goldstein 1992,

Hayes 1995, Goldinger and Azuma 2003, Port 2003)d. tone and related prosodic phenomena (Goldsmith 1976)e. linguistic context: phonological (Luce 1985), syntactic

(Gahl and Garnsey 2004), metrical (Hayes 1995), syllabic (Davis 1984)

f. effects of type and token usage frequency (Bloomfield 1884, Francis and Kučera 1982, Bybee 2001)

3. The factors above have been observed in several domains: quality (vowels, voicing, etc), alteration (truncation, substitution, assimilation, metathesis, etc), and quantity (elision, shortening, etc)—but relatively little literature on VOT.

Conclusions1. The duration of hi-frequency words is slightly shorter than lo-frequency words (about 5%), and much shorter in phrasal context than in isolation (about 30%).

2. The VOT of hi-frequency words is not systematically shorter, but was shorter by 18%, or about 12 ms, in phrasal context. We do not know why.

3. Since frequency information is apparently stored with each lexical item, these effects support a "usage-based model" that records frequency information in memory.

ReferencesBloomfield, M. 1884. On the probability of the existence of phonetic law. American Journal of

Philology V.Browman, C. P., and Goldstein, L. M. 1992. "Articulatory phonology: an overview". Phonetica

49:159-180.Bybee, J. 2001. Phonology and Language Use. Cambridge Studies in Linguistics 94. Cambridge:

Cambridge University Press.Davis, S. 1984. "Some Implications of Onset-Coda Constraints For Syllable Phonology." CLS

20 (Part One): 46-51.Eckert, P. 2000. Linguistic Variation as Social Practice. Malden, Massachusetts: Blackwell.Francis, W. N., and Kučera, H. 1982. Frequency Analysis of English Usage. Boston: Houghton

Mifflin.Gahl, S., and Garnsey, S. M. 2004. "Knowledge of grammar, knowledge of usage: syntactic

probabilities affect pronunciation variation". Language 80:748-775.Goldinger, S. D., and Azuma, T. 2003. "Puzzle-solving science: the quixotic quest for units in

speech perception". Journal of Phonetics 31:305-320.Hayes, B. 1995. Metrical stress theory: Principles and case studies. Chicago: University of

Chicago Press.Huckvale, M. 2005. ProRec 1.01: Speech Prompt and Record System: University College

London.Labov, W. 1981. "Resolving the Neogrammarian controversy". Language 57:267-308.Luce, P.A. 1985. Structural distinctions between high and low frequency words in auditory word

recognition. unpublished doctoral dissertation, Indiana University.Meyers-Scotton, C. 2002. Contact Linguistics: Bilingual Encounters and Grammatical Outcomes.

Oxford: Oxford University Press.Phillips, B. 1984. "Word frequency and the actuation of sound change". Language 60: 320-342.Port, R. 2003. "Meter and Speech". Journal of Phonetics 31: 599-611.

Research Questions:

1. Is VOT shorter in high frequency words?2. Is overall word duration shorter in high frequency words?3. Is the frequency effect the same for words read in phrasal context as in words in isolation?

MethodsMATERIALS:

SAMPLE PHRASAL CONTEXT: target words medially in partially redundant phrases:

knowing test materials will help you passany town in that country is smallevery tome by that author is difficult readingoily teak is highly prized in Washington

PROCEDURE: materials randomly presented in mixed blocks. Trials consisted of a word or phrase, a 1600 ms of silence, a tone, and the talker's production.

TOTAL CORPUS: 20 lo-frequency tokens (×8 repetitions)+ 10 hi-frequency tokens (×8 repetitions)× 4 talkers

960 total tokens

TALKERS:-4 talkers: 2 male (age 24, 54), 2 female (age 19, 55)-native English with no obvious regional dialect-all report normal hearing and speech

MEASUREMENTS: based on waveform, spectral representations, and audio playback. Accuracy was confirmed via random subset sampling.

VOT was measured from release of the stop consonant to onset of voicing.

Word duration was measured from release of the word-initial stop consonant to the final glottal pulse (if ending in a voiced segment) or release burst (if ending in voiceless segment).

Results

Statistical Reports items in red are significant, p<.05

VOT and word duration: effects of frequency

Mark VanDam & Robert [email protected], [email protected]

Department of Linguistics, Indiana UniversityAcoustical Society of America

5pSC33Vancouver, BC, May 2005

66.6

76.7

80.9

78.9

50

55

60

65

70

75

80

85

90

95

100

hi-frequency lo-frequency

VO

T (

ms)

phrasal

isolation

Talker-3

63.5

70.569.7

68.5

50

55

60

65

70

75

80

85

90

95

100

hi-frequency lo-frequency

VO

T (m

s)

Talker-4

70.1

79.879.7

83.2

50

55

60

65

70

75

80

85

90

95

100

hi-frequency lo-frequency

VO

T (m

s)

Talker-2

77.8

92.3

98.3

92.6

50

55

60

65

70

75

80

85

90

95

100

hi-frequency lo-frequency

VO

T (m

s)

Talker-1

54.9

64.5

75.3

71.5

50

55

60

65

70

75

80

85

90

95

100

hi-frequency lo-frequency

VO

T (m

s)

hi-frequency lo-frequency

isolation table taint

phrasal context a fancy table is made of oakvenom can taint the blood supply

hi-frequency (>100) lo-frequency (≤1)

times

tell

town

talk

test

ten

table

take

teeth

too

tusk

toque

tab

tort

tuft

tier

taint

tights

taupe

tint

tyke

toil

talc

tote

tiff

tinge

teak

tongs

tome

tine

ANOVA

VOT

all Talkers

d.f. F-value p-value

frequency 1, 949 .355 .552

context 1, 949 57.74 <.001

context * frequency 3, 949 31.40 <.001

295.2

327.4

461.4

474.2

270

300

330

360

390

420

450

480

510

hi-frequency lo-frequency

Wor

d Le

ngth

(m

s)

phrasalisolation

Talker-1

277.9

303.8

506.8 505.7

270

300

330

360

390

420

450

480

510

hi-frequency lo-frequency

Wor

d Le

ngth

(ms)

Talker-2

306.3

360.4

497.3

524.4

270

300

330

360

390

420

450

480

510

hi-frequency lo-frequency

Wor

d Le

ngth

(ms)

Talker-3

278.1

295.3

380.4 377.5

270

300

330

360

390

420

450

480

510

hi-frequency lo-frequency

Wor

d Le

ngth

(ms)

Talker-4

318.5

351.0

461.0

489.7

270

300

330

360

390

420

450

480

510

hi-frequency lo-frequency

Wor

d Le

ngth

(ms)

ANOVA

Word duration

all Talkers

d.f. F-value p-value

frequency 1, 949 14.87 .001

context 1, 949 706.85 <.001

context * frequency 3, 949 2.63 .105VOT word duration

VOT average duration (ms)

change in duration (ms)

change in duration (%)

hi-frequency 73.8−4.0 ms −5.2 %

lo-frequency 77.8

phrasal context 71.6−8.3 ms −10.4 %

isolated context 79.9

Word Duration

average duration (ms)

change in duration (ms)

change in duration (%)

hi-frequency 378.3 −21.6 ms −5.4 %

lo-frequency 399.9

phrasal context 311.3−155.6 ms −33.3 %

isolated context 466.9