From Sound to Sense and back again: The integration of lexical and speech processes

63
From Sound to Sense and From Sound to Sense and back again: back again: The integration of lexical and speech The integration of lexical and speech processes processes David Gow Massachusetts General Hospital Bob McMurray Dept. of Brain and Cognitive Science University of Rochester

description

From Sound to Sense and back again: The integration of lexical and speech processes. David Gow Massachusetts General Hospital. Bob McMurray Dept. of Brain and Cognitive Sciences University of Rochester. Sense. Sound. The Speech Chain. - PowerPoint PPT Presentation

Transcript of From Sound to Sense and back again: The integration of lexical and speech processes

Page 1: From Sound to Sense and back again: The integration of lexical and speech processes

From Sound to Sense and From Sound to Sense and back again:back again:

The integration of lexical and The integration of lexical and speech processesspeech processesDavid Gow

Massachusetts General HospitalBob McMurray

Dept. of Brain and Cognitive SciencesUniversity of Rochester

Page 2: From Sound to Sense and back again: The integration of lexical and speech processes

Complex computations from sound to sense must be broken up for study.

The Speech Chain

Sound

Sense

Assume intermediate representations:

Phonemes…Words…Syntactic

Phrases…

Page 3: From Sound to Sense and back again: The integration of lexical and speech processes

The Standard Paradigm

The Standard Paradigm

Sense

Phon

olog

y Words

Phonemes

Sound

Page 4: From Sound to Sense and back again: The integration of lexical and speech processes

The Standard Paradigm

The Standard Paradigm

Phon

olog

y Words

Phonemes

Delimited fields of study.

Sound

•Speech Perception•Spoken Word

Recognition•Phonology

Phonemes* essential

* or other sublexical category

Sense

Page 5: From Sound to Sense and back again: The integration of lexical and speech processes

Why? Categorical Perception (CP)

•Sharp identification of tokens on a continuum.

VOT0

100

PB

% /p

/

ID (%/pa/) 0

100Discrimination

Discrimination

•Discrimination poor within a phonetic category.

Continuous Acoustic Detail => Discrete CategoriesDoes CAD affect speech categorization?

Page 6: From Sound to Sense and back again: The integration of lexical and speech processes

Categorical Perception (CP)

Defined fundamental computational problems.

CP is output of •Speech perception

Input to •Phonology•Word recognition.

Phon

olog

y Words

Phonemes

Sense

Sound

Page 7: From Sound to Sense and back again: The integration of lexical and speech processes

But… • Not all speech contrasts are categorical.

• Lots of tasks show non-categorical perception.

Fry, Abramson, Eimas & Liberman (1962) Pisoni & Tash (1974) Pisoni & Lazarus (1974) Carney, Widden & Viemeister (1977) Hary & Massaro (1982) Pisoni, Aslin, Perey & Hennessy (1982) Healy & Repp (1982) Massaro & Cohen (1983) Miller (1997) Samuel (1997)…

CP

Page 8: From Sound to Sense and back again: The integration of lexical and speech processes

Categorical Perception is about phonetic classification.

Why has the Standard Paradigm persisted?

Sound

SenseThe minimal computational problem: compute meaning from sound.

CP tasks don’t necessarily tap a stage of this problem.

?CPWords

Lexical activation… seems a good bet.

Page 9: From Sound to Sense and back again: The integration of lexical and speech processes

Even when continuous acoustic detail affects word recognition, it is seen as outside of core word recognition.

Why has the Standard Paradigm persisted?

Page 10: From Sound to Sense and back again: The integration of lexical and speech processes

Example: Word Segmentation

• Vowel Length• Stress/Meter• Coarticulation

Words

Phonemes

CAD

Segmentation

Cue extra-segmental process.W

ord

Reco

gniti

on

Even when continuous acoustic detail affects word recognition, it is seen as outside of core word recognition.

Why has the Standard Paradigm persisted?

Page 11: From Sound to Sense and back again: The integration of lexical and speech processes

No. Standard Paradigm is fine… Yes. Hmm…

Does continuous acoustic detail affect interpretation via core word-recognition processes?

Need to use stimuli with:•Precise control over CAD

Need to use tasks that:•reflect only minimal computational problem:

meaning.•are sensitive to acoustic detail.

Sublexical Filter(phonemes)

Page 12: From Sound to Sense and back again: The integration of lexical and speech processes

Visual World Paradigm

Visual World Paradigm

•Subjects hear spoken language and manipulate objects in a visual world.

•Visual world includes set of objects with interesting linguistic properties (names)

•Eye-movements to each object are monitored throughout the task.

Tanenhaus, Spivey-Knowlton, Eberhart & Sedivy (1995)Allopenna, Magnuson & Tanenhaus (1998)

Page 13: From Sound to Sense and back again: The integration of lexical and speech processes

•Meaning based, natural task: Subjects must interpret speech to perform task.

•Eye-movements fast and time-locked to speech.

•Fixation probability maps onto dynamics of lexical activation.

•Context is controlled: meaning lexical

activation.

Page 14: From Sound to Sense and back again: The integration of lexical and speech processes

?Does continuous acoustic detail affect

interpretation?

Is lexical activation sensitive to continuous

acoustic detail?

Page 15: From Sound to Sense and back again: The integration of lexical and speech processes

Combine tools of

• speech perception:

9-step VOT continuum.

• spoken word recognition:

visual world paradigm

McMurray, Tanenhaus & Aslin (2003)

Page 16: From Sound to Sense and back again: The integration of lexical and speech processes

A moment to view the items

Methods

Page 17: From Sound to Sense and back again: The integration of lexical and speech processes

500 ms later

Page 18: From Sound to Sense and back again: The integration of lexical and speech processes

Bear

Repeat 1080 times…

Page 19: From Sound to Sense and back again: The integration of lexical and speech processes

Target = BearCompetitor = PearUnrelated = Lamp, Ship

Time

200 ms

1

2

3

4

5

Trials

Page 20: From Sound to Sense and back again: The integration of lexical and speech processes

Time (ms)

VOT=0 Response=

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0 400 800 1200 1600

Fixa

tion

pro

port

ion

Page 21: From Sound to Sense and back again: The integration of lexical and speech processes

Systematic effect on competitor dynamics.Fixations to the competitor.

Predictions

Categorical Results Gradient Effect

target

competitortime

Fixa

tion

pro

port

ion

target

competitor competitorcompetitortime

Fixa

tion

pro

port

ion

target

What would lexical sensitivity to CAD look like?

Page 22: From Sound to Sense and back again: The integration of lexical and speech processes

Results

0 400 800 1200 160000.020.040.060.080.1

0.120.140.16

0 ms5 ms10 ms15 ms

VOT

0 400 800 1200 1600 2000

20 ms25 ms30 ms35 ms40 ms

VOT

Com

peti

tor

Fixa

tion

s

Time since word onset (ms)

Response= Response=

Page 23: From Sound to Sense and back again: The integration of lexical and speech processes

Task?

P

B Sh

LPhoneme ID

Not part of minimal computational

problem.Same stimuli in metalinguistic task…

…more categorical pattern of fixationsContinuous acoustic detail is not helpful in metalinguistic tasks…

Page 24: From Sound to Sense and back again: The integration of lexical and speech processes

Summary

Word recognition shows gradient sensitivity to continuous acoustic detail.

Not extra-segmental: VOT

CAD affects higher-level processes.

Consistent with other studies:Andruski, Blumstein & Burton (1994)Marslen-Wilson & Warren (1994)Utman, Blumstein & Burton (2000)Dahan, Magnuson, Tanenhaus & Hogan (2001)McMurray, Clayards, Aslin & Tanenhaus (2004)McMurray, Aslin, Tanenhaus, Spivey & Subik (in prep)

Page 25: From Sound to Sense and back again: The integration of lexical and speech processes

The Standard Paradigm?

Sense

Phon

olog

y Words

Phonemes

Continuous Acoustic Detail

CAD affects higher-level processes.

From other work:

Lexical activation influences sublexical representations.

Samuel & Pitt (2003)Magnuson, McMurray, Tanehaus & Aslin (2003)Samuel (1997)Elman & McClelland (1988)

Page 26: From Sound to Sense and back again: The integration of lexical and speech processes

The Standard Paradigm?

Sense

Phon

olog

y Words

Phonemes

Continuous Acoustic Detail

CAD affects higher-level processes.

From other work:

Lexical activation influences sublexical representations.

Phonological regularity affectssignal interpretation.

Massaro & Cohen (1983)Halle, Segui, Frauenfelder & Meunier (1998)Pitt (1998)Dupoux,Kakehi, Hirose, Pallier & Mehler, (1999)

Page 27: From Sound to Sense and back again: The integration of lexical and speech processes

?Sense

Phon

olog

y Words

Phonemes

Continuous Acoustic Detail

Perhaps interaction and integration make sense.

Do they help solve sticky problems?

YES

Page 28: From Sound to Sense and back again: The integration of lexical and speech processes

The Emerging Paradigm

Integration of work in:• spoken word recognition • speech perception• phonology

New computations simplify old problems and solve new ones.

•Cognitive processes: Lexical activation & competition.

•Perceptual processes: sensitivity to CAD & perceptual grouping.

Page 29: From Sound to Sense and back again: The integration of lexical and speech processes

CAD is helpful in language comprehension.

• Word segmentation

• Coping with lawful variability due to assimilation

Combination of approaches helps solve both problems.

Page 30: From Sound to Sense and back again: The integration of lexical and speech processes

Some lexical processes can’t work

in the Standard Paradigm

Lexical Segmentation

Page 31: From Sound to Sense and back again: The integration of lexical and speech processes

[ ]

The SWR Solution

Page 32: From Sound to Sense and back again: The integration of lexical and speech processes

active

[ ]

Page 33: From Sound to Sense and back again: The integration of lexical and speech processes

active department

[ ]

Page 34: From Sound to Sense and back again: The integration of lexical and speech processes

active departmentact of dip art mint

a partdepart in

arepar

Standard Paradigm: Template matching overgenerates

[ ]

Page 35: From Sound to Sense and back again: The integration of lexical and speech processes

•Overgeneration resolved through competition in TRACE (McClelland & Elman 1986)

Problem: What if the speaker is trying to say “suck seeds”?

‘ k s I d -

succeed

suck

seed

activation

Cycle

Frauenfelder & Peeters (1990)

Page 36: From Sound to Sense and back again: The integration of lexical and speech processes

Cues shown to affect segmentation:

•Initial strong syllable•Initial lengthening•Increased aspiration•Increased glottalization

Lehiste, 1960; Garding,1967; Lehiste, 1972; Umeda, 1975; Nakatani & Dukes, 1977; Nakatani & Schaffer,1978; Cutler & Norris, 1988…..

Implied processing model requires separate segmentation process

Words

Segmentation

Phonemes

CAD

Recognition

The Speech Solution

Page 37: From Sound to Sense and back again: The integration of lexical and speech processes

Problem: cues are subtle and varied, extra-segmental processes are inelegant

?Is there a better mechanism?

Words

Segmentation

Phonemes

CAD

Recognition

Page 38: From Sound to Sense and back again: The integration of lexical and speech processes

The proposal had a strange syntax that nobody liked. ^

The proposal had a strange sin tax that nobody liked. ^

• CAD affects interpretation.• does not trigger segmentation.

Gow & Gordon (1995)

GRAMMAR primedSyntaxTax INCOME inhibited

GRAMMAR primedSyntaxTax INCOME primed

Page 39: From Sound to Sense and back again: The integration of lexical and speech processes

•Observation: All segmentation cues happen to enhance word-initial features

• Strengthened cues facilitate activation, making intended words stronger competitors

Incorporating CAD:

• Solves overgeneration problem.

•No extra-segmental segmentation process.

Good Start Model

Gow & Gordon (1995)

Page 40: From Sound to Sense and back again: The integration of lexical and speech processes

When continuous acoustic detail affects lexical

activation, speech and SWR models can

be integrated and simplified

Summary

Page 41: From Sound to Sense and back again: The integration of lexical and speech processes

The emerging paradigm reframes

computational problems

Assimilation

Page 42: From Sound to Sense and back again: The integration of lexical and speech processes

English coronal place assimilation

/coronal # labial/ [labial # labial]

/coronal #velar/ [velar # velar]Standard Paradigm: Change is • discrete• phonemically neutralizing

Redefining Computational Problems

[ ]# berries nonword?

right berries?

ripe berries?[ ]# berries

Page 43: From Sound to Sense and back again: The integration of lexical and speech processes

Standard Paradigm solution: Phonological inference (Gaskell & Marslen-Wilson, 1996; 1998; 2001)

Knowledge driven inference:

If [labial # labial] infer /coronal # labial/

greem beans green (Gaskell & Marslen-Wilson, 1996; Gow, 2001)

ripe berries right (Gaskell & Marslen-Wilson, 2001; Gow, 2002)

Moreover: Assimilation effects dissociated from linguistic knowledge (Gow & Im, in press)

ripe

Page 44: From Sound to Sense and back again: The integration of lexical and speech processes

Assimilatory modification is acoustically continuous

This is not discrete feature change!

Assimilation Produces CAD

F2 Transitions in /æC/ Contexts

1550160016501700175018001850

Pitch Period

Freq

uenc

y (H

z)

coronalassimilatedlabial

F3 Transitions in /æC/ Contexts

2550

2600

2650

2700

2750

2800

Pitch PeriodFr

eque

ncy

(Hz)

coronalassimilatedlabial

Page 45: From Sound to Sense and back again: The integration of lexical and speech processes

SmaSelect thecat

p box

Regressive Context Effects

Page 46: From Sound to Sense and back again: The integration of lexical and speech processes

Subject Hears: Assim_Non-Coronal (cat/p box)

0

0.1

0.2

0.3

0.4

0.5

0.6

0 400 800 1200 1600Time (ms)

Fixa

tion

Pro

port

ion

Coronal (cat)Non-Coronal (cap)

Page 47: From Sound to Sense and back again: The integration of lexical and speech processes

Subject Hears: Assim Non-Coronal (cat/p drawing)

00.10.20.30.40.50.6

0 400 800 1200 1600Time (ms)

Fixa

tion

Pro

port

ion

Coronal (cat)Non-Coronal (cap)

Page 48: From Sound to Sense and back again: The integration of lexical and speech processes

Looks to Final Non-coronal (box)

00.10.20.30.40.50.60.7

0 400 800 1200 1600Time (ms)

Fixat

ion

Prop

ortio

n

Assim Non-CoronalCoronal Non-Coronal

Progressive Context Effects

Progressive effect in the same experiment

Page 49: From Sound to Sense and back again: The integration of lexical and speech processes

Assimilation is resolved through phonological context.

Fully assimilated items show neither* (Gaskell & Marslen-Wilson, 2001; Gow, 2002;2003)

Assimilation: Use of CAD

Partially-assimilated items show

regressive context effects (Gow, 2002; 2003)

progressive context effects (Gow, 2001; 2003)

Page 50: From Sound to Sense and back again: The integration of lexical and speech processes

assimilation # context

Infinite regress (eternal ambiguity)…. or something more interesting?

Page 51: From Sound to Sense and back again: The integration of lexical and speech processes

Continuous acoustic detail is subject to basic perceptual

processes

Page 52: From Sound to Sense and back again: The integration of lexical and speech processes

Feature cue parsing (Gow, 2003)

Time (s)0 0.760454

0

3000

[

A Perceptual Account

Page 53: From Sound to Sense and back again: The integration of lexical and speech processes

Feature cue parsing (Gow, 2003)

Time (s)0 0.760454

0

3000

Features encoded by multiple cues that are integrated

Page 54: From Sound to Sense and back again: The integration of lexical and speech processes

Feature cue parsing (Gow, 2003)

Time (s)0 0.760454

0

3000

Page 55: From Sound to Sense and back again: The integration of lexical and speech processes

Feature cue parsing (Gow, 2003)

Time (s)0 0.760454

0

3000

Assimilation creates cues consistent with multiple places

Page 56: From Sound to Sense and back again: The integration of lexical and speech processes

Feature cue parsing (Gow, 2003)

Extract feature cues

Page 57: From Sound to Sense and back again: The integration of lexical and speech processes

Feature cue parsing (Gow, 2003)

Group feature cues by similarity and resolve ambiguity

Page 58: From Sound to Sense and back again: The integration of lexical and speech processes

Feature cue parsing (Gow, 2003)

example: eight….

catp # box cat

p # drawing catp

# | | | |

[cor] [cor] [COR] [cor] [lab] [LAB] [lab] [lab]

Page 59: From Sound to Sense and back again: The integration of lexical and speech processes

example: eight….

catp # Box cat

p # Drawing catp #

| | [cor] [cor] [COR] [cor] [lab] [LAB] [lab] [lab]

Feature cue parsing (Gow, 2003)

Progressive and regressive effects fall out of grouping

Page 60: From Sound to Sense and back again: The integration of lexical and speech processes

SWR problem (eternal ambiguity) replaced by simpler perceptual problem

CAD important in solution: processing obstacle facilitates perception.

Integration of continuous perceptual features facilitates higher-level processes.

Facilitation via core-word recognition mechanisms—no extra-segmental routines required.

Summary

Page 61: From Sound to Sense and back again: The integration of lexical and speech processes

Standard paradigm

•Created artificial boundaries that misframed issues.

•Continous acoustic detail is variability to be conquered..

The Standard Paradigm

The basis of the standard paradigm is undercut.

•Meaning-based processes are affected by CAD.

•CAD is an essential component of word recognition.

Page 62: From Sound to Sense and back again: The integration of lexical and speech processes

The emerging paradigm

•Emphasis on methodologies that tap the minimal computational problem: meaning.

•Stresses integration of speech and spoken word recognition, questions methods and theory.

•Continuous acoustic detail is useful signal, not noise.

The Emerging Paradigm

Page 63: From Sound to Sense and back again: The integration of lexical and speech processes

From Sound to Sense and From Sound to Sense and back again:back again:

The integration of lexical and The integration of lexical and speech processesspeech processesDavid Gow

Massachusetts General HospitalBob McMurray

Dept. of Brain and Cognitive SciencesUniversity of Rochester