Cross-modal Prediction in Speech Perception
Carolina Sánchez, Agnès Alsius, James T. Enns & Salvador Soto-Faraco
Multisensory Research Group
Universitat Pompeu Fabra
Barcelona
Auditory + visual performance: MSI enhancement
Background
Visual + auditory information improves speech perception: multisensory integration
Background
• Prediction within one sensory modality
• Many levels of information processing
– Phonological prediction: "This morning I went to the library and borrowed a … book" (DeLong, 2005; Pickering, 2007)
– Visual prediction: visual search (Enns, 2008; Dambacher, 2009)
– Sensorimotor prediction: forward model (Wolpert, 1997)
Predictive coding
Pickering, 2007
Hypothesis
• If prediction exists within a single modality, and if predictive coding models can account for prediction at the phonological level, then …
predictive coding could occur across different sensory modalities too.
Indirect evidence of cross-modal transfer in speech
van Wassenhove et al., 2005
ERPs over time:
• Amplitude reduction
• Latency shortening
/pa/: high visual saliency
/ka/: low visual saliency
Our study
• Visual prediction
• Auditory prediction
• Visual-to-auditory cross-modal prediction
• Auditory-to-visual cross-modal prediction
Visual prediction
[Schematic: visual (V) and auditory (A) streams]
With informative visual context
Without informative context
Task: AV Match vs. AV Mismatch
Target fragment
Context fragment
speech / non-speech
Results
[Figure: reaction time (msec), match vs. mismatch, with vs. without informative visual context; * marks the significant difference]
* With previous context, participants responded faster than without it.
VISUAL PREDICTION
Auditory prediction
[Schematic: visual (V) and auditory (A) streams]
With informative auditory context
Without informative context
speech / non-speech
Task: AV Match vs. AV Mismatch
Target fragment
Context fragment
Results
[Figure: reaction time (msec), match vs. mismatch, with vs. without informative auditory context; * marks the significant difference]
* With previous context, participants responded faster than without it.
AUDITORY PREDICTION
Visual vs. auditory prediction
[Figure: reaction times (msec), congruent vs. incongruent, with vs. without informative context; left panel: visual context, right panel: auditory context; * marks the informative-context advantage in both panels]
Conclusions
• Visual prediction
• Auditory prediction
Is this prediction cross-modal?
Predictability of vision-to-audition: design of the experiment
[Design schematic: visual (V) and auditory (A) streams]
• Unimodal continued — Match
• Unimodal continued — Mismatch
• Discontinued — Match
• Discontinued — Mismatch
• Cross-modal continued — Mismatch
Predictability of vision-to-audition: stimuli
[Stimulus examples for the three mismatch conditions: unimodal continued, discontinued, cross-modal continued]
Results
Participants were faster in the cross-modal condition than in the completely incongruent one.
VISUAL-TO-AUDITORY PREDICTION
[Figure: reaction times (msec, 700–1000 range) for the unimodal continued, discontinued, and cross-modal continued conditions; * marks the cross-modal advantage]
Predictability of audition-to-vision: design of the experiment
[Design schematic: visual (V) and auditory (A) streams]
• Unimodal continued — Match
• Unimodal continued — Mismatch
• Discontinued — Match
• Discontinued — Mismatch
• Cross-modal continued — Mismatch
[Figure: reaction times (msec) for the unimodal continued, discontinued, and cross-modal continued conditions (visual and auditory)]
Results
We did not find any difference between the mismatch conditions.
NO AUDITORY-TO-VISUAL PREDICTION
Conclusions
• There is some form of prediction from the visual to the auditory modality
• There is no prediction from the auditory to the visual modality
Does this prediction depend on the language?
Canadian participants with English sentences
VISUAL-TO-AUDITORY PREDICTION IN NATIVE LANGUAGE
[Figures: reaction times (msec, 700–1000 range) for the unimodal continued, discontinued, and cross-modal continued conditions; * marks the cross-modal advantage in both groups: Canadian participants with English sentences, Spanish participants with Spanish sentences]
Results (L1)
Canadian participants with English sentences
[Figure: reaction times (msec) for the unimodal continued, discontinued, and cross-modal continued conditions]
No differences between the mismatch conditions
No prediction from auditory-to-visual modality in native language
Spanish participants with Spanish sentences
[Figure: reaction times (msec) for the unimodal continued, discontinued, and cross-modal continued conditions (visual and auditory)]
Conclusions
• There is some form of prediction from the visual to the auditory modality in L1
• There is no prediction from the auditory to the visual modality in L1
What happens with an unknown language?
Unknown language: visual to auditory
Canadian participants with Spanish sentences
NO VISUAL-TO-AUDITORY PREDICTION IN AN UNKNOWN LANGUAGE
[Figure: reaction times (msec, 700–1200 range) for the unimodal continued, discontinued, and cross-modal continued conditions]
Unknown language: auditory to visual
Spanish participants with English sentences
Canadian participants with Spanish sentences
[Figures: reaction times (msec) for the unimodal continued, discontinued, and cross-modal continued conditions in both groups]
No differences between the mismatch conditions
No prediction from the auditory to the visual modality in an unknown language
Conclusions
• No visual-to-auditory cross-modal prediction in an unknown language…
it seems that some knowledge of the language's articulatory phonetics is required to obtain the predictive-coding advantage
• No auditory-to-visual cross-modal prediction
General Conclusions
• Unimodal prediction: from visual to visual modality, and from auditory to auditory modality
• L1: ASYMMETRY
– Cross-modal prediction from the visual to the auditory modality
– No cross-modal prediction from the auditory to the visual modality
• Unknown language: prior knowledge of the language is necessary to make the prediction
– No cross-modal prediction from the visual to the auditory modality
– No cross-modal prediction from the auditory to the visual modality
Thanks to…
- Agnès Alsius, Postdoc, Queen's University
- Antonia Najas, MA / Research Assistant, Universitat Pompeu Fabra
- Phil Jaekl, Postdoc, Universitat Pompeu Fabra
- All the people of the Vision Lab, UBC, Vancouver
Thanks for your attention!!