Representing Intonational Variation
04/24/23 1
Representing Intonational Variation
Julia Hirschberg
CS 4706
Today
• How can we represent meaningful speech variation s.t. we can communicate this to others?
– Expanded vs. compressed pitch range?
– Louder vs. softer speech?
– Faster vs. slower speech?
– Differences in intonational prominence?
– Differences in intonational phrasing?
– Differences in pitch contours?
Schemes for Representing Intonational Variation
• An early proposal: Joshua Steele
• Language Learning Approaches
– / IS it INteresting /
– / d’you feel ANGry? /
– / WHAT’S the PROBlem? / (McCarthy, 1991:106)
• How can we capture all and only the meaningful intonational variation for a given language unambiguously?
Intonation Models
• No commonly agreed upon model for one language, let alone all
• Researchers work in different traditions and focus on different aspects of intonation
• Different models may arise from different types of data
– Auditory
– Acoustic
– Perceptual
– …
Intonation Models
• Auditory:
– ESL-orientated; empirical data scarce; even trained listeners do not always agree on what they hear
• Acoustic:
– Distinction between linguistically relevant and irrelevant details in acoustic signal
• Perceptual approach:
– Experimental data, often w/ manipulated f0
– Hard to design experiments with naïve listeners which give adequate control over parameters used in making decisions
Intonation models
• Basic division into linear and superpositional models
– Linear models: intonation involves a succession of individual choices from an intonation lexicon
– Superpositional models: the intonation of an utterance involves a combination of local and utterance-sized components
• Speakers may combine aspects of linear and superpositional models in the production of intonation
Intonation Models
• Linear or Tone sequence models
– British school (Kingdon ’58, O’Connor & Arnold ’73, Cruttenden ’97): based on auditory analysis
– American School (Pierrehumbert ’80, ToBI): mainly acoustic analysis
– Dutch school (’t Hart, Collier and Cohen 1990): perceptual data
• Superpositional models (Fujisaki 1983, Möbius et al. 1993): acoustic/physiological
Superpositional models
• Pitch pattern of intonation modeled with two components: phrase component and accent component.
• Phrase has basic shape, and pitch movements for individual accents are superimposed over basic shape:
[Figure: phrase component plus accent component = contour of “Apples, oranges and tomatoes”]
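The superposition of the two components can be sketched numerically. The following is a minimal sketch in the style of the Fujisaki model, assuming its commonly published form in which log f0 is a baseline plus phrase-command impulse responses plus accent-command step responses. All parameter values (the baseline, `alpha`, `beta`, `ceiling`, and the command times and amplitudes) are illustrative assumptions, not values from the lecture.

```python
import math

def phrase_response(t, alpha=2.0):
    """Impulse response of the phrase control mechanism (zero before onset)."""
    return alpha * alpha * t * math.exp(-alpha * t) if t >= 0 else 0.0

def accent_response(t, beta=20.0, ceiling=0.9):
    """Step response of the accent control mechanism, capped at a ceiling."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), ceiling)

def f0_contour(times, base_hz, phrase_cmds, accent_cmds):
    """Return f0 in Hz at each time; components are added in the log domain.

    phrase_cmds: list of (onset_s, amplitude)
    accent_cmds: list of (onset_s, offset_s, amplitude)
    """
    contour = []
    for t in times:
        log_f0 = math.log(base_hz)
        log_f0 += sum(a * phrase_response(t - t0) for t0, a in phrase_cmds)
        log_f0 += sum(a * (accent_response(t - t1) - accent_response(t - t2))
                      for t1, t2, a in accent_cmds)
        contour.append(math.exp(log_f0))
    return contour

# Hypothetical commands: one phrase and three accents (one per listed item).
times = [i * 0.05 for i in range(41)]  # 0.0 to 2.0 s
phrase_only = f0_contour(times, 100.0, [(0.0, 0.5)], [])
full = f0_contour(times, 100.0, [(0.0, 0.5)],
                  [(0.2, 0.5, 0.4), (0.8, 1.1, 0.4), (1.4, 1.7, 0.4)])
```

Here `phrase_only` gives the basic phrase shape, and `full` shows the accent movements riding on top of it.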
Good for modeling declination
Lily and Rosa thought this was divine. Prince William was gorgeous and he was looking for a bride. They dreamed of wedding bells.
• Declination: downtrend in f0 over the course of an utterance
• Best seen as statistical abstraction: if one takes f0 measurements from enough utterances, over time, a downtrend in f0 will emerge
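Treating declination as a statistical abstraction suggests a simple check: pool f0 measurements from many utterances and fit a line over time. The sketch below uses ordinary least squares. The measurements are made-up illustrative numbers, not data from the lecture, and the straight-line fit is an assumption chosen for simplicity.

```python
def least_squares_slope(points):
    """Slope of the ordinary least-squares line through (time_s, f0_hz) points."""
    n = len(points)
    mean_t = sum(t for t, _ in points) / n
    mean_f = sum(f for _, f in points) / n
    cov = sum((t - mean_t) * (f - mean_f) for t, f in points)
    var = sum((t - mean_t) ** 2 for t, _ in points)
    return cov / var

# Made-up f0 samples (time in s, f0 in Hz) from three hypothetical utterances.
utterances = [
    [(0.0, 210), (0.5, 225), (1.0, 190), (1.5, 200), (2.0, 170)],
    [(0.0, 200), (0.5, 185), (1.0, 205), (1.5, 175), (2.0, 180)],
    [(0.0, 220), (0.5, 200), (1.0, 210), (1.5, 185), (2.0, 175)],
]

pooled = [p for utt in utterances for p in utt]
slope = least_squares_slope(pooled)  # Hz per second; negative means a downtrend
```

No single utterance here falls steadily, yet the pooled slope is negative, which is the sense in which the downtrend emerges only across enough measurements.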
Superpositional models
• Advantages
– Good at modeling declination in intonation languages
– Successful in speech synthesis for languages like Japanese (little variation in accent type, e.g.)
– Capture prosodic structure in languages which have both tone and intonation (e.g. Mandarin)
• Disadvantages
– All contours must be modeled with an accent and a phrase component
– Many SAE (Standard American English) contours cannot be captured easily
– Intonation contours cannot be modeled as sequences of prosodic events
– No account of different accent types, or variations in phrase endings
– No notation system which allows users to share observations from large speech corpora or to compare contours
– A method primarily for synthesis, analysis of speech production
Tone sequence models
• General assumption: intonation is generated from sequences of (possibly) categorically different and phonologically distinctive accents
• Two types of models within the group of tone sequence models:
– Type 1: Intonation made up of sequences of pitch movements
– Type 2: Intonation made up of sequences of pitch levels or targets
Two types of tone-sequence model
• Type 1: based on pitch movements (the British School, the Dutch School)
• Type 2: based on pitch levels, with H and L targets (the American School)
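The difference between the two types can be made concrete as data structures. In the sketch below, a Type 2 description is a list of level targets and a Type 1 description is a list of movements between them. The labels "rise", "fall", and "level" and the function name are illustrative choices, not notation from any of the schools.

```python
def targets_to_movements(targets):
    """Convert a sequence of pitch-level targets ('H'/'L') into pitch movements."""
    movements = []
    for prev, nxt in zip(targets, targets[1:]):
        if prev == nxt:
            movements.append("level")
        elif prev == "L":
            movements.append("rise")
        else:
            movements.append("fall")
    return movements

type2 = ["L", "H", "L", "L"]          # Type 2: levels or targets
type1 = targets_to_movements(type2)   # Type 1: movements between them
```

A sequence of n targets yields n - 1 movements, so the two descriptions carry closely related information in this toy setting; the schools differ in which one they treat as the primitive.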
Tone Sequence Models
• Overall shape of intonation phrase is not a component of models
– Model is a succession of independent accent and boundary tone choices from an intonation lexicon
– Do not model phrase-level phenomena (e.g. declination, pitch range, nuclear accent)
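The idea of a succession of independent choices from an intonation lexicon can be illustrated by enumerating every contour a toy lexicon allows. This is a minimal sketch: the lexicon below is hypothetical and far smaller than any real inventory, and its labels are only loosely ToBI-style.

```python
from itertools import product

# Toy intonation lexicon; the labels are illustrative, loosely ToBI-style.
ACCENTS = ["H*", "L*"]
BOUNDARY_TONES = ["L%", "H%"]

def possible_contours(n_accents):
    """All contours for a phrase with n_accents accents and one boundary tone."""
    accent_choices = product(ACCENTS, repeat=n_accents)
    return [list(accents) + [boundary]
            for accents, boundary in product(accent_choices, BOUNDARY_TONES)]

contours = possible_contours(2)
```

Because each choice is independent, the number of contours is simply the product of the options at each position, and nothing in the representation refers to the overall shape of the phrase.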
The British School
• Tone sequence model and pitch movement analysis (e.g. falling vs. rising intonation)
• Auditory model: teaching English as a second language
– O’Connor and Arnold 1973: earliest textbook for English instruction that tells the user which contour is appropriate in which context; no empirical evidence
• British school analyses applied to English, German, Dutch, French, …
Concepts in the British School
• Basic unit of intonational description: intonation phrase (tone unit)
– Delimited by pauses, phrase-final lengthening, pitch movement
• Syllables within a tone unit can be stressed or accented
– telephone
• Accented syllables are stressed and pitch prominent
Accent
Stressed syllable has full vowel and is perceived as involving a rhythmic beat
Pitch prominence
– syllable produced with moving pitch, or
– syllable part of a pitch jump from a preceding syllable or onto a following syllable, or
– syllable at a point in the utterance where the direction of pitch movement changes (e.g. from rising to falling)
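The three pitch-prominence criteria can be written as a small check over stylised pitch values. This is a sketch under stated assumptions: each syllable is reduced to a start and end pitch in semitones, the thresholds `move` and `jump` are hypothetical, and the direction-change test compares syllable midpoints. None of these details come from the British School descriptions themselves.

```python
def is_pitch_prominent(syllables, i, move=2.0, jump=2.0):
    """Check the three pitch-prominence criteria for syllable i.

    syllables: list of (start, end) pitch values in semitones, one per syllable.
    move, jump: hypothetical thresholds in semitones.
    """
    start, end = syllables[i]
    prev = syllables[i - 1] if i > 0 else None
    nxt = syllables[i + 1] if i + 1 < len(syllables) else None

    # 1. Pitch moves within the syllable.
    if abs(end - start) >= move:
        return True
    # 2. Pitch jumps from the preceding syllable or onto the following one.
    if prev is not None and abs(start - prev[1]) >= jump:
        return True
    if nxt is not None and abs(nxt[0] - end) >= jump:
        return True
    # 3. Direction of pitch movement changes here (e.g. rising to falling).
    if prev is not None and nxt is not None:
        mid = (start + end) / 2
        into = mid - (prev[0] + prev[1]) / 2
        out_of = (nxt[0] + nxt[1]) / 2 - mid
        if into * out_of < 0:
            return True
    return False

# Hypothetical stylised pitch values for a five-syllable phrase.
phrase = [(0.0, 0.0), (0.0, 0.5), (1.0, 1.5), (0.5, 0.0), (0.0, 0.0)]
flags = [is_pitch_prominent(phrase, i) for i in range(len(phrase))]
```

In this toy phrase only the third syllable is flagged, because it sits at the turning point between a rise and a fall. The third criterion has no threshold here, so with real f0 tracks it would need smoothing or a tolerance to avoid flagging every small wobble.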
Pitch Prominence
– Syllable produced with moving pitch
– Syllable part of a pitch jump from a preceding syllable or onto a following syllable
– Syllable at a point in utterance where direction of pitch movement changes
[Figure: pitch traces illustrating the three cases on the phrases “the girl” and “the girl in the garden”]
An example
...a POINT where you have to CLEAN it
and I think it’s HOrrible
There’s a point where you have to clean it and I think it’s horrible...
Intonation Phrase Structure
• Intonational phrases have an internal structure
– Structure determined by location of accents in an IP
– Each accent defines the beginning of a prosodic constituent
Intonation phrase structure
• Two types of accent unit in the British School:
– Prenuclear accent units; also called the Head
– Nuclear accent units; also called the Nucleus
• The nuclear accent unit is the last accent unit in the IP
• The head comprises all prenuclear accent units
Intonation phrase structure
But JOHN’s never BEEN to Jamaica
– Prehead: But
– Prenuclear accent unit (‘Head’): JOHN’s never
– Nuclear accent unit (‘Nucleus’): BEEN to Jamaica
[Figure: the diagram also labels a stressed syllable]
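The division of an intonation phrase into prehead, head, and nucleus follows mechanically from where the accents fall. The sketch below works at the word level for simplicity, which is an assumption: in the British School description an accent unit begins at the accented syllable, not at a word boundary. The function name and the dictionary layout are illustrative.

```python
def parse_intonation_phrase(words):
    """Split an IP into prehead, head, and nucleus.

    words: list of (word, accented) pairs in utterance order.
    Each accent starts a new accent unit; the last accent unit is the nucleus,
    all earlier accent units form the head, and anything before the first
    accent is the prehead.
    """
    prehead, units = [], []
    for word, accented in words:
        if accented:
            units.append([word])      # an accent opens a new accent unit
        elif units:
            units[-1].append(word)    # unaccented words join the current unit
        else:
            prehead.append(word)      # material before the first accent
    return {
        "prehead": prehead,
        "head": units[:-1],           # all prenuclear accent units
        "nucleus": units[-1] if units else [],
    }

ip = parse_intonation_phrase([
    ("But", False), ("JOHN's", True), ("never", False),
    ("BEEN", True), ("to", False), ("Jamaica", False),
])
```

For the example sentence this places "But" in the prehead, "JOHN's never" in the head, and "BEEN to Jamaica" in the nucleus.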
Six nuclear choices in English
[Figure: each choice drawn as a pitch trace on the word “Jamaica”]
– falling
– rising
– rising-falling
– falling-rising
– rising-falling-rising
– level
Strengths and Weaknesses
• How are accents, prominence defined? How are they related to segments? Too many options….
• Are prenuclear accents qualitatively different from nuclear accents? What is the evidence?
• Does each pitch accent begin a new ‘prosodic unit’ in the phrase? What is the evidence?
Next Class
• The American School and Laboratory Phonology
• ToBI
– Read the ToBI conventions
– Listen to the ToBI training data or cardinal examples
– Bring your laptop and headphones to class