LOT summer school Ultrasound, phonetics, phonology: Articulation for Beginners!

67
James M Scobbie CASL Research Centre LOT summer school Ultrasound, phonetics, phonology: Articulation for Beginners! With special thanks to collaborators Jane Stuart-Smith & Eleanor Lawson Joanne Cleland & Zoe Roxburgh Natasha Zharkova, Laura Black, Steve Cowen Reenu Punnoose, Koen Sebreghts Sonja Schaeffler & Ineke Mennen Conny Heyde Alan Wrench (aka Articulate Instruments Ltd) for AAA software and UTI hardware Various funding – thank you to ESRC, EPSRC, QMU June 2013

description

LOT summer school Ultrasound, phonetics, phonology: Articulation for Beginners!. With special thanks to collaborators Jane Stuart-Smith & Eleanor Lawson Joanne Cleland & Zoe Roxburgh Natasha Zharkova , Laura Black, Steve Cowen Reenu Punnoose , Koen Sebreghts - PowerPoint PPT Presentation

Transcript of LOT summer school Ultrasound, phonetics, phonology: Articulation for Beginners!

Page 1: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

James M ScobbieCASL Research Centre

LOT summer schoolUltrasound, phonetics, phonology:

Articulation for Beginners!

With special thanks to collaboratorsJane Stuart-Smith & Eleanor Lawson

Joanne Cleland & Zoe RoxburghNatasha Zharkova, Laura Black, Steve Cowen

Reenu Punnoose, Koen SebreghtsSonja Schaeffler & Ineke Mennen

Conny HeydeAlan Wrench (aka Articulate Instruments Ltd) for AAA software and UTI hardware

Various funding – thank you to ESRC, EPSRC, QMU

June 2013

Page 2: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Structure

• Introduction to articulation• Brief overview of techniques• Ultrasound tongue imaging• Playtime• Technical issues and the nitty gritty of data• Maybe a linguistic illustration

– Malayalam liquids

Page 3: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Technical issues

• Different laboratories have different solutions• Exemplification will be based around current

practice at QMU / Articulate Instruments Ltd• Topics (mostly in this order)

– Resolution, fixed aspect ratio representations– Up, down and horizontal…the bite plane– Quick averaging multiple tongue surfaces– Statistical testing of difference between averages– Two tongues, synching, de-interfacing– Video-rate vs. (ultra) high speed ultrasound

Page 4: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Spatial resolution around the curve

• More echo-pulse beams / scanlines means more resolution in a circumferential direction– Let’s assume 1 scanline each 2° (180 in a circle)– Scanlines get further apart the further they are from

the probe• At 90mm from probe centre, resolution is 3.14mm • At 60mm, resolution is 2mm• 45mm it is 1.6mm

– To maintain these resolutions…• A 90° field of view would need 46 scanlines• A 135° field of view would need 69 scanlines

Page 5: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Spatial resolution along the radii

• More sample points means more resolution in a radial direction– 8cm depth with 256 sample points = 0.3mm/point

• Assuming enough pixels to represent each point

Page 6: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

150 s-lines @ 0.9°, FoV 135°, 57fps

Page 7: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

50 s-lines @ 2.7°, FoV 135°, 166fps

Page 8: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Rectangular images

• The fan shape is presented on a rectangular screen, and occupies a proportion of that space

• A TV image has a certain number of data points horizontal / vertical (e.g. in NTSC)

• These are digitised into pixels at a given resolution…

• Horizonatal in the head is not the same thing as being parallel to the x-axis in the rectangle!

Page 9: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Harrington, Kleber & Reubold 2011

• Approximate location of EMA coils in analysis of /u/ fronting in SSBE

Page 10: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Harrington et al

• Approximate location of EMA coils in analysis of /u/ fronting in SSBE – 2-4mm back/below /i/

Page 11: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

UTI single SSE speaker

• Example of a UTI vowel space, rotated to occlusal bite plane, with average curves (± 1sd)

• Left pane is standard view, right the UTI view…

Page 12: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Finding the “horizontal”

• Use a “bite plate” to detect the unique occlusal plane for each speaker, as in typical in EMA

• Flat plane defined on upper dentition surface• Also provides common origin as well as axes

• Scobbie, Lawson, Cowen, Cleland & Wrench (2012) ms. – I might be able to put this online…

Page 13: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

wikipedia

In humans, the directions "rostral" and "caudal" often become confused with anterior and posterior, or superior and inferior. The difference between the two is most easily visualized when looking at the head, as can be seen in the image to the right. From the most caudal of positions in the nervous system (of a person) to a nearby, rostral area, it is equally accurate to say the area in question is rostral as to say it is superior. However, in the frontal lobes of the telencephalon, to say an area is rostral to a nearby area is equivalent to saying it is anterior. Those two lines lie on planes perpendicular to one another. This occurs, as becomes clear in the diagram, due to the intuitive yet curious curving "C" shape of rostrocaudal directionality when discussing the human brain.

Page 14: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Occlusal biteplane trace

bite plate

Page 15: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

30

40

50

60

60 70 80 90 100 110

s1

s2

s3

s4

s5

s6

Variation in bite plane

s1 s2 s3 s4 s5 s6Occlusal slope -8° -18° -23° -13° -22° -27°Distance from probe

surface (mm) 31 44 43 35 42 40Angular offset of rear 92° 82° 83° 77° 80° 69°

• Six young adult female speakers• Varying slopes

(mean 18.5°)• Varying

vertical offset• Varying

horizontal offset

back of plate

Page 16: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Overlay of 6 hard palates

• Mean hard palate trace (black) and biteplane trace grey), automatic curve fitting

Page 17: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Palates normalised to bite planes

• Normalised (translation and rotation) to rear of bite plane and relocation of origin (+45mm)

• Better palate trace alignment, with one “rogue”

Page 18: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Alternatives

• Palates can be used to orientate between sessions, by swallowing (e.g. water or yoghurt)

• Longitudinal, within-speaker– Just line up the palates!– Easy, huh?!

• Cross-speaker– Might be better than bite plate when worrying about

close approximation constrictions– Bite plate might be better for open approximation

• The probe can be moved instead • A consistent articulation can be used, eg [u]

Page 19: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Upright / supine

• MRI data is collected supine – does it matter?• Upright L and supine R “pop” vowel• Wrench et al 2011

Page 20: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!
Page 21: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Summary

• 6 female speakers, varied accents• 5 reps of pep and of pop in randomised list of

vowels• 4 blocks, repeating upright/supine set twice

– Upright first for 3, supine first for 3• Pharyngeal slump under gravity of about 3mm• And a couple of cases of blade raising

Page 22: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Averaging tokens within-speaker

Page 23: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Averaging within AAA

• Averaging along 42 fan-grid radii, “parallel” to scan-lines / echo-pulse beam from the probe

Page 24: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Tokens of [s] from /si/

• n tokens along radius r

Page 25: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Tokens of [s] from /sa/ and /si/

• vs. a different condition

Page 26: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

2 groups of curves

• t-test of the difference between mean tongue contour at crossing point at each fan line

• 2-tailed test assuming unequal variances and unequal sample sizes

• No Bonferroni or other corrections• Up to 5 or 6 adjacent radii, mean distance from

probe is correlated, perhaps indicating non-independence of such “close” measures

• For a linguistic interpretation of difference, 5 or 6 adjacent radii, all at p<0.05 on t-test is more important than p<0.0001 on one radius

Page 27: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Pilot correlation v1

• 2 speakers, 4 frames each• 42 radii per speaker…• What % of correlations between two random radii are

significant, depending on the distance between them• Radial distance • Grand mean• All parts of

tongue pooled• More cases of

adjacent than longdistancecomparisons

0%

20%

40%

60%

80%

100%

0 1 2 3 4 5 6 7 8 9 10Gap between radi i

% o

f cor

rela

tions

that

are

si

gnifi

cant

Page 28: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Pilot correlation v2

• A range of 9 varied tongue shapes (9 single frames) from each speaker

• 4 samples for each frame – roughly equally spaced

• Is there a correlation for fans 10 apart?

• 9? 8? 7? …

• Pilot B (NI1)

Page 29: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

• 3 attempts – more long-distance significance found

• One sectoron the fan is 7 fanlines

A B C# fans 8 7 4

Page 30: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

What to try next?

• Just two sample points per frame, front and back?

• Pilot 2 A = 9 fans rather than 8 were significant (n=18 observations, so lower values of r were significant)

• Or one in the middlelooking forwards and backwards?

• Or use many more target types?

• Or ones that show more subtle differences, such as a set of CV transitions, including every frame, not just varied targets

Page 31: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Tokens of [s] from /sa/ and /si/

• Raw tongue curves again

Page 32: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Mean /sa/ vs. /si/

• Significant root advancement (~5mm) and palatalisation (~4mm) in /si/

• More than 5 adjacent fans where p<0.05, but in 2 areas

Page 33: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Mean /sa/ vs. /si/

• SS-ANOVA best fit lines (∓ 95% c.i.) - Davidson

Page 34: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Mean /sa/ vs. /si/

• Exploring treating >5 fan lines at p<0.05 as categorically “significant” but quantifying it all:– Including crossing/pivot points– Ignore significance if curves are low

confidence– Quantify length of the significant tongue

surface– Estimate total

difference in area

Page 35: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Single speaker (SSE) Neutral space

• Thick lines for means – cf overlap, non overlap, and crossings

Page 36: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

The two tongue problem

• Wrench & Scobbie (2006) list some of the problems with video-ultrasound resulting from buffering multiple probe scans into one image– More than one scan from the probe in an image– Partial scans from the probe in an image– Don’t forget 30fps is about 33ms, so synch is vague

• Some solutions,– Use raw probe data (cine loop) but this costs €– Use a high scan rate (more than twice NTSC) and

then deinterlace the video to 60fps– Halves vertical spatial resolution (rectangular up)

Page 37: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Video digital capture & buffering

• The scanner scans and makes screen images

Page 38: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Video digital capture & buffering

• The scanner scans and makes screen images

Page 39: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Plain 30fps video

• In these images, two apparent tongues show the effect of two scans in the same buffer, on odd and even video “lines”

Page 40: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Solutions

• Deinterlace video images to 30fps (16ms or so)• “Cineloop” digital output can be stored locally

on US scanners– Full rectangular cine images– Approx 15 second chunks– Continuous audio recordings need post-processing

alignment• AAA / QM Ultrasonix-based system

– Data stored as raw probe echo-pulse returns– Synchronised at source with audio at each frame– Video channel freed up, and can be used to capture

lip videos

Page 41: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

High speed

• 76 scan lines, 100fps, FoV 112°

Page 42: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Ultra-high speed

• 39 scan lines, 196fps, FoV 112°

Page 43: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Ultra-high speed

• 25 scan lines, 306fps, FoV 112°

Page 44: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

“dog” – ultra high speed 382fps

time

g

ɔd

back

front

Page 45: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

“tongue blade height” during [d]

• hs-UTI @ 382fps & video @ 60fps, 300ms– Constriction-tracking, comparable to but different to

flesh-point tracking

Page 46: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Demo videos

• Video demo, deinterlaced lip camera 60fps [folder]– UTI old dutch and labialised english r [link]– Lip ultrax kids [link] – deinterlaced ring [link]

• High speed UTI 100fps – Malayalam retroflex lateral [folder]

• Slomo [link]• Slomo with spline [link]• Real speed with spline [link]

Page 47: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Single frame targets

• Two darker (tongue root) liquids, L /ɭ/ and R /r/• Three clearer (ATR, ~pal’ised) l /l/, r /ɾ/, 5 zh

Page 48: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

High speed (100fps)

• Malayalam trill /r/ R between /a/– Left = closing half of gesture– Right = opening half

• Note trill motion in blade and stable root

Page 49: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

High speed (100fps)

• Malayalam tap /ɾ/ between /a/

• Note greater movement in root, pivot point

Page 50: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

High speed (100fps)

• Malayalam retroflex flap /ɭ/• Stable root, mobile blade, slower approach with

very fast release (nb some UTI artefacts) of over 400mm/sec peak velocity

Page 51: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Gestural speed

• Unlike EMA, it’s hard to quantify kinematics• Need to explore / compare with EMA

Page 52: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Root stability?

• Positional examinations are easier• Retroflex flap and trill both have a very stable

root, which could be due to– Posterior bracing to enable the anterior movement– Coincidental, because the context was /a__a/ and

these liquids have a dark resonance in Malayalam• We can compare /a__a/ to /a__i/

Page 53: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Retroflex lateral flap in a__a

• Green = prevvowels andformation ofmaximally retracted “target” (black)

• Red = duringthe flap

• Purple = afterwards

Page 54: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

• Green = prevvowels andformation ofmaximally retracted “target” (black)

• Red = duringthe flap

• Purple = afterwards

Retroflex lateral flap /ɭ/ in a__i

High spatial accuracy when orthogonal to

beam

Lower spatial accuracy when parallel to beam

Page 55: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Comparison

• Overlap:during period from target toacoustic transition– dark aLa – light aLi

• How should weto quantify?

• No sig differenceanteriorly but…?

Page 56: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Single frame targets

• Two darker (tongue root) liquids, L /ɭ/ and R /r/• Three clearer (ATR, ~pal’ised) l /l/, r /ɾ/, 5 zh

Page 57: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Next to an /i/ vowel

• Two darker (tongue root) liquids, L /ɭ/ and R /r/• Root advancement and some palatalisation

Page 58: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Next to an /i/ vowel

• Three clearer (ATR, ~pal’ised) l /l/, r /ɾ/, 5 zh • Root advancement and more palatalisation

Page 59: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Summary

• Support for Punnoose’s acoustic findings of dark vs. light resonances in the liquid system,– Tongue root– Palatal dorsal area

• Apparent tongue-root bracing for trill and retroflex lateral flap in an /a_a/ context is associated with these being dark consonants– There is steady dynamic root coarticulation in /a_i/

• Both light and dark liquids coarticulate but don’t overlap

Page 60: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Any time for any more?

Page 61: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

ULTRAX – adding missing pieces

• Ultrasound misses a great deal of information!• ULTRAX project to obtain corpus

of 12 speakers in MRI / UTIto build real-time model

• Renals & Richmond @ CSTR

Page 62: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

• Could be used for head-movement correction within the midsagittal plane and/or

• Analysis of lip kinematics

Headset-mounted camera

Page 63: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

• Estimate based on oval model of internal 2D labial aperture, 60fps (~17ms per frame)

Coronal “cross-sectional area”

Page 64: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Orientation-free measures

• Measures of curviness of the tongue may escape the image-orientations problem

• Mielke’s concavity & Zharkova’s dorsal bulge (and others) offer speaker-internal unoriented analysis– But there is a worry about front/back of tongue

being needed, since end-points can be arbitrary

Page 65: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

How do /t/ and /k/ differ?

• For a variety of work, it is nice to compare a speaker’s productions against a kind of norm

• ULTRAX group 1 corpus of 30 children offer useful dataset– ata, iti, oto vs. aka, iki, oko– Speaker-internal ratio of /k/-/t/, along fanlines– Should show extra dorsal distance in /k/ and extra

alveolar distance for /t/

Page 66: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

results

• For example, /k/-/t/ of /a/ can be averaged by lining up the maximum excursion point

• These are not tongue surfaces! Nor in a fan!

Page 67: LOT summer school Ultrasound, phonetics, phonology:  Articulation for Beginners!

Results – anterior to left

max_dor alv dor-max phara -4.0 11.9 -3.5i -3.6 7.5 -3.1o -9.3 12.0 -0.1