Tools for Speech Analysis Julia Hirschberg CS4706 Thanks to Jean-Philippe Goldman, Fadi Biadsy.
Tools for Speech Analysis
description
Transcript of Tools for Speech Analysis
![Page 1: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/1.jpg)
Tools for Speech Analysis
Julia Hirschberg
CS4995/6998
Thanks to Jean-Philippe Goldman, Fadi Biadsy
![Page 2: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/2.jpg)
2
Goals
• Create stimuli for a perception experiment• Record subjects for a production experiment• Analyze data from an experiment or from a
natural speech corpus• Extract features from speech data for a machine
learning experiment
![Page 3: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/3.jpg)
3
Many Software Options
• Goldwave (audio editor)• Esps Xwaves (routines + visual.)• Praat (speech analysis)• Wavesurfer (speech editor)• Transcriber (annotation tool)• Matlab (general purpose soft)• OGI speech tools (routines + app. dev.)• …winpitch, pitchworks, phonedit, cooledit…..
![Page 4: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/4.jpg)
4
Links
• www.goldwave.com• www.speech.kth.se/software/#esps• www.praat.org• www.speech.kth.se/software/#wavesurfer• http://cslu.cse.ogi.edu/toolkit/• www.mathworks.com (Matlab)
• www.lpl.univ-aix.fr/~sqlab/ (phonedit)• http://www.kvraudio.com/get/696.html (PitchWorks)• www.winpitch.com (WinPitch)• http://www.adobe.com/special/products/audition/
syntrillium.html (CoolEdit > Audition)
![Page 5: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/5.jpg)
5
Our Choice: Praat
• Developed by Paul Boersma and David Weenink at the Institute of Phonetic Sciences, University of Amsterdam
• General purpose speech tool : editing, segmentation and labeling, prosodic manipulation, many tutorials, large user community, yahoo group
![Page 6: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/6.jpg)
6
![Page 7: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/7.jpg)
7
File Management
• Recording files and saving them– New menu
• Opening files– Read menu
• Long and short sound files• Other file types
– Write menu– Exercise: Record a file with your own name, play it to
check, call it ‘<your name>’, save it to list, write it to a .wav file on disk, remove it from the objects list, read it back in
![Page 8: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/8.jpg)
8
Editing Options from Objects Window
• Select and edit your name file• Spectrum:
– Show a spectral slice – Show a spectrogram
• Pitch: – Show pitch– Check the settings, change the range– Get pitch information: get pitch, get min/max pitch
• Intensity: – Get intensity information: similar to pitch functions– Check the settings
• Formant: Display
![Page 9: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/9.jpg)
9
Modifying the Data
• Changing the pitch contour of your name file:• Go to To manipulation• Edit the new object• Pitch Stylize pitch (2st)• Modify pitch by dragging points up and down• Modify duration:
– Add points in duration tier– Drag points up and down
• To save: File -> Publish resynthesis
![Page 10: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/10.jpg)
10
Annotation: Textgrids
• From objects window, w/ sound file selected– Annotate To textgrid– Point vs. interval tiers
• Add a point tier and an interval tier and insert some labels
• NB: remember to select the interval or point first in the waveform or spectrogram before trying to insert a label
![Page 11: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/11.jpg)
11
Scripting
• From history:– Praat new Praatscript Edit Paste
history– NB: you can run all or part of the script
• Writing scripts• Modifying existing scripts:
– Tutorials, scripts, resources, user groups, search
![Page 12: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/12.jpg)
Sample Praat Script
# This script will create a new text-grid for a wav fileform Make a text-grid for a .wav file comment Source Directory?
sentence Directory C:\Documents and Settings\julila\My Documents\ comment File name? sentence Filename comment Tier Name? sentence TierendformRead from file... 'directory$‘ ‘filename$'stem$ = left$(filename$,length(filename$)-4)select Sound 'stem$'To TextGrid... 'tier$' 'tier$‘ # tier names, which tiers are point tiersWrite to text file... 'directory$'\'stem$'.TextGridRemove
![Page 13: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/13.jpg)
Task 1
• Read in the file ‘mummy.wav’• Edit it• Zoom in on the most prominent • Select the entire contour
– Display the pitch and intensity contours– What is the minimum pitch? Maximum?
Mean?– What is the minimum intensity? Maximum?
Mean?
![Page 14: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/14.jpg)
Task2
• Record a statement• Record a yes-no question• Record a wh-question• What are the similarities? Differences?
![Page 15: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/15.jpg)
Task3
• Record something in a very loud voice, to produce clipping, and see what the waveform looks like – how do you identify clipping? Avoid it?
![Page 16: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/16.jpg)
Task4
• Record a file using falling intonation• Modify it to produce a rising intonational contour• Edit the new contour to
– Raise the pitch range (select the contour and use ‘shift pitch frequencies’
– Stylize the pitch contour
![Page 17: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/17.jpg)
Task 5: Pitch contour cloning
• Replace the pitch contour in beach.wav with the contour in speech.wav– Create a manipulation object for beach.wav– Extract the pitch tier– Create a manipulation object for speech.wav– Select the pitch tier of beach.wav and the
manipulation object for speech.wav and click ‘Replace pitch tier’
– Select the manipulation object for speech.wav and click ‘Get resynthesis…’
– Compare the original file and the new file
![Page 18: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/18.jpg)
Task6
• Record “My mama moonlights in Memphis” – As angry speech– As sad speech– As happy speech
• For each token answer the following:– What is the mean pitch? Maximum?– What is the mean intensity? Maximum?– What is the duration?– Do you see any differences in the F0 contour?
![Page 19: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/19.jpg)
Task 7
• Edit mummy.wav• Convert Change Gender• Filter filter (pass) Hann band
– Find a pass band that masks the words but retains the intonation
– Find a pass band that masks the intonation but retains the words
![Page 20: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/20.jpg)
20
Help
• Online help, FAQ, manual• Links from http://www.praat.org• Additional tutorials, scripts, resources, user
groups
![Page 21: Tools for Speech Analysis](https://reader035.fdocuments.us/reader035/viewer/2022062519/56814f30550346895dbcbef2/html5/thumbnails/21.jpg)
21
Next Class
• Report on your results from the seven tasks