Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of...

13
Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003

Transcript of Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of...

Page 1: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Improved 3D Sound Deliveredto Headphones Using Wavelets

By

Ozlem KALINLI

EE-SystemsUniversity of Southern California

December 4, 2003

Page 2: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Outline:

Introduction

Work

Results

Conclusion

Page 3: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Immersive Audio Environments Transport listener into the same sonic environment as the event

o Multiple, spatially-distributed

sound sourceso Head and source motiono Room Acoustics

Virtually listening environmentso Synthetic acoustic images (headphones or loudspeakers)o Simulated directional sound informationo Simulated room acoustics

Introduction

Immersive Reproduction of 3D Sound Scheme

Page 4: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Head Related Transfer Function (HRTF)

Head Related Transfer Function (HRTF)

Special transformation of a source from a point in free space to the listener’s eardrums.

HRTF measurements are computed using a dummy head (KEMAR)

Used for sound localizationSound Transmission from Source to Listener

Introduction

Page 5: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Sound Localization

Localization of sound, cues:o Interaural time difference

(ITD) , dominant below 1.5 kHz

o Interaural intensity difference (IID), dominant above 3 kHz

Reasons:o Path length differenceo Head Shadowingo Reflection of Head

Introduction

Page 6: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Main Work Goal of Work: To obtain a better sound diffusion

from the mono-sound recorded at an anechoic chamber

System Toolso Use HRTF to localize

sound, 30o azimuth, and 0o elevationo Use wavelet filter banks with

time delay at the lowest frequency (below 1.5 kHz) to get the sound diffusion (adding reverberant sound)

Work

Page 7: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Overall System

Work

• Fs= 44.1 kHz, 16 bit

• 5 Stages of dyadic tree to get the signal below 1.5 kHz

• Daubechies wavelets, with filter tap 16

• Delay time 7.25 ms

Page 8: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Simulation Results 4 different types of audio signals are tested Piano, guitar, classical music, pop song

Time Domain Waveforms for Piano Sound (Left Channel)

(a) HRTF Sound (b) Delayed Sound with Wavelet (c) Final Sound

Results

Page 9: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Results for Piano Sound Subjective Listening Tests

Relation Between Time Delays and Correlation Coefficient

0

1

2

3

4

5

6

people

More Diffusive

Listening Tests

HRTF Sound

Delayed Sound

Final Sound

0

1

2

3

4

people

Sounds Better

Listening Tests

HRTF Sound

Delayed Sound

Final Sound

Time Delay [ms]

Correlation Coefficient

Delayed Sound Final Sound

7.25 -0.3994 0.3577

14.5 -0.3235 0.3002

17.4 -0.3566 0.3377

Results

Page 10: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Other Work Done

Sound localized at 110o of azimuth with 0o elevation is also tested, since surround sound is desired at the + 110o and - 110o

o Listening test results similar to the 30o of azimuth

o Relation Between Time Delays and Correlation Coefficient

Time Delay [ms]

Correlation Coefficient

Delayed Sound Final Sound

7.25 -0.2905 -0.0803

14.5 -0.2024 -0.1194

17.4 -0.2894 -0.1254

Results

Page 11: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Results for Piano Sound

Original sound, Mono HRTF-30

o Test signal (no delay)o Delayed Sound (7.25 ms)o Final Sound

HRTF-110o Delayed Sound (7.25 ms)o Final Sound

7.25 ms 14.5 ms 17.4 ms

7.25 ms 14.5 ms 17.4 ms

Results

Page 12: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

Conclusion Introducing delay in the frequency band

below 1.5 kHz produces reverberant sound

The final sound is better than HRTF sound in sense of the sound diffusion.

Depending on the audio characteristic, the optimum delay time to obtain de-correlated sound (small correlation coefficient) may vary.

When the delay is very high, it simulates big halls.

Conclusion

Page 13: Improved 3D Sound Delivered to Headphones Using Wavelets By Ozlem KALINLI EE-Systems University of Southern California December 4, 2003.

References “Improved 3D Sound Using Wavelets”, U. P. Chong, H.

Kim, K. N. Kim, IEEE Information Systems and Technologies, 2001.

“HRTF Measurements of a KEMAR Dummy-Head Microphone”, MIT Media Lab Perceptual Computing- Technical Report #280.

“HRTF Measurements of a KEMAR Dummy-Head Microphone”, http://sound.media.mit.edu/KEMAR.html

“Virtually Auditory Space Generation and Applications”, Simon Carlie, Chapman and Hall, 1996.