Designing and Testing Voice Interfaces through Microphone ... · – Microphone Array Design –...

Designing and Testing Voice

Interfaces through Microphone Array

Modeling, Audio Prototyping, and

Text Analytics

Vidya Viswanathan

What Device Is This?

Overview of the System

Audio AcquisitionSpeech

PreprocessingPost Processing

Detected

Speech

Steering

Time-Domain

Signal

How Can I…

1. Design a voice interface?

2. Validate if my voice interface can work

in real-life scenarios?

3. Test the performance of my system?

Choice of Microphones

▪ Single Microphone

– Fixed directivity in a given

direction

– Noise cancellation is non-

trivial

▪ Array of Microphones

– Can control the directivity

for a specific direction

– Noise cancellation is

easier

Microphone Array Geometries

Uniform Linear Array

Uniform Circular Array

Uniform Rectangular Array

and many more…

Design a Microphone Array System

…starting from a given array hardware

▪ BlueCoin from ST Microelectronics

4x MEMS

Microphones

Microphone Array Design- Where to Start?

Array Design and

Analysis

– Element definition

– Array geometry

definition

– Array shading/tapers

– Array steering

– Radiation patterns in

different forms

Choosing between different array geometries

How do you get a Cardioid Pattern?

Differential beamforming

𝜏 =𝑑

𝑧−𝜏

Beam Steering – Does this work?

Variable Array Configuration

Time Domain Simulation of an Array System

Sound sourceSound source

Time Domain Simulation of an Array System

Direction of Arrival Estimation Phased Array System Toolbox

▪ ULA

▪ Sum and difference monopulse

▪ Beamscan, MVDR (Capon)

▪ High resolution (ESPRIT, Root MUSIC, etc...)

▪ URA

▪ Sum and difference monopulse

▪ Beamscan, MVDR (Capon)

▪ Conformal arrays

▪ Beamscan

▪ MVDR (Capon)

How Can I…

Constrained Simulations vs. Real Life

Validating in Real-Life Scenarios

1. Including the Room Impulse Response

• Record the audio in a constrained environment

• Include the Room Impulse Response (RIR)• Generate RIR

• Measure RIR

2. Live Acquisition

• Connect a microphone array

• Acquire speech signals live in a real-life environment

Integrating the Room Impulse Response

Room Impulse Response Simulation

Image Source Method (ISM) for simulation of

Room Impulse Response (RIR) in small-room

acoustics

Link to the code

Impulse Response Measurer App

Measure impulse and frequency responses of

electrical and acoustic systems with:

▪ Maximum-Length Sequences (MLS)

▪ Exponentially Swept Sinusoids (ESS)

Live Acquisition from Hardware

%% Live Audio Acquisition and Streaming

fs = 16000;

tscope = dsp.TimeScope('SampleRate',fs);

readaudio =

audioDeviceReader('SampleRate',fs,'NumChann

els',4,...

'Device', 'Microphone (STM32 AUDIO

Streaming in FS Mode)');

% Set block duration

readaudio.SamplesPerFrame = 1024;

while isVisible(tscope)

% Acquire live

in = readaudio();

% Visualize in real-time

tscope(in);

release(readaudio)

Creating Audio Testbench

▪ Debug your audio

plugin

▪ Time and frequency

domain visualization

of your processing

▪ Option to bypass

algorithm under test

for interactive A/B

testing

Reuse your design in Digital Audio WorkstationsGenerating VST Plugins

VST plugin

How Can I…

How To Measure Performance?

"91.5% of spoken

sentences correctly

converted"

Output audio

"sounds good"

Test performance with speech-to-text services

>> [samples, fs] = audioread('helloaudioPD.wav');

>> soundsc(samples, fs)

>> speechObject = speechClient('Google','languageCode','en-US');

>> outInfo = speech2text(speechObject, samples, fs);

>> outInfo.TRANSCRIPT =

'hello audio product Developers'

>> outInfo.CONFIDENCE =

0.9385

Speech Dataset

Voice Interface

Cloud Services

Speech to Text

Importing data

into MATLAB

Connecting

MATLAB to Cloud

Connecting to Cloud Services for Speech Recognition

▪ Google® Speech API

▪ IBM® Watson Speech API

▪ Microsoft® Azure Speech

Speech-To-Text – Access 3rd-party web services from MATLAB

▪ Automate content labelling of speech datasets

▪ Validate speech enhancement algorithms for transcription performance

▪ Run text analytics on auto-transcribed voice recordings

▪ Choice of

– Google® Speech API

– IBM® Watson Speech API

– Microsoft® Azure Speech API

▪ Requires separate credentials

for service provider of choice

https://www.mathworks.com/matlabcentral/fileexchange/65266-speech2text

Speech Dataset

Voice Interface

Cloud Services

Speech to Text

Importing data

into MATLAB

Connecting

MATLAB to Cloud ✓

Building a small speech dataset quickly

Example: an App with automated content labelling*

*See also Dataset Recorder App prototype in example "Record

Audio Datasets" (From R2018a in Audio System Toolbox)

Speech Command Recognition with Deep Learning

(New Example)

▪ Train a Convolutional Neural Network

(CNN) to recognize speech commands

▪ Work with Google's speech command dataset

▪ Leverage helper code for:

– Reading and managing large datasets

(New audio datastore prototype)

– Transforming 1D signals into 2D images via perceptually-aware spectrograms

(New auditory spectrogram prototype)

▪ Prototype trained network in real-time on live audio

To Learn More about Deep Learning with MATLAB

How Can I…

1. Design a voice interface?– Microphone Array Design

– Beamforming and Direction of Arrival Estimation

in real-life scenarios?– Live Acquisition from hardware

– Leveraging Audio Plugins for performance improvement

3. Test the performance of my system?– Using Cloud Services for Speech Recognition

– Creating your own speech dataset

Phased

System

Toolbox

System

Toolbox

MATLAB

Signal Processing with MATLABThis two-day course shows how to analyze signals and design signal processing systems using

MATLAB®, Signal Processing Toolbox™, and DSP System Toolbox™.

Topics include:

▪ Creating and analyzing signals

▪ Performing spectral analysis

▪ Designing and analyzing filters

▪ Designing multirate filters

▪ Designing adaptive filters

Call to ActionVideos and Webinars

▪ Real-time Audio Processing for Algorithm

Prototyping and Custom Measurements

▪ Reusing and Prototyping Code to

Accelerate Innovation: Smart Voice

Interfaces

• Share your experience with MATLAB & Simulink on Social Media

▪ Use #MATLABEXPO

▪ I use #MATLAB because……………………… Attending #MATLABEXPO▪ Examples

▪ I use #MATLAB because it helps me be a data scientist! Attending #MATLABEXPO

▪ Learning new capabilities in #MATLAB and #Simulink at #MATLABEXPO.

• Share your session feedback: Please fill in your feedback for this session in the feedback form

Speaker Details

Email: Vidya.Viswanathan@mathworks.in

LinkedIn:

www.linkedin.com/in/vidyaviswanathan

Contact MathWorks India

Products/Training Enquiry Booth

Call: 080-6632-6000

Email: info@mathworks.in

Designing and Testing Voice Interfaces through Microphone ... · – Microphone Array Design –...

Documents

Transcript of Designing and Testing Voice Interfaces through Microphone ... · – Microphone Array Design –...

Beamforming Microphone Array Installation Guide - … Beamfor… · 2 Beamforming Microphone Array » NOTE: The PoE connection is only for power, not control. Power must be supplied

2011 Beamforming regularization matrix and inverse problems applied to sound field measurement and extrapolation using microphone array

Microphone Array Design and Beamforming · Microphone Array Design and Beamforming Heinrich Löllmann Multimedia Communications and Signal Processing heinrich.loellmann@fau.de with

Microphone Array Wiener Beamforming with emphasis on Reverberation · 2018-06-10 · Microphone Array Wiener Beamforming with emphasis on Reverberation Vamsynagh Pedamallu This thesis

Beamforming Microphone MAS-A100 - pro.sony

Shure KSM313 Dual-Voice Ribbon Microphone User Guide

DESIGNING OPTIMIZED MICROPHONE BEAMFORMERS · PDF fileFigure 2: Beamforming performance of a microphone array from a look angle of 90° off-axis. Yellow sections indicate the strongest

BEAMFORMING MICROPHONE ARRAY - VideoCentric...The Beamforming Microphone Array is the Pro-Audio industry’s first professional-grade microphone array with beamforming and adaptive

Portable 512 MEMS-Microphone-Array for 3D-Intensity- and Beamforming-Measurements ... · 2020. 2. 18. · BeBeC-2020-D27 Portable 512 MEMS-Microphone-Array for 3D-Intensity- and Beamforming-Measurements

Diploma Thesis Modal beamforming using planar circular ... · Diploma Thesis Modal beamforming using planar circular microphone arrays Markus Zaunschirm Graz, March 2012 Supervisor:

Beamforming Microphone Array 2€¦ · Large Lecture Halls Training Centers 2nd-Generation Mic Array ... Beamforming Mic Array 2 Beamforming Mic Array 2 DIALOG 20 Receiver PoE Power

Clustered Blind Beamforming From Ad-Hoc …...Clustered Blind Beamforming From Ad-Hoc Microphone Arrays Background Background Beamformer Beamforming is an eﬀective method of spatial

BEAMFORMING MICROPHONE, USER INTERFACE, AND MORE

Beamforming Microphone Array Installation Guide - … BMF2 Ceiling Mount... · the weight of the Beamforming Microphone Array and its mounting hardware. support to center support

Beamforming Microphone Array Installation Guide › univers › Documents › CL-BFM2-KIT24B... · 2015-02-09 · The Beamforming Microphone Array supports use in ceiling-mounted

Time-of-arrival estimation for blind beamforming...1) Traditional beamforming / beam steering 2) Ad-hoc microphone arrays 3) Three ad-hoc array beam steering methods – Time-of-Arrival

DESIGN OF OPTIMAL LINEAR DIFFERENTIAL MICROPHONE ARRAYS …2019... · 2019. 6. 25. · tivity factor (DF) than the traditional superdirective beamforming methods. Differential microphone

Features - ClearOneclearone.com/uploads/resource/COLLABORATE_RoomPro_500_600_DS… · > 620 includes Beamforming Microphone Array and ... VIDEO FEATURES ... Through API SECURITY &

UHF Wireless Microphone System - Electro-Voice

Professional Conferencing Microphones - ClearOne · Tabletop Microphones Ceiling Microphone Array Wireless Microphone Systems Beamforming Microphone Array ClearOne offers a full line