DETECTION AND LOCALIZATION OF SOUND USING ADAPTIVE ... and... · Speech enhancement is one of the...

10
INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC) ISSN-2455-099X, Volume 2, Issue 6 June 2016. IJTC201606002 www. ijtc.org 252 DETECTION AND LOCALIZATION OF SOUND USING ADAPTIVE LEARNING TECHNIQUE Anil Kumar 1 , Dr. Manish Mann 2 1 Student, M.Tech, Department of CSE, L.R.I.E.T Solan, India 2 Associate Professor, Department of CSE, L.R.I.E.T Solan, India 1 [email protected] Abstract: In the field of artificial intelligence, Adaptive Learning Technique refers to the combination of artificial neural networks. In this research paper the Adaptive Learning Technique has been implemented to carry out the Detection and Localization of Sound (S). In this technique two methods are used to detect the pure sound, In the first method wiener filter are used to reduce the amount of noise in a signal and minimize the mean square error (M.S.E), And in the second method wiener with bacterial foraging optimization are used for effectiveness in sound. These proposed methods are compared and the results reveal its superiority. Keywords: Adaptive Learning Technique, Detection and Localization of Sound, Wiener filter, Wiener with Bacterial Foraging optimization, SNR, MSE. I. INTRODUCTION Sound confinement is the strategy for elucidating the position of a sound source. It is an audience's capacity which is utilized to discover or recognize the careful area or cause of an identified sound in heading and separation. The mind uses slight contrasts in timing signs quality and unearthly to permit us to limit sound source. The 3-dimensional position idea is utilized to express the restriction. To start with is the point of azimuth or level edge, second is the edge of height or vertical edge and last is separation or speed. The distinction in the passageway times between the ears, by the total sufficiency of high-recurrence sounds and by the unpredictable unearthly reflections from different parts of our bodies, including middle, shoulders, and so forth is known as the azimuth. Separation prompts are characterized as the loss of adequacy, loss of high frequencies, and the straight's proportion sign to the resounded sign is known as the separation signals. These signs are effectively interaual prompts. This is the little distinction between the two ears. It is depending on where the source is put, in this our head goes about as an obstruction to change the tone, force, and unearthly characteristics of the sound, assisting the with braining change where the sound radiated from. The scope of Lower frequencies, with longer wavelengths, diffracts the sound around the head and compelling the cerebrum to concentrate just on the staging signs from the source [1] IJTC.ORG

Transcript of DETECTION AND LOCALIZATION OF SOUND USING ADAPTIVE ... and... · Speech enhancement is one of the...

INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC)

ISSN-2455-099X,

Volume 2, Issue 6 June 2016.

IJTC201606002 www. ijtc.org 252

DETECTION AND LOCALIZATION OF

SOUND USING ADAPTIVE LEARNING

TECHNIQUE

Anil Kumar1, Dr. Manish Mann2

1Student, M.Tech, Department of CSE, L.R.I.E.T Solan, India

2Associate Professor, Department of CSE, L.R.I.E.T Solan, India

[email protected]

Abstract: In the field of artificial intelligence, Adaptive Learning Technique refers to the combination of

artificial neural networks. In this research paper the Adaptive Learning Technique has been implemented

to carry out the Detection and Localization of Sound (S). In this technique two methods are used to detect the

pure sound, In the first method wiener filter are used to reduce the amount of noise in a signal and minimize

the mean square error (M.S.E), And in the second method wiener with bacterial foraging optimization are

used for effectiveness in sound. These proposed methods are compared and the results reveal its superiority.

Keywords: Adaptive Learning Technique, Detection and Localization of Sound, Wiener filter, Wiener with

Bacterial Foraging optimization, SNR, MSE.

I. INTRODUCTION

Sound confinement is the strategy for elucidating the position of a sound source. It is an audience's capacity

which is utilized to discover or recognize the careful area or cause of an identified sound in heading and

separation. The mind uses slight contrasts in timing signs quality and unearthly to permit us to limit sound

source. The 3-dimensional position idea is utilized to express the restriction. To start with is the point of

azimuth or level edge, second is the edge of height or vertical edge and last is separation or speed. The

distinction in the passageway times between the ears, by the total sufficiency of high-recurrence sounds and

by the unpredictable unearthly reflections from different parts of our bodies, including middle, shoulders, and

so forth is known as the azimuth. Separation prompts are characterized as the loss of adequacy, loss of high

frequencies, and the straight's proportion sign to the resounded sign is known as the separation signals. These

signs are effectively interaual prompts. This is the little distinction between the two ears. It is depending on

where the source is put, in this our head goes about as an obstruction to change the tone, force, and unearthly

characteristics of the sound, assisting the with braining change where the sound radiated from. The scope of

Lower frequencies, with longer wavelengths, diffracts the sound around the head and compelling the

cerebrum to concentrate just on the staging signs from the source [1]

IJTC.O

RG

INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC)

ISSN-2455-099X,

Volume 2, Issue 6 June 2016.

IJTC201606002 www. ijtc.org 253

a. Sound Enhancement

Speech enhancement is one of the most important topics in speech signal processing. Several techniques have

been proposed for this purpose like the spectral subtraction approach, the signal subspace approach, adaptive

noise canceling and the iterative Wiener filter. The performances of these techniques depend on the quality

and intelligibility of the processed speech signal. The improvement in the speech signal-to-noise ratio (SNR)

is the target of most techniques. Spectral subtraction is the earliest method for enhancing speech degraded by

additive noise. This technique estimates the spectrum of the clean (noise-free) signal by the subtraction of the

estimated noise magnitude spectrum from the noisy signal magnitude spectrum while keeping the phase

spectrum of the noisy signal. The drawback of this technique is the residual noise.

Another technique is the signal subspace approach. It is used for enhancing speech signals degraded by

uncorrelated additive noise or colored noise. The idea of this algorithm is based on the fact that the vector

space of the noisy signal can be decomposed into a signal plus noise subspace and an orthogonal noise

subspace. Processing is performed on the vectors in the signal plus noise subspace only, while the noise

subspace is removed first. Decomposition of the vector space of the noisy signal is performed by applying the

singular value decomposition or the Karhunen-Loeve transform (KLT) on the speech signal. Mi et al. have

proposed the signal/noise KLT based approach for the removal of colored noise [9].

b. Sound Effectiveness with Bacterial Foraging

During foraging of the real bacteria, locomotion is achieved by a set of tensile flagella. Flagella help an E.coli

bacterium to tumble or swim, which are two basic operations performed by a bacterium at the time of foraging

.When they rotate the flagella in the clockwise direction, each flagellum pulls on the cell. That results in the

moving of flagella independently and finally the bacterium tumbles with lesser number of tumbling whereas

in a harmful place it tumbles frequently to find a nutrient gradient. Moving the flagella in the

counterclockwise direction helps the bacterium to swim at a very fast rate.

In the above-mentioned algorithm the bacteria undergoes chemotaxis, where they like to move towards a

nutrient gradient and avoid noxious environment. Generally the bacteria move for a longer distance in a

friendly environment.

When they get food in sufficient, they are increased in length and in presence of suitable temperature they

break in the middle to from an exact replica of itself. This phenomenon inspired Passino to introduce an event

of reproduction in BFOA. Due to the occurrence of sudden environmental changes or attack, the chemotactic

progress may be destroyed and a group of bacteria may move to some other places or some other may be

introduced in the swarm of concern. This constitutes the event of elimination-dispersal in the real bacterial

population, where all the bacteria in a region are killed or a group is dispersed into a new part of the

environment [11]

Hamed Shamsi and I. Yucel Ozbek. Have presents most of heart sound cancellation algorithms to improve the

quality of lung sound use information about heart sound locations. Therefore, a reliable estimation of heart

IJTC.O

RG

INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC)

ISSN-2455-099X,

Volume 2, Issue 6 June 2016.

IJTC201606002 www. ijtc.org 254

sound localizations within chest sound is a key issue to enhance the performance of heart sound cancellation

algorithms. In this paper, we present a new technique to estimate locations of heart sound segments in chest

sound using the temporal fuzzy c-means (TFCM) algorithm. [18]

A lot of research has been work out on the locations of sound, but still there are being many problems to

provide pure sound. So, to overcome these problems a proposed adaptive learning technique is used to detect

and localize the sound from different sounds and we filtering the sound interference using wiener and wiener

with bacterial foraging methods.

In this thesis adaptive learning technique has been implemented to carry out the Detection and Localization of

sound samples.

2. LITERATURE REVIEW

A lot of research has been work out on the locations of sounds but still there are being many problems to

provide pure sound.

Separation and localization of heart sound artefacts from respiratory data by a selection of eign triples in SSA,

Pulmonary auscultation is a vastly using diagnosis method over centuries. Together with the breath sound,

there is a possibility of hearing heart sound, since both sounds are originated from the human chest. For the

electronic analysis of the breath sound, separation of heart sound (HS) is important. In this paper, the

separation of HS is achieved by using a modified Singular Spectrum Analysis (SSA) method, by introducing a

provision for adaptive selection of SSA parameters [28].

Heart sound localization in chest sound using temporal fuzzy c- means classification, this paper present a new

technique to estimate locations of heart sound segments in chest sound using the temporal fuzzy c-means

(TFCM) algorithm. Therefore, reliable estimation of heart sound localizations within chest sound is a key

issue to enhance performance of heart sound cancellation algorithm [18].

Real time high accuracy 3-D PHAT based sound source localization using 4-microphone arrangement, this

paper investigates wideband sound source localization in outdoor cases. In such cases, time difference of

arrival (TDOA)-based methods are commonly used for 2-D and 3-D wideband sound source localization.

These methods have lower accuracy in comparison with direction of arrival-based approaches [16].

Detection of Heart Disease using Binary Particle Swarm Optimization, This paper introduces a

computer-aided diagnosis system of the heart valve disease using binary particle swarm optimization and

support vector machine, in conjunction with K-nearest neighbor and with leave-one-out cross-validation [17].

3. PROPOSED WORK

In our proposed method sound localization having different sound samples taken, they all are in active mode.

They produce noisy sound signals and object can identify the sound which they want to use.

3.1 Research Methodology

The object will detect and localize the sound. The object will follow adaptive learning technique; in the first

part wiener filter will reduce the amount of noise in a signal with SNR and minimize the mean square error

IJTC.O

RG

INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC)

ISSN-2455-099X,

Volume 2, Issue 6 June 2016.

IJTC201606002 www. ijtc.org 255

(M.S.E). In the second part wiener with bacterial foraging will work, According to previous values it will

make changes until effectiveness or variations occur in the values and calculate SNR and give MSE.

The methodology of above work is divided into three parts:

I. Sound inputs

II. Algorithm Steps

III. Results and Discussions

I. Sound Inputs

Fig a. Different sound Samples

II. Algorithm Steps

Step-1. Applying adaptive learning technique to detect the sound.

Step-2. We adapted the SSA algorithm and call it adaptive learning technique.

Step-3. According just before the evaluation consequences, the arrangement concert of the future technique

was most excellent in the midst of every one categorize.

Step-4. In this research, the concept of digital signal processing is used by which we detect the sound by

applying the adaptive learning technique.

Step-5. We can reducing the interference by applying wiener filter and wiener with bacterial foraging methods

then compare that signal spectrum with database stored spectrum.

III. Results and Discussions

In this work different impure sounds samples are used as input to proposed method. This proposed method

helps to identify & localize pure sound.

a. Sound Localization

Fig b. Sound localization from D

Fig.b represents that when all the sound samples are in active mode, object D can be localized.

IJTC.O

RG

INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC)

ISSN-2455-099X,

Volume 2, Issue 6 June 2016.

IJTC201606002 www. ijtc.org 256

Fig c. Sound localization from B

Fig.c depicted the sound samples of object B, when all the sounds are in active mode object B can be

localized.

b. Sound Detection by Wiener Method

Fig.d sound signal in time domain

Fig.d represents the noisy signal and cleaned signal graphs in terms of time domain .

Fig.e sound signal in frequency domain

Fig.e represents the signals in terms of frequency domain.

Fig. f signal to noise ratio

Fig. g represents the SNR and MSE values.

Fig. g In this SNR and MSE values are calculated.

IJTC.O

RG

INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC)

ISSN-2455-099X,

Volume 2, Issue 6 June 2016.

IJTC201606002 www. ijtc.org 257

c. Sound Effectiveness by Wiener with Bacterial Foraging Method

Fig.h this shows how bacterias are setting its own position.

Fig.i sound signal in time domain

Fig.d represents the noisy signal and cleaned signal graphs in terms of time domain

Fig.e sound signal in frequency domain

Fig.e represents the signals in terms of frequency domain.

Fig.f Shows signal to noise ratio

IJTC.O

RG

INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC)

ISSN-2455-099X,

Volume 2, Issue 6 June 2016.

IJTC201606002 www. ijtc.org 258

Fig.g shows variations in values.

Fig.g these figures shows the variations in values.This is the Bacterial foraging iteratively optimization

method ,This method will work according to previous values,it will make changes into previous values unitll

effectiveness occurs in the values and with this wiener will calculate SNR and MSE.

IJTC.O

RG

INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC)

ISSN-2455-099X,

Volume 2, Issue 6 June 2016.

IJTC201606002 www. ijtc.org 259

d. Comparison between SNR and MSE for both Wiener and Wiener with Bacterial Foraging Methods.

SNR Comparison

MSE Comparison

The above tables show the comparison between SNR and MSE.

From these comparisons we can see that in the bacterial foraging optimization method, The SNR increases

means in the signal effectiveness occur and MSE is decreases. So that we can reduce interferences and the

result reveal its superiority.

CONCLUSION

The proposed method has been designed for reducing the interference by applying Adaptive learning

technique, In this technique wiener filter and wiener with bacterial foraging optimization method used. This

method is very effective for noise reducing, this method will make changes in the noisy signal until variations

occurs and give effectiveness in signal.

sREFERENCES

[1] Davide Rocchisso, “Sound Processing”, In IEEE Computers in Cardiology, vol. 30, pp.367−372, 2003.

[2] MB Malarvili, I Kamarulafizam, S Hussain and D Helmi, “Heart Sound Segmentation Algorithm Based

on Instantaneous Energy of Electrocardiogram”, in IEEE Computers in Cardiology, vol. 30, pp. 327−313,

2003.

IJTC.O

RG

INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC)

ISSN-2455-099X,

Volume 2, Issue 6 June 2016.

IJTC201606002 www. ijtc.org 260

[3] Debbal SM and Reguig FB, “Heartbeat Sound Analysis with the wavelet Transform”, J Mech Med Biol,

vol. 4, no. 2, pp. 133-141, 2004.

[4] Paramythis and Stephanidis, “A Generic Adaptation Framework for Hypermedia Systems”, In Adaptable

and Adaptive Hypermedia Systems by Chen, S. Y., and Magoulas, G. D. (Eds.), Idea Group, Inc, forthcoming,

2004.

[5] C.Ahlstrom, O. Liljefeldt and P. Hult, “Heart sound cancellation from lung sound recordings using

recurrence time statistics and nonlinear prediction”, IEEE Signal Process. Lett, vol. 12, no. 12, pp. 812–815,

2005.

[6] P. Pertild, M. Parviaitnen, T. Korhonen, A. Visa, “Moving sound source localization in large areas”,

Proceedings of International Symposium on Intelligent Signal Processing and Communication Systems

December 13-16, 2005 Hong Kong.

[7] Yadollahi and Z. Moussavi, “A robust method for heart sounds localization using lung sounds entropy”,

IEEE Trans. Bio-Medical Engineering., vol. 53, no. 3, pp. 497–502, march 2006.

[8] Jonas Hörnstein, Manuel Lopes and Jose Santos-Victor, “Sound Localization for Humanoid Robots -

Building Audio-Motor Maps based on the HRTF”, in IEEE Computers in Cardiology, October 9 - 15, 2006,

Beijing, China.

[9] Mittal, U. and N. Phamdo, “Signal/noise KLT based approach for enhancing speech degraded by colored

noise,” IEEE Trans. Speech Audio Processing, Vol. 8, No. 2, 159–167, 2000.

[10] Zhan Huan Zhou, “Sound Localization and Virtual Auditory Space”, IEEE Institute of Biomaterials and

Biomedical Engineering Edward S. Roger Department of Electrical and Computer Engineering University of

Toronto, CANADA, 2008.

[11] Arijit biswas, “Bacterial Foraging Optimization Algorithm: Theoretical Foundations, Analysis, and

Applications”, Journal of Computational Information Systems vol. 52, 2009.

[12] Ali Pourmohammad and Seyed Mohammad Ahadi, “ TDE-ILD-Based 2D Half Plane Real Time High

Accuracy Sound Source Localization Using Only Two Microphones and Source Counting”, International

Conference on Electronics and Information Engineering, 2010.

[13] Fatih Ça lar, and Yucel ozbek, “Heart Sound Localization in Chest Sound Using Convex-Hull

Algorithm”, International Conference on Electrical and Electronics Engineering , vol. 58, no. 4, pp. 880–

883, 2011.

[14] Jinqun LIU, Haibin WANG, Wuchang LIU, “Autonomous Detection and Classi_cation of Congenital

Heart Disease Using an Auscultation Vest”, Journal of Computational Information Systems vol. 8, Issue 2, pp.

485–492, 2012.

[15] R. M. Potdar and Nishi Shahnaj Haider, “Removal of Heart Sound from Lung Sound using LabVIEW

8.6”, International Conference on Electronics and Information Engineering vol. 2, Issue 3, pp.1313-1319,

May-Jun 2012.

IJTC.O

RG

INTERNATIONAL JOURNAL OF TECHNOLOGY AND COMPUTING (IJTC)

ISSN-2455-099X,

Volume 2, Issue 6 June 2016.

IJTC201606002 www. ijtc.org 261

[16] Ali Pourmohammad and Seyed Mohammad Ahad, “Real Time High Accuracy 3-D PHAT-Based Sound

Source Localization Using a Simple 4-Microphone Arrangement”, IEEE SYSTEMS JOURNAL, vol. 6, no. 3,

September, 2012.

[17] Mona Nagy Elbedwehy, Hossam M. Zawbaa†, Neveen Ghali‡and Aboul Ella Hassanien , “Detection of

Heart Disease using Binary Particle Swarm Optimization”, IEEE Federated Conference on Computer Science

and Information Systems, 09/2012.

[18] Hamed Shamsi and I. Yucel Ozbek, “Heart Sound Localization in Chest Sound Using Temporal Fuzzy

C-Means Classification”, 34th Annual International Conference of the IEEE EMBS San Diego, California

USA, 28 August - 1 September, 2012.

[19] M. Pourazad, “Heart Sounds Reduction from lung sounds recordings applying signal and image

processing techniques in time-frequency domain”, M.Sc. thesis, Elect. Comput. Eng. Dept., Univ. Manitoba,

Winnipeg,, MB, Canada, 2004.

[20] Moussavi, D. Flores, and G. Thomas, “Heart sound cancellation based on multi-scale products and linear

prediction”, in IEEE Eng. Med. Biol. Mag, pp. 3840–3843, Sep. 2004.

[21] Keh-Shih Chuang , Hong-Long Tzeng, Sharon Chen, Jay Wu and Tzong Jer Chen, “Fuzzy c-means

clustering with spatial information for image segmentation”, Computerized Medical Imaging and Graphics ,

vol. 30, pp. 9-15, 2006.

[22] Z. Dokur, T. Ölmez, “Heart sound classification using wavelet transform and incremental self-organizing

map”, Digital Signal Processing, vol. 18, pp. 951–959, 2008.

[23] D. Štorek, “Graphical user interface for measuring the just noticeable difference in localization of virtual

acoustic sources”, Department of Radio electronics, Faculty of Electrical Engineering Czech Technical

University in Prague, 2011.

[24] Hiroshi Umezu and Kenji Suyama, “Multiple Sound Source Localization based on Local Existence

Property of Speech Signal”, World Academy of Science, Engineering and Technology , 2011.

[25] Alessio Brutti, Maurizio Omologo, “Compression between different sound source Localization

techniques based on a real Data Collection”, Fondazione Bruno Kessler – irst, via Sommarive, Trento, Italy,

2012.

[26] Tang H, Li T, “Segmentation of heart sounds based on dynamic clustering Biomedical Signal Processing

and Control” , International Conference on Electronics and Information Engineering, vol. 7, Issue 5, pp.

509-16, 2012.

[27] Tseng YL, Ko PY, Jaw FS, “Detection of the Third and Fourth Heart Sounds Using Hilbert-Huang

Transform”, BioMedical Engineering OnLine, 2012.

[28] Shinto Sebastian, Rathnakara S, “ Separation and Localisation of Heart Sound Artefacts from

Respiratory Data by Adaptive Selection of Eigen Triples in Singular Spectrum Analysis” ,IEEE 4th ICCCNT

– 2013 Tiruchengode, India, July 4-6, 2013.

IJTC.O

RG