SlideShare una empresa de Scribd logo
1 de 31
RAMIN ANUSHIRAVANI
ECE 551
FALL 2014
Sound Source Localization
with Microphone Arrays
Red box
ignore me, if you wish!
BOLD and GREEN
LOOK AT ME!
1
I need 7 minutes and 45 second to finish up 
Outline
 Background
 Application
 Human Sound Localization
 Time Delay Model
 Beamforming
 Signal Model
 Criteria
 Microphone Arrays
 Uniform Linear Array (ULA)
 Beampattern
 Spatial Aliasing
 Sound Source Localization
 Conventional Beamforming
 MUSIC algorithm
 Results
Where are you
technology? 
Where is my
hearing aid?
2
Application
3
 Why localizing a
sound source is
useful?
 Improving Speech
recognition
 Speech Enhancement
 Hearing aids
 Audio Surveillance
 Teleconferencing
 Spatial Audio
Background
How do we localize sound?
 Interaural time
difference - ITD
 Interaural level
difference - ILD
 Spectral information
Background
4
A Time Delay Model
 τ =
d sin(θ)
𝑐
 τ is the time delay
between the two sensors.
(.-)τ -> exp(jωτ)
Time Domain Frequency Domain
 Where c is the speed of
sound.
 Θ : Angle of arrival
d
θ
d sin(θ)
Ref
Background
Far Field Assumption
: Distance between the two sensors
5
 Delay the reference signal until the
sum of the energy of the two signals
is at its maximum (or undo the
delay from the delayed signal).
Example
𝑡𝑟𝑢𝑒 𝑑𝑒𝑙𝑎𝑦 = arg max(||𝑟𝑒𝑓(𝑛 + 𝑚)+delayed(n)||) = arg 𝑚𝑎𝑥 𝐶 𝑥𝑦 [m]
= 𝑚=0
𝑛
𝑑𝑒𝑙∗(𝑛)ref(n+m)
For x(n) =ref (n) , y(n) = delayed signal (n)
m (samples) -> τ (seconds) = d sin(θ)/ c
Background
6
Simulated
Signals
Direction
Of Arrival
(DOA)
Example
Background
Undo the delay 7
Beamforming
 Spatial Filtering
 Detect and estimate the
output of a sensor array
 Types
 Fixed vs Adaptive
Beamformer
 Delay and Sum (Filter and
Sum)
 MVDR (Capon)
 Narrowband vs
Broadband
 Beamformer
 Z(k) = 𝑾 𝑯Y(k)
Z(k)
Y1(k)
Y2(k)
Source
Recorded
at mics
Filters
8
Beamforming [S P. Boyd]
Signal Model
 𝒚 𝒏 𝒕 = 𝒈 𝒏 𝒕 ∗ 𝒔 𝒕 + 𝒗 𝒏 𝒕 = 𝒙 𝒏 𝒕 + 𝒗 𝒏 𝒕
y : received signal at each microphone
g : spatial response corresponding to
the source location
s : source signal
v: noise
 𝑌𝑛 𝑓 = 𝐺 𝑛 𝑓 𝑆 𝑓 + 𝑉𝑛 𝑓 = 𝑋 𝑛 𝑓 + 𝑉𝑛 𝑓 = d(f)𝑋1(𝑓)+v(f)
Where,
d: steering (direction) vector
X1: recorded signal at the first (ref) microphone
Beamforming
For simplicity we assume x
and v are uncorrelated.
9
Signal Model
Beamforming
𝒁 𝒇 = 𝒉 𝑯 𝒇 𝒚 𝒇
ℎ 𝐻
𝑓 [𝑑 𝑓 𝑋1 𝑓 + 𝑣(𝑓)] =
𝑋1,𝑓 𝑓 + 𝑉𝑟𝑛 𝑓
Where,
𝑋1,𝑓 𝑓 = ℎ 𝐻
𝑓 𝑑 𝑓 𝑋1 𝑓
𝑉𝑟𝑛 𝑓 = ℎ 𝐻 𝑓 𝑣(𝑓)
10
BeamformerRecorded Signal Beamformer
output
Beamforming Criteria
 Signal to Noise Ratio
 Array Gain
Output SNR over the input SNR.
 Noise Rejection
Amount of noise being rejected by
the beamformer.
 Beampattern
Represent the response of the beamformer to an arbitrary input signal as
a function of the steering vectors (microphone array impulse response).
Beamforming [Benesty et al.]
11
Microphone Array
Linear array2D array
Hexagon array
Spherical array
Ad-Hoc arrays
12
Microphone Array
Background
That parasite can
localize sound
better than me?!
WTB?
Sigh…
13
• Biologically inspired
Sonistic’s MEMs array
Ormia
ULA
 Collecting signal from a source with
microphone where the spacing
between each element is Δ.
 Signal received at the 𝑚 𝑡ℎ
microphone:
𝑥 𝑚 𝑡 = 𝑠𝑖(𝑡)
𝑖=1
𝑑
𝑒 𝑗(𝑚−1)μ𝑖 + 𝑛 𝑚(𝑡)
Where, 𝜇𝑖 = (-2π/λ) Δsin(θ𝑖) : Spatial frequency
𝑥 = 𝐴𝑠 𝑡 + 𝑛(𝑡)
Microphone Array [Bhuiy et al.]
14
Steering vectors based on a
time delay model for one
frequency (narrowband)
MIC
Angles
S
x
A
Beampattern
 𝑒−𝑖𝜔𝑡
= 𝑒−𝑖2π𝑓𝑡
=
𝑒−𝑖2π𝑘(𝑆)𝑑sin(θ)/𝑐𝑝
Where,
k : discrete frequency
S : Sampling Rate
p : Number of DFT samples.
 We can visualize the
steering vector by
plotting the steering
vectors over all
angles for an arbitrary
input for any number of
microphones.
Microphone Array
[𝒆−𝒊𝝎𝟎
⋯ ⋯ 𝒆−𝒊𝝎𝟎
] [𝒆−𝒊𝝎𝝉 𝟏 ⋯ ⋯ 𝒆−𝒊𝝎𝝉 𝒏]
Ref Delayed by 𝜏𝑖 span over
all angles and frequencies
Add and normalized by the number of
microphones for some arbitrary input.
Steering
Vectors
15
ITD Polar Pattern
Main lobe
Grating lobe
Spatial
Aliasing
16
1000 Hz 4000 Hz
2 Mic
22 cm
apart
2 Mic
2 cm
apart
Microphone Array
ITD Polar Pattern
17
1000 Hz 4000 Hz
4 Mic
22 cm
apart
10 Mic
2 cm
apart
Microphone Array
Spatial Aliasing
 Aliasing
“If the bandwidth of the signal exceeds half of the
sampling frequency, the spectral replicas overlap,
leading to a distortion in the observed spectrum.”
 Spatial Aliasing
“The spacing between adjacent microphone elements should
be less than half of the wavelength corresponding to the
highest temporal frequency of interest.”
Microphone Arrays [J. P. Dmochowski et al.]
18
If distance between adjacent
microphones > λ/2
Where λ = speed of sound/ frequency
Spatial Aliasing Frequency > 1600 Hz
Spatial Aliasing
Beamforming
Main lobe
Grating lobeOmni Response
19
Sound Source Localization
 Using beamforming and subspace methods to
localize a sound source.
 Delay and Sum - Classical Beamformer
 Capon – Minimum Variance Distortionless Response (MVDR)
beamformer
 Multiple signal classification (MUSIC) - A Subspace Algorithm
20
Delay and Sum
 𝒀 𝒕 = 𝑾 𝑯
𝑿 𝒕
 Output power:
𝑃 𝑤 = 1/𝑘 𝑘=1
𝐾
|𝑌(𝑡 𝑘 )2
| =
𝑤 𝐻
𝑅 𝑥𝑥 𝑤
Where,
𝑅 𝑥𝑥 = 𝑋(𝑡 𝑘 )𝑋 𝐻
(𝑡 𝑘 )
𝑤 = 𝐴(θ)
𝑷 𝑫𝑺 = 𝑨(𝜽)𝑹 𝒙𝒙 𝑨(𝜽) 𝑯
Source Localization [Richter]
Y(k)
X1(k)
X2(k)
Source
W
⋮
21
Minimum Variance Distortionless Response
 A delay and sum beamformer with an additional
constraint on the output power,
𝒘 𝑴𝑽𝑫𝑹 = 𝒂𝒓𝒈 𝒎𝒊𝒏 (𝒘 𝑯 𝑹 𝒙𝒙 𝒘) s.t. 𝒘 𝑯A(θ) = 1
Constrain the look direction gain to be, g(φ𝑖
) = 1 and
minimize the output power of the beamformer. φ
Source Localization [J. Capon]
22
Minimum Variance Distortionless Response
 𝑤 𝑀𝑉𝐷𝑅 = 𝑎𝑟𝑔 𝑚𝑖𝑛 (𝑤 𝐻
𝑅 𝑥𝑥 𝑤) s.t. 𝑤 𝐻
A(θ) = 1.
This lead to the Lagrangian,
 𝐽 𝑤, λ = 𝑤 𝐻
𝑅 𝑥𝑥w+λ(𝑤 𝐻
A(θ)-1)(A(θ)
𝐻
w-1)
After having lots of fun it turns out that,
𝒘 𝑴𝑽𝑫𝑹(𝜽) =
𝑹 𝒙𝒙
−𝟏
A(θ)
(A(θ)
𝑯
𝑹 𝒙𝒙
−𝟏
A(θ))
𝑷 𝑴𝑽𝑫𝑹 𝜽 =
𝟏
(A(θ)
𝑯
𝑹 𝒙𝒙
−𝟏
A(θ))
Source Localization
23
MUSIC
 𝑿 = 𝑨 𝜽 𝑺 + 𝑽
Where,
X: Collected samples (N samples)
A: Steering vectors
S: Source signals
V: Gaussian noise model with mean zero and variance 𝜎 𝑁
2.
𝑅 𝑥𝑥 =
1
N 𝑛=1
𝑁
𝑥 𝑛 𝑥 𝐻 𝑛 =
1
𝑁
𝑋𝑋 𝐻
𝐸{𝑅 𝑥𝑥 }= A(θ) 𝑅 𝑠𝑠 𝐴 𝐻
θ + 𝜎 𝑁
2
𝐼
Where,
𝑅 𝑠𝑠 =
1
𝑁
𝑆𝑆 𝐻
, signal covariance matrix.
Source Localization [Kawitkar]
24
MUSIC
 𝑹 𝒙𝒙 = [𝑼 𝒔 𝑼 𝒏 ]
𝝀 𝟏 ⋯ 𝟎
⋮ ⋱ ⋮
𝟎 ⋯ 𝝀 𝑴
𝑼 𝒔
𝑯
𝑼 𝒏
𝑯
Where,
𝑈𝑠 = signal subspace
𝑈 𝑛 = noise subspace
And λ1>λ2 > ⋯>λ 𝑀.
Span(𝑈𝑠 ) = span(A(θ))
MUSIC uses the orthogonally between the noise subspace and the
steering vectors.
𝑈 𝑛 ⊥ A(θ) => 𝑈 𝑛
𝐻A(θ)= 0.
Source Localization
You need to know how many sources you have.
25
MUSIC
 MUSIC Pseudo Spectrum is defined as,
𝑷 𝑴𝑼𝑺𝑰𝑪 𝜽 =
𝟏
||𝑼 𝒏
𝑯A(θ)||
=
𝟏
A(θ)
𝑯
𝑼 𝒏 𝑼 𝒏
𝑯A(θ)
 MUSIC Spatial Spectrum is defined as,
𝑷 𝑴𝑼𝑺𝑰𝑪 𝜽 =
𝟏
||𝑼 𝒏
𝑯A(θ)||
=
A(θ)A(θ)
𝑯
A(θ)
𝑯
𝑼 𝒏 𝑼 𝒏
𝑯A(θ)
=> MUSIC measures of the orthogonality between steering
vectors of the array and the noise subspace. The poles of this
expression points to the direction of the signal source.
Source Localization
26
Experiment Setup
≃ 8cm long
4 microphones
16 KHz sampling rate
PS EYE
(More like
PS ears :p
27
Results
Results
 1 source, @ 15 degree ish
 2 Microphones
28
1 Source, 2 Microphone
Results
• 2 sources at 15 and
-25 degree
• 4 Microphones
29
2 Source, 4 Microphone
Results
RMSE Delay and Sum MVDR MUSIC
Accuracy 2 sources 0.7035 0.1012 0.0851
Accuracy 4 sources 0.4992 0.4990 0.1903
30
• Which one is more robust to noise?
• Which one is more robust to reverberation?
• Which one give out a higher SNR for
enhancing speech?
• …etc.
Localization accuracy[Bhuiya et al.]
𝑅𝑀𝑆𝐸 =
1
𝑘
𝑘=1
𝐾
(Θ 𝑒𝑠𝑡_𝑘 − Θ 𝑡𝑟𝑢𝑒_𝑘)2
K : number of audio blocks (group of frames)
Citation
 Benesty, Jacek P. Dmochowski , Microphone Arrays: Fundamental
Concepts , Springer
 Bhuiya, F. Islam, M , Analysis of Direction of Arrival Techniques Using
Uniform Linear Array , International Journal of Computer Theory and
Engineering
 J. Capon. High-resolution frequency-wavenumber spectrum analysis.
Proc. IEEE, 57(8), 1408–1418 (1969).
 Kawitkar, R , Performance of Different Types of Array Structures Based
on Multiple Signal Classification (MUSIC) algorithm, International
Conference on MEMS NANO, and Smart Systems
 Richter, I , Spatial Filtering and DoA Estimation MVDR Beamformer
and MUSIC Algorithm , Sensor Array Signal Processing
 S P. Boyd, R , ROBUST MINIMUM VARIANCE BEAMFORMING
31

Más contenido relacionado

La actualidad más candente

Acoustic echo cancellation
Acoustic echo cancellationAcoustic echo cancellation
Acoustic echo cancellation
chintanajoshi
 

La actualidad más candente (20)

Optimum Receiver corrupted by AWGN Channel
Optimum Receiver corrupted by AWGN ChannelOptimum Receiver corrupted by AWGN Channel
Optimum Receiver corrupted by AWGN Channel
 
speech processing basics
speech processing basicsspeech processing basics
speech processing basics
 
IMPLEMENTATION OF UPSAMPLING & DOWNSAMPLING
IMPLEMENTATION OF UPSAMPLING & DOWNSAMPLINGIMPLEMENTATION OF UPSAMPLING & DOWNSAMPLING
IMPLEMENTATION OF UPSAMPLING & DOWNSAMPLING
 
Phase locked loop design
Phase locked loop designPhase locked loop design
Phase locked loop design
 
Dsp lecture vol 7 adaptive filter
Dsp lecture vol 7 adaptive filterDsp lecture vol 7 adaptive filter
Dsp lecture vol 7 adaptive filter
 
Lecture 5 description of electro acoustic characteristics of hearing instrume...
Lecture 5 description of electro acoustic characteristics of hearing instrume...Lecture 5 description of electro acoustic characteristics of hearing instrume...
Lecture 5 description of electro acoustic characteristics of hearing instrume...
 
NOISE IN Analog Communication Part-2 AM SYSTEMS.ppt
NOISE IN Analog Communication  Part-2 AM SYSTEMS.pptNOISE IN Analog Communication  Part-2 AM SYSTEMS.ppt
NOISE IN Analog Communication Part-2 AM SYSTEMS.ppt
 
Eye diagram
Eye diagramEye diagram
Eye diagram
 
Sound Source Localization with microphone arrays
Sound Source Localization with microphone arraysSound Source Localization with microphone arrays
Sound Source Localization with microphone arrays
 
Broadside array vs end fire array
Broadside array vs end fire arrayBroadside array vs end fire array
Broadside array vs end fire array
 
Detection techniques in coherent optical communications
Detection techniques in coherent optical communicationsDetection techniques in coherent optical communications
Detection techniques in coherent optical communications
 
Adaptive filter
Adaptive filterAdaptive filter
Adaptive filter
 
Antennas slideshare part 2
Antennas slideshare part 2Antennas slideshare part 2
Antennas slideshare part 2
 
Chap04
Chap04Chap04
Chap04
 
Acoustic echo cancellation
Acoustic echo cancellationAcoustic echo cancellation
Acoustic echo cancellation
 
Noise
NoiseNoise
Noise
 
M ary psk modulation
M ary psk modulationM ary psk modulation
M ary psk modulation
 
Difference between wavelet transform and fourier transform
Difference between wavelet transform and fourier transformDifference between wavelet transform and fourier transform
Difference between wavelet transform and fourier transform
 
Fourier transform
Fourier transformFourier transform
Fourier transform
 
Frequency transformation
Frequency transformationFrequency transformation
Frequency transformation
 

Destacado

Destacado (9)

FPGA Based Acoustic Source Localization Project
FPGA Based Acoustic Source Localization ProjectFPGA Based Acoustic Source Localization Project
FPGA Based Acoustic Source Localization Project
 
Sound source localization
Sound source localizationSound source localization
Sound source localization
 
recommender_systems
recommender_systemsrecommender_systems
recommender_systems
 
Beamforming antennas (1)
Beamforming antennas (1)Beamforming antennas (1)
Beamforming antennas (1)
 
Robust Adaptive Beamforming for Antenna Array
Robust Adaptive Beamforming for Antenna ArrayRobust Adaptive Beamforming for Antenna Array
Robust Adaptive Beamforming for Antenna Array
 
Food Preservation for Farming Communities in Nepal: A Low Cost Engineering So...
Food Preservation for Farming Communities in Nepal: A Low Cost Engineering So...Food Preservation for Farming Communities in Nepal: A Low Cost Engineering So...
Food Preservation for Farming Communities in Nepal: A Low Cost Engineering So...
 
Kinect Microphone Array case study
Kinect Microphone Array case studyKinect Microphone Array case study
Kinect Microphone Array case study
 
Beamforming for Antenna Array
Beamforming for Antenna ArrayBeamforming for Antenna Array
Beamforming for Antenna Array
 
Defying Nyquist in Analog to Digital Conversion
Defying Nyquist in Analog to Digital ConversionDefying Nyquist in Analog to Digital Conversion
Defying Nyquist in Analog to Digital Conversion
 

Similar a Beamforming and microphone arrays

Time reversed acoustics - Mathias Fink
Time reversed acoustics - Mathias FinkTime reversed acoustics - Mathias Fink
Time reversed acoustics - Mathias Fink
Sébastien Popoff
 
N 4-antennas propagation
N 4-antennas propagationN 4-antennas propagation
N 4-antennas propagation
15010192
 
Antennas propagation
Antennas propagationAntennas propagation
Antennas propagation
hrishi_tiwary
 
Slide Handouts with Notes
Slide Handouts with NotesSlide Handouts with Notes
Slide Handouts with Notes
Leon Nguyen
 
Scattering Model for Vegetation Canopies and Simulation of Satellite Navigati...
Scattering Model for Vegetation Canopies and Simulation of Satellite Navigati...Scattering Model for Vegetation Canopies and Simulation of Satellite Navigati...
Scattering Model for Vegetation Canopies and Simulation of Satellite Navigati...
Frank Schubert
 

Similar a Beamforming and microphone arrays (20)

Analog Communication fast mode material.pdf
Analog Communication fast mode material.pdfAnalog Communication fast mode material.pdf
Analog Communication fast mode material.pdf
 
Presentation WASPAA - A Polynomial Eigenvalue Decomposition MUSIC Approach fo...
Presentation WASPAA - A Polynomial Eigenvalue Decomposition MUSIC Approach fo...Presentation WASPAA - A Polynomial Eigenvalue Decomposition MUSIC Approach fo...
Presentation WASPAA - A Polynomial Eigenvalue Decomposition MUSIC Approach fo...
 
Noise Performance of CW system
Noise Performance of CW systemNoise Performance of CW system
Noise Performance of CW system
 
Time reversed acoustics - Mathias Fink
Time reversed acoustics - Mathias FinkTime reversed acoustics - Mathias Fink
Time reversed acoustics - Mathias Fink
 
Defense - Sound space rendering based on the virtual Sound space rendering ba...
Defense - Sound space rendering based on the virtual Sound space rendering ba...Defense - Sound space rendering based on the virtual Sound space rendering ba...
Defense - Sound space rendering based on the virtual Sound space rendering ba...
 
N 4-antennas propagation
N 4-antennas propagationN 4-antennas propagation
N 4-antennas propagation
 
Antennas propagation
Antennas propagationAntennas propagation
Antennas propagation
 
3-Basic Acoustics.ppt
3-Basic Acoustics.ppt3-Basic Acoustics.ppt
3-Basic Acoustics.ppt
 
Slide Handouts with Notes
Slide Handouts with NotesSlide Handouts with Notes
Slide Handouts with Notes
 
Vidyalankar final-essentials of communication systems
Vidyalankar final-essentials of communication systemsVidyalankar final-essentials of communication systems
Vidyalankar final-essentials of communication systems
 
Ch7 noise variation of different modulation scheme pg 63
Ch7 noise variation of different modulation scheme pg 63Ch7 noise variation of different modulation scheme pg 63
Ch7 noise variation of different modulation scheme pg 63
 
_Pulse-Modulation-Systems.pdf
_Pulse-Modulation-Systems.pdf_Pulse-Modulation-Systems.pdf
_Pulse-Modulation-Systems.pdf
 
Chapter#5
Chapter#5Chapter#5
Chapter#5
 
An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...
An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...
An Overview of Array Signal Processing and Beam Forming TechniquesAn Overview...
 
311 communication system concepts
311 communication system concepts311 communication system concepts
311 communication system concepts
 
Audio Signal Processing
Audio Signal Processing Audio Signal Processing
Audio Signal Processing
 
example based audio editing
example based audio editingexample based audio editing
example based audio editing
 
Scattering Model for Vegetation Canopies and Simulation of Satellite Navigati...
Scattering Model for Vegetation Canopies and Simulation of Satellite Navigati...Scattering Model for Vegetation Canopies and Simulation of Satellite Navigati...
Scattering Model for Vegetation Canopies and Simulation of Satellite Navigati...
 
3 D Sound
3 D Sound3 D Sound
3 D Sound
 
Signal Processing
Signal ProcessingSignal Processing
Signal Processing
 

Más de Ramin Anushiravani (6)

Techfest jan17
Techfest jan17Techfest jan17
Techfest jan17
 
3D audio
3D audio3D audio
3D audio
 
3D Audio playback for single channel audio using visual cues
3D Audio playback for single channel audio using visual cues3D Audio playback for single channel audio using visual cues
3D Audio playback for single channel audio using visual cues
 
A computer vision approach to speech enhancement
A computer vision approach to speech enhancementA computer vision approach to speech enhancement
A computer vision approach to speech enhancement
 
Poster cs543
Poster cs543Poster cs543
Poster cs543
 
3D Spatial Response
3D Spatial Response3D Spatial Response
3D Spatial Response
 

Último

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
dharasingh5698
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 

Último (20)

Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 

Beamforming and microphone arrays

  • 1. RAMIN ANUSHIRAVANI ECE 551 FALL 2014 Sound Source Localization with Microphone Arrays Red box ignore me, if you wish! BOLD and GREEN LOOK AT ME! 1 I need 7 minutes and 45 second to finish up 
  • 2. Outline  Background  Application  Human Sound Localization  Time Delay Model  Beamforming  Signal Model  Criteria  Microphone Arrays  Uniform Linear Array (ULA)  Beampattern  Spatial Aliasing  Sound Source Localization  Conventional Beamforming  MUSIC algorithm  Results Where are you technology?  Where is my hearing aid? 2
  • 3. Application 3  Why localizing a sound source is useful?  Improving Speech recognition  Speech Enhancement  Hearing aids  Audio Surveillance  Teleconferencing  Spatial Audio Background
  • 4. How do we localize sound?  Interaural time difference - ITD  Interaural level difference - ILD  Spectral information Background 4
  • 5. A Time Delay Model  τ = d sin(θ) 𝑐  τ is the time delay between the two sensors. (.-)τ -> exp(jωτ) Time Domain Frequency Domain  Where c is the speed of sound.  Θ : Angle of arrival d θ d sin(θ) Ref Background Far Field Assumption : Distance between the two sensors 5
  • 6.  Delay the reference signal until the sum of the energy of the two signals is at its maximum (or undo the delay from the delayed signal). Example 𝑡𝑟𝑢𝑒 𝑑𝑒𝑙𝑎𝑦 = arg max(||𝑟𝑒𝑓(𝑛 + 𝑚)+delayed(n)||) = arg 𝑚𝑎𝑥 𝐶 𝑥𝑦 [m] = 𝑚=0 𝑛 𝑑𝑒𝑙∗(𝑛)ref(n+m) For x(n) =ref (n) , y(n) = delayed signal (n) m (samples) -> τ (seconds) = d sin(θ)/ c Background 6
  • 8. Beamforming  Spatial Filtering  Detect and estimate the output of a sensor array  Types  Fixed vs Adaptive Beamformer  Delay and Sum (Filter and Sum)  MVDR (Capon)  Narrowband vs Broadband  Beamformer  Z(k) = 𝑾 𝑯Y(k) Z(k) Y1(k) Y2(k) Source Recorded at mics Filters 8 Beamforming [S P. Boyd]
  • 9. Signal Model  𝒚 𝒏 𝒕 = 𝒈 𝒏 𝒕 ∗ 𝒔 𝒕 + 𝒗 𝒏 𝒕 = 𝒙 𝒏 𝒕 + 𝒗 𝒏 𝒕 y : received signal at each microphone g : spatial response corresponding to the source location s : source signal v: noise  𝑌𝑛 𝑓 = 𝐺 𝑛 𝑓 𝑆 𝑓 + 𝑉𝑛 𝑓 = 𝑋 𝑛 𝑓 + 𝑉𝑛 𝑓 = d(f)𝑋1(𝑓)+v(f) Where, d: steering (direction) vector X1: recorded signal at the first (ref) microphone Beamforming For simplicity we assume x and v are uncorrelated. 9
  • 10. Signal Model Beamforming 𝒁 𝒇 = 𝒉 𝑯 𝒇 𝒚 𝒇 ℎ 𝐻 𝑓 [𝑑 𝑓 𝑋1 𝑓 + 𝑣(𝑓)] = 𝑋1,𝑓 𝑓 + 𝑉𝑟𝑛 𝑓 Where, 𝑋1,𝑓 𝑓 = ℎ 𝐻 𝑓 𝑑 𝑓 𝑋1 𝑓 𝑉𝑟𝑛 𝑓 = ℎ 𝐻 𝑓 𝑣(𝑓) 10 BeamformerRecorded Signal Beamformer output
  • 11. Beamforming Criteria  Signal to Noise Ratio  Array Gain Output SNR over the input SNR.  Noise Rejection Amount of noise being rejected by the beamformer.  Beampattern Represent the response of the beamformer to an arbitrary input signal as a function of the steering vectors (microphone array impulse response). Beamforming [Benesty et al.] 11
  • 12. Microphone Array Linear array2D array Hexagon array Spherical array Ad-Hoc arrays 12
  • 13. Microphone Array Background That parasite can localize sound better than me?! WTB? Sigh… 13 • Biologically inspired Sonistic’s MEMs array Ormia
  • 14. ULA  Collecting signal from a source with microphone where the spacing between each element is Δ.  Signal received at the 𝑚 𝑡ℎ microphone: 𝑥 𝑚 𝑡 = 𝑠𝑖(𝑡) 𝑖=1 𝑑 𝑒 𝑗(𝑚−1)μ𝑖 + 𝑛 𝑚(𝑡) Where, 𝜇𝑖 = (-2π/λ) Δsin(θ𝑖) : Spatial frequency 𝑥 = 𝐴𝑠 𝑡 + 𝑛(𝑡) Microphone Array [Bhuiy et al.] 14 Steering vectors based on a time delay model for one frequency (narrowband) MIC Angles S x A
  • 15. Beampattern  𝑒−𝑖𝜔𝑡 = 𝑒−𝑖2π𝑓𝑡 = 𝑒−𝑖2π𝑘(𝑆)𝑑sin(θ)/𝑐𝑝 Where, k : discrete frequency S : Sampling Rate p : Number of DFT samples.  We can visualize the steering vector by plotting the steering vectors over all angles for an arbitrary input for any number of microphones. Microphone Array [𝒆−𝒊𝝎𝟎 ⋯ ⋯ 𝒆−𝒊𝝎𝟎 ] [𝒆−𝒊𝝎𝝉 𝟏 ⋯ ⋯ 𝒆−𝒊𝝎𝝉 𝒏] Ref Delayed by 𝜏𝑖 span over all angles and frequencies Add and normalized by the number of microphones for some arbitrary input. Steering Vectors 15
  • 16. ITD Polar Pattern Main lobe Grating lobe Spatial Aliasing 16 1000 Hz 4000 Hz 2 Mic 22 cm apart 2 Mic 2 cm apart Microphone Array
  • 17. ITD Polar Pattern 17 1000 Hz 4000 Hz 4 Mic 22 cm apart 10 Mic 2 cm apart Microphone Array
  • 18. Spatial Aliasing  Aliasing “If the bandwidth of the signal exceeds half of the sampling frequency, the spectral replicas overlap, leading to a distortion in the observed spectrum.”  Spatial Aliasing “The spacing between adjacent microphone elements should be less than half of the wavelength corresponding to the highest temporal frequency of interest.” Microphone Arrays [J. P. Dmochowski et al.] 18
  • 19. If distance between adjacent microphones > λ/2 Where λ = speed of sound/ frequency Spatial Aliasing Frequency > 1600 Hz Spatial Aliasing Beamforming Main lobe Grating lobeOmni Response 19
  • 20. Sound Source Localization  Using beamforming and subspace methods to localize a sound source.  Delay and Sum - Classical Beamformer  Capon – Minimum Variance Distortionless Response (MVDR) beamformer  Multiple signal classification (MUSIC) - A Subspace Algorithm 20
  • 21. Delay and Sum  𝒀 𝒕 = 𝑾 𝑯 𝑿 𝒕  Output power: 𝑃 𝑤 = 1/𝑘 𝑘=1 𝐾 |𝑌(𝑡 𝑘 )2 | = 𝑤 𝐻 𝑅 𝑥𝑥 𝑤 Where, 𝑅 𝑥𝑥 = 𝑋(𝑡 𝑘 )𝑋 𝐻 (𝑡 𝑘 ) 𝑤 = 𝐴(θ) 𝑷 𝑫𝑺 = 𝑨(𝜽)𝑹 𝒙𝒙 𝑨(𝜽) 𝑯 Source Localization [Richter] Y(k) X1(k) X2(k) Source W ⋮ 21
  • 22. Minimum Variance Distortionless Response  A delay and sum beamformer with an additional constraint on the output power, 𝒘 𝑴𝑽𝑫𝑹 = 𝒂𝒓𝒈 𝒎𝒊𝒏 (𝒘 𝑯 𝑹 𝒙𝒙 𝒘) s.t. 𝒘 𝑯A(θ) = 1 Constrain the look direction gain to be, g(φ𝑖 ) = 1 and minimize the output power of the beamformer. φ Source Localization [J. Capon] 22
  • 23. Minimum Variance Distortionless Response  𝑤 𝑀𝑉𝐷𝑅 = 𝑎𝑟𝑔 𝑚𝑖𝑛 (𝑤 𝐻 𝑅 𝑥𝑥 𝑤) s.t. 𝑤 𝐻 A(θ) = 1. This lead to the Lagrangian,  𝐽 𝑤, λ = 𝑤 𝐻 𝑅 𝑥𝑥w+λ(𝑤 𝐻 A(θ)-1)(A(θ) 𝐻 w-1) After having lots of fun it turns out that, 𝒘 𝑴𝑽𝑫𝑹(𝜽) = 𝑹 𝒙𝒙 −𝟏 A(θ) (A(θ) 𝑯 𝑹 𝒙𝒙 −𝟏 A(θ)) 𝑷 𝑴𝑽𝑫𝑹 𝜽 = 𝟏 (A(θ) 𝑯 𝑹 𝒙𝒙 −𝟏 A(θ)) Source Localization 23
  • 24. MUSIC  𝑿 = 𝑨 𝜽 𝑺 + 𝑽 Where, X: Collected samples (N samples) A: Steering vectors S: Source signals V: Gaussian noise model with mean zero and variance 𝜎 𝑁 2. 𝑅 𝑥𝑥 = 1 N 𝑛=1 𝑁 𝑥 𝑛 𝑥 𝐻 𝑛 = 1 𝑁 𝑋𝑋 𝐻 𝐸{𝑅 𝑥𝑥 }= A(θ) 𝑅 𝑠𝑠 𝐴 𝐻 θ + 𝜎 𝑁 2 𝐼 Where, 𝑅 𝑠𝑠 = 1 𝑁 𝑆𝑆 𝐻 , signal covariance matrix. Source Localization [Kawitkar] 24
  • 25. MUSIC  𝑹 𝒙𝒙 = [𝑼 𝒔 𝑼 𝒏 ] 𝝀 𝟏 ⋯ 𝟎 ⋮ ⋱ ⋮ 𝟎 ⋯ 𝝀 𝑴 𝑼 𝒔 𝑯 𝑼 𝒏 𝑯 Where, 𝑈𝑠 = signal subspace 𝑈 𝑛 = noise subspace And λ1>λ2 > ⋯>λ 𝑀. Span(𝑈𝑠 ) = span(A(θ)) MUSIC uses the orthogonally between the noise subspace and the steering vectors. 𝑈 𝑛 ⊥ A(θ) => 𝑈 𝑛 𝐻A(θ)= 0. Source Localization You need to know how many sources you have. 25
  • 26. MUSIC  MUSIC Pseudo Spectrum is defined as, 𝑷 𝑴𝑼𝑺𝑰𝑪 𝜽 = 𝟏 ||𝑼 𝒏 𝑯A(θ)|| = 𝟏 A(θ) 𝑯 𝑼 𝒏 𝑼 𝒏 𝑯A(θ)  MUSIC Spatial Spectrum is defined as, 𝑷 𝑴𝑼𝑺𝑰𝑪 𝜽 = 𝟏 ||𝑼 𝒏 𝑯A(θ)|| = A(θ)A(θ) 𝑯 A(θ) 𝑯 𝑼 𝒏 𝑼 𝒏 𝑯A(θ) => MUSIC measures of the orthogonality between steering vectors of the array and the noise subspace. The poles of this expression points to the direction of the signal source. Source Localization 26
  • 27. Experiment Setup ≃ 8cm long 4 microphones 16 KHz sampling rate PS EYE (More like PS ears :p 27 Results
  • 28. Results  1 source, @ 15 degree ish  2 Microphones 28 1 Source, 2 Microphone
  • 29. Results • 2 sources at 15 and -25 degree • 4 Microphones 29 2 Source, 4 Microphone
  • 30. Results RMSE Delay and Sum MVDR MUSIC Accuracy 2 sources 0.7035 0.1012 0.0851 Accuracy 4 sources 0.4992 0.4990 0.1903 30 • Which one is more robust to noise? • Which one is more robust to reverberation? • Which one give out a higher SNR for enhancing speech? • …etc. Localization accuracy[Bhuiya et al.] 𝑅𝑀𝑆𝐸 = 1 𝑘 𝑘=1 𝐾 (Θ 𝑒𝑠𝑡_𝑘 − Θ 𝑡𝑟𝑢𝑒_𝑘)2 K : number of audio blocks (group of frames)
  • 31. Citation  Benesty, Jacek P. Dmochowski , Microphone Arrays: Fundamental Concepts , Springer  Bhuiya, F. Islam, M , Analysis of Direction of Arrival Techniques Using Uniform Linear Array , International Journal of Computer Theory and Engineering  J. Capon. High-resolution frequency-wavenumber spectrum analysis. Proc. IEEE, 57(8), 1408–1418 (1969).  Kawitkar, R , Performance of Different Types of Array Structures Based on Multiple Signal Classification (MUSIC) algorithm, International Conference on MEMS NANO, and Smart Systems  Richter, I , Spatial Filtering and DoA Estimation MVDR Beamformer and MUSIC Algorithm , Sensor Array Signal Processing  S P. Boyd, R , ROBUST MINIMUM VARIANCE BEAMFORMING 31

Notas del editor

  1. Need at least two microphone to localize sounds, two ears 22 cm apart
  2. Simulate your head with two microphones and no head!
  3. appendix
  4. appendix
  5. Narrowband beamformer
  6. fixed. Brain and detection. Remove
  7. We’re trying to find some filters that would increase this SNR say in an speech enhancement application. Measure of beamformer goodness.
  8. Series of Microphone . Geometry is given? Derive the spatial responses for the microphone and do fun application like speech enhancement, noise reduction, dereverberation, sound ,signal estimation, source localization, …. Beamforming is a strong tool in array signal processing
  9. The idea is to „steer‟ the array in one direction at a time and measure the output power. The steering direction which coincides with the DOA of a signal and result in a maximum output power yields the DOA estimates. You can also calibrate a microphone array by playing MLS sequence.
  10. Formally, the beampattern is defined as the ratio of the variance of the beamformer output when the source impinges with a steering vector d(f) to the variance of the desired signal x1(t).
  11. Too many, octave frequency 1 2 4 8 kHZ Produce by : Delay and sum beamforming Of two mics, can’t distinguish back and front since the time delays are the same.
  12. Too many, octave frequency 1 2 4 8 kHZ Produce by : Delay and sum beamforming Of two mics, can’t distinguish back and front since the time delays are the same.
  13. J. P. Dmochowski and J. Benesty In order to reconstruct a spatial sinusoid from a set of uniformly-spaced discrete spatial samples, the spatial sampling period must be less than half of the sinusoid’s wavelength.
  14. Super low frequencies: omni response. Narrow beam at zero degrees => good But high energy at all these other angles, cannot distinguish the difference. Of course we almost never hear one frequency, but a wide range of frequencies. So , conclusion here is that there is more to sound localaization besides ITD.
  15. Also Barlett method. A(θ) is defined as the steering vector with a scanning angle θ. The idea is to scan across the angular region of interest. In speech enhancement you can fix the angle Form a beam toward an angle and capture those desired signals, in sound source localization, you want to scan over all angles and Look for when the power is maximum.
  16. Take the gradient of J with respect to h and lambda and use the constraint on power. MVDR requires a good estimation of the covariance matrix. There has to be at least as many observation as sensors in the array.
  17. Eigenvalue decomposition For true covariance matrix this is approximately zero. Corresponds to the smallest eigenvalues.
  18. We need to have one noise subspace at least, and that requires having one more sensor than the number of location, that we can resolve M-1 sources with M sensors.
  19. For this first experiment I used the far most right and left channel. I recorded at about 75 degrees. Some problems with using all4 mics,
  20. Speech enhancement and reverberant would be better with music, much narrower beam.
  21. I used 4 microphones, two sources, one speech one was a loud fan. Only music was able to identify both sources others smears the two sources into one. One is at -25 and the other 15 degrees.
  22. I’m only looking at angle location and how narrow the beam is one should also look at noise and reverberation when localizing sound sources