Académique Documents
Professionnel Documents
Culture Documents
France Telecom R&D, 2 avenue Pierre Marzin, 22307 Lannion Cedex, France
jerome.daniel@francetelecom.com, stephanie.bertet@francetelecom.com
ABSTRACT
Higher Order Ambisonics (HOA) is a flexible approach for representing and rendering 3D sound fields.
Nevertheless, lack of effective microphone systems limited its use until recently. As a result of authors previous
work on the theory and design of spherical microphone arrays, a 4th order HOA microphone has been built,
measured and used for natural recording. The present paper first discusses theoretical aspects and physical
limitations proper to discrete, relatively small arrays (spatial aliasing, low-frequency estimation). Then it focuses on
the objective validation of such microphones. HOA directivities reconstructed from simulated and measured 3D
responses are compared to the expected spherical harmonics. Criteria like spatial correlation help characterizing the
encoding artifacts due to the model limitations and the prototype imperfections. Impacts on localisation criteria are
evaluated.
1.
INTRODUCTION
Moreau et al.
(1)
+1
= 2 cos m v .Sv
= Bmm
1
= Bmm
= 2 sin m v .Sv
v'
W
Y
Moreau et al.
(2)
D=
1 T,
C
N
(3)
of encoding
c n = 1 ... 2 cos( M n ) 2 sin( M n )
gains associated to loudspeaker azimuth n. For an Mth
order encoding, at least N=2M+2 loudspeakers are
recommended for a homogeneous reproduction.
Another way of understanding ambisonic spatialization
is to merge spatial encoding and decoding, so that
encoding directivities are themselves combined to form
directivities of virtual microphones associated to
Moreau et al.
m=0
basic
max rE
in-phase
m=1
m=2
+
-
m=3
+
+
-
m=4
+
+
( f ) =
c/ f
2sin ( /(2 M + 2) )
(4)
f lim
cM
cM
4 R( M + 1) sin ( /(2 M + 2) ) 2 R
(5)
Moreau et al.
p( kr , , ) =
r
E=
n =1
N
2
n n
G
n =1
2
n
, with ur = cos n
n
sin n
1
700 Hz
45
2
1300 Hz
30
3
1900 Hz
22.5
n = 0 =1
(7)
Bmn
=
( , ) dS ,
p( r , , ) Ymn
i jm (kr )
S
m
(8)
if jm (kr ) 0.
4
2500 Hz
18
2.2.
jm ( kr ) Bmn
Ymn
( , )
(6)
m =0
where
G u
Moreau et al.
( , ) =
Ymn
( 2m + 1) (2 0,n )
cos n
sin n
if = +1
if = 1
are
the
(m n)!
Pmn (sin )
(m + n)!
(9)
(ignored if n = 0)
associated
Legendre
pM (kr , , ) =
m=0
p (kr , , ) pM (kr , , ) dS
(12)
p (kr , , ) dS
e(kr ) =1 ( 2m +1) ( jm ( kr ) )
(13)
m =0
Bmn = S .Ymn ( S , S )
(10)
0 dB
(100%)
0
-10 dB
(10%)
10
11
12
13
14
-14 dB
(4%)
15
-20 dB
NMSE
(1%)
-30 dB
(0,1%)
-40 dB
(0,01%) 0
kr
9 10 11 12 13 14 15
Moreau et al.
(14)
3.
3.1.
p ( , )Y ( , ) dS
1
B00
p ( , )Y ( , ) dS
B111
1
00
p ( kR, , )
1
11
pR ( , )Ymn ( , ) dS
S
Equalization
Bmn
p ( kR, , )
(15)
kR
Moreau et al.
s ( kR, , ) =
im ( jm ( kR ) (1 )jm ( kR ))
m =0
Bmn
Ymn
(16)
( , )
n = 0 =1
=
Bmn
1
i m ( jm ( kR ) (1 )jm ( kR ) )
pYmn
(17)
sin d d
1
EQm ( kR) =
( jm ( kR ) (1 ) jm ( kR ) )
0.6
1
2
0.2
0
4
kR
0.2
0
4
kR
j ( kR )
n = 0 =1
(18)
Bmn Ymn ( , )
m =0
i m 1
( kR )2 h ( kR )
m
(19)
Bmn Ymn ( , )
n = 0 =1
Bmn
= i m +1 ( kR )2 hm ( kR )
0.4
m =0
0.6
jm ( kr ) hm ( kR ) hm ( kr )
1
0.8
0.4
0.8
(20)
Moreau et al.
5 5
6 6
77
88
rigid sphere
800
cardioid
j (kR ) + i ( 1) j (kR )
m
m
directional
microphones
Wm ( kR) =
m +1
2
( kR ) hm ( kR )
i
rigid sphere
Tb = s,
(23)
Amplitude
600
400
200
1
Y00
( 1 , 1 )
T=
1
Y00 ( Q , Q )
2
2
(22)
= Y diag [ Wm ( kR ) ],
kr
3.2.
Sq =
Wm ( kR )
m =0
n = 0 =1
Bmn
Ymn
(q , q ) (21)
YM1 0 ( 1 , 1 )
diag [ Wm ( kR ) ]
YM1 0 ( Q , Q )
0mM
1
1
1 1
1
, B11
, B11
, B10 ,K , BM
s = S1 , S2 ,K , SQ , b = B00
0 .
min s Tb 2 .
where
Moreau et al.
(T T) b LS = T s .
(24)
b LS = ( T T ) T s .
1
(25)
= diag
E s,
Wm ( kR )
(26)
E = ( Yt Y ) Yt .
1
( T T )
(T T)
Ts T T b LS
T s
E = ( Yt Y ) Yt
S 2 ( kR, 2 , 2 )
(28)
lead
this
one
(cf.
1
B00
B111
M
EQm (kR) =1/ Wm (kR )
SQ ( kR, Q , Q )
Matrixing
BM1 0
Equalization
where
( T T ) = T T
b LS
b b LS
b LS 2
1
2
(27)
c
c
,
=
2d
2R
(29)
(30)
Moreau et al.
4.
4.1.
Fm ( kR) =
Wm (kR )
Wm (kR ) + 2
(32)
EQm (kR ) =
Wm (kR )
2
Wm (kR ) + 2
(33)
10
1 e
Wm ( kR )
Fm (kR )
10
-1
10
-2
(31)
Wm (kR ) =
Wm (kR )
Wm (kR) = 10
Moreau et al.
1 11/ a 2
1+ 11/ a 2
(34)
a = Q 10 20 .
4.2.
i = 0, , 2M + 1
(35)
j = 0, , 2M + 1
Bmn
=
2 2 M +1 2 M +1
i sR (i , j )Ymn (i , j ) (36)
Wm ( kR ) 2 M + 2 i=0 j =0
1
Bmn
=
1
Wm (kR)
M +1 2 M +1
i =0
j =0
sR ( i , j )Ymn
( i , j ) (37)
1 t
Y Y ,
Q
(38)
Moreau et al.
1.5
2
2
1.5
1
3
0.5
(b) Hexahedron
3
2.5
1.5
1.2
1
2
0.8
0.6
1
0.5
0.4
0.2
(c) Octahedron
0
0.5
3
(a) Tetrahedron
(d) Icosahedron
3
0.7
0.6
0.5
0.4
0.3
0.2
0.1
(e) Dodecahedron
Figure 14 Matrices of orthonormality error for the five
regular polyhedrons. Gray level is related to error level.
0
1
2
3
0 1
0.2
0.18
5.
5.1.
0.16
0.14
0.12
0.1
0.08
0.06
5
0.04
0.02
0
Moreau et al.
100
10
10
9
-1
1
10
-2
10-3
|W(kR)|
10
-4
4
3
10-5
2
10
-6
10
-7
102
103
104
Frequency (Hz)
5.2.
(T T) =
(39)
Moreau et al.
Acquisition system is composed of a 32 channel homemade preamplifier, two MOTU 24I/O AD converters
plugged to a MOTU PCI424 card in a computer. The
software Plogue Bidule is used in combination with
VST plug-ins to make real-time recording and rendering
over a predefined configuration of loudspeakers or over
headphones.
Amplitude (dB)
40
20
0
Amplitude (dB)
-20
10
10
Frequency
10
40
20
6.1.
Measurements
-20
Amplitude (dB)
1
0
6.
10
10
10
10
Frequency
40
20
0
2
3
4
-20
10
10
Frequency
Moreau et al.
6.2.
Moreau et al.
90
120
1
0.8
60
0.6
150
30
0.4
0.2
180
330
210
300
240
270
f=
f=
f=
f=
1000 Hz
2000 Hz
4000 Hz
8000 Hz
R S meas , S model =
s meas , s model
s meas . s model
(40)
x, x , (41)
The top of Figure 23 shows that values are very close (if
not equal) to 1 especially at low and mid frequencies.
They decrease at high frequencies especially for a few
sensors. Although its hard to state a quality judgment
from absolute values, it worth noticing that the worst
values concern the sensors [#10,12,15,16] near the
sheath underneath the sphere, which causes a disturbing
and non modelled diffraction effect.
Moreau et al.
#10
2000
#12
4000
6000
#15
#16
8000
10000
12000
16000
#10,12,15,16
0.85
0.8
2000
4000
6000
#7,14
8000
10000
Frequency (Hz)
12000
14000
16000
Rsmeas ,smodel =
meas
,s
model
s meas . s model
(42)
freefield
s meas , s model
s
model 2
(43)
(44)
Moreau et al.
(45)
(46)
Moreau et al.
Figure 24 Reconstruction of a some specimens of HOA directivity at 5 kHz (top) and at 10 kHz (bottom). From left
to right: components B22+1, B21+1, B32+1, B44+1.
Figure 25 Cylindrical plots of reconstructed (meshed plots) and expected (in transparency) directivities for
incidences in the horizontal plane, and for horizontal components Bmm+1 with m=0 to 4 (from left to right). Vertical
axis is for frequency (in Hz).
Moreau et al.
1
1
b , b
b . b
Lb ,b =
3
4
0.8
0.7
(47)
5
-5
(48)
10
-10
10
0.9
10
10
Frequency (Hz)
10
10
0
-5
-10
10
Frequency (Hz)
10
Moreau et al.
80
60
0.8
40
0.7
10
10
20
0
10
(fMo+0dB@Simu32)
(fMo+6dB@Simu32)
(fMo+20dB@Simu32)
(iMe+20dB@Meas32)
80
60
-5
10
40
-10
10
Frequency (Hz)
10
20
0
10
10
Moreau et al.
7.
CONCLUSION
8.
ACKNOWLEDGEMENTS
9.
REFERENCES
[1].
[2].
[3].
[4].
[5].
[6].
[7].
[8].
Moreau et al.
[9].
[20].
[10].
[21].
[11].
[22].
[12].
[23].
[13].
[24].
[14].
[25].
[15].
[26].
[16].
[27].
[17].
[28].
[29].
[18].
[19].
http://www.research.att.com/~njas/packings/