Académique Documents
Professionnel Documents
Culture Documents
Richard M. Stern
Department of Electrical and Computer Engineering and School of Computer Science Carnegie Mellon University Pittsburgh, Pennsylvania 15213 Telephone: (412) 268-2535 FAX: (412) 268-3890 INTERNET: rms@cs.cmu.edu
January 15, 2004
INTRODUCTION
Carnegie Mellon
Slide 2
Carnegie Mellon
Slide 3
Amplitude
5000
4000
Frequency
3000
2000
1000
Carnegie Mellon
Slide 5
Original speech: Speech with 75-Hz excitation: Speech with 150-Hz excitation: Speech with noise excitation:
Carnegie Mellon
Slide 6
Carnegie Mellon
Slide 7
Carnegie Mellon
Slide 9
Carnegie Mellon
Slide 10
n Define:
r density of air
c velocity of sound
Slide 11
p u =r x t
and
u 1 p = 2 x rc t
where
rc 2 = gP
Carnegie Mellon
Slide 12
2u
1 2u = 2 2 x 2 c t
n Comments:
Slide 14
where
P(x, w ) = rc U + e - jwx / c - U - e + jwx / c U(x, w ) = U + e - jwx / c + U - e + jwx / c
Carnegie Mellon
Slide 15
x = -l Glottis
x=0 Lips
n Boundary conditions:
p(0, t) = 0 U + = U u(-l,t) = U 0 cos(w 0t) U + = U0 2 cos(w 0 l /c)
Carnegie Mellon
Slide 16
n Comments:
Resonant frequencies occur when w 0 l /c = (p /2),(3p /2),(5 p /2)K
Carnegie Mellon
Slide 17 ECE and SCS Robust Speech Group
45
40
35
Amplitude
30
25
20
15
10
100
200
300
400
n Comments:
Frequency
500
600
700
800
900
1000
UL (w )/U G (w ) is of the form A /cos(wl /c) With nonideal absorptive walls, response is not infinite at resonant frequencies
Carnegie Mellon
Slide 18
x = -l Glottis
x=0 Lips
Carnegie Mellon
Slide 19
Carnegie Mellon
Slide 20
5000
4000
Frequency
3000
2000
1000
Carnegie Mellon
Slide 21
Carnegie Mellon
Slide 22
Carnegie Mellon
Slide 23
SUMMARY
n We have discussed very superficially the production of speech
sounds
Source-filter model Vocal tract transfer functions Impact on perception Some attributes of acoustical analysis
Carnegie Mellon
Slide 24