Académique Documents
Professionnel Documents
Culture Documents
M. Sanchez-Gutierrez,
sinc(z)
Speech
Research
Recognition"
E.
Center
M. Albornoz,
for
6th
Springer,
Signals,
Mexican
F.pp.
Systems
M.
Conference
311-320,
Martinez,
and jun,
Computational
on
H. Pattern
2014.
L. Rufiner
Recognition
&
Intelligence
J. Goddard;
(MCPR2014),
(fich.unl.edu.ar/sinc)
"Deep Learning
LNCS -for
Departamento de
Autonoma Metropolitana
Ingenieria
Electrica,
Universidad
(Mexico)
Centro de Investigacion SINC(i), Universidad Nacional del
Litoral - CONICET
(Argentina)
edmax86@gmail.com
2
1 Introduction
Emotional
M. Sanchez-Gutierrez,
sinc(z)
Speech
Research
Recognition"
E.
Center
M. Albornoz,
for
6th
Springer,
Signals,
Mexican
F.pp.
Systems
M.
Conference
311-320,
Martinez,
and jun,
Computational
on
H. Pattern
2014.
L. Rufiner
Recognition
&
Intelligence
J. Goddard;
(MCPR2014),
(fich.unl.edu.ar/sinc)
"Deep Learning
LNCS -for
Fig. 1. Restricted
Boltzmann Machine.
2 Deep Learning
M. Sanchez-Gutierrez,
Recognition"
6thsinc(z)
Mexican
E.
Research
M.
Conference
Albornoz,
Centeron
for
F. Pattern
M.
Signals,
Martinez,
Recognition
Systems
H. L.and
(MCPR2014),
Rufiner
Computational
& J. Goddard;
LNCS
Intelligence
- Springer,
"Deep (fich.unl.edu.ar/sinc)
Learning
pp. 311-320,
for Emotional
jun, 2014.
Speech
(1)P(v, exp
E(v,h)
(
h)
where W is the symmetric
matrix of the weights
where
Z, known
as the partition
connecting
the visible
and Zhidden units, and a, 2b
are
bias vectors
on by:
the connections of bias unit
- )to
function,
defined
the visibleis and
hidden
Z layer, respectively.
=
^exp
The joint probability, p(v, h), for the RBM
mentioned above, assigns a probability to every
configuration (v, h) of visible and hidden vectors
(3)
using the energy function:
v,h
The probability assigned by the network to a
visible vector v is:
E(v
(v )
=Z
ex -E ( v , h )
Z
h)
(4)
M.
Recognition"
Sanchez-Gutierrez,
6thsinc(/')
Mexican
E.
Research
M.
Conference
Albornoz,
Centeron
for
F. Pattern
M.
Signals,
Martinez,
Recognition
Systems
H. L.and
(MCPR2014),
Rufiner
Computational
& J. Goddard;
LNCS
Intelligence
- Springer,
"Deep (fich.unl.edu.ar/sinc)
Learning
pp. 311-320,
for Emotional
jun, 2014.
Speech
energy function
E i is altered v to reflect this
E(v,
(
modification
ij '
i
i
a
)2
(c.f. [8]) as:
h)
7
i
With this modified
energy function, the
)
conditional probabilities
2a? are now given by:
TTA
p j= =
(h
1|v) a(
i
a?
ij
hw -b h
+j
b )
j
where: N(|^,a ) denotes the Gaussian probability
2
density function with mean ^ and variance a .
2
Emotional
M. Sanchez-Gutierrez,
sinc(z)
Speech
Research
Recognition"
E.
Center
M. Albornoz,
for
6th
Springer,
Signals,
Mexican
F.pp.
Systems
M.
Conference
311-320,
Martinez,
and jun,
Computational
on
H. Pattern
2014.
L. Rufiner
Recognition
&
Intelligence
J. Goddard;
(MCPR2014),
(fich.unl.edu.ar/sinc)
"Deep Learning
LNCS -for
speech
Emotions
Neutral, panic,
anxiety, hot
anger, cold
anger, despair,
sadness, elation,
joy, interest,
boredom,
shame,
pride,
Anger, joy,
Berlin
Public and German 535
sadness,
fear,
emotional
utterances
free
disgust,
database
Danish
Public with Danish 260
Anger, joy,
emotional
license fee
utterances sadness,
Natural
Private
Mandari 388
Anger, neutral
Anger, joy,
ESMBS
Private
Mandari 720
n
utterances sadness, disgust,
fear, surprise
disgust,
INTERFACE Commercia English, 186, 190, Anger,
Slovenia 184, 175 fear, joy,
lly
n,
utterances surprise,
available Spanish, respectively sadness, slow
French
neutral, fast
Emotional
M. Sanchez-Gutierrez,
sinc(z)
Speech
Research
Recognition"
E.
Center
M. Albornoz,
for
6th
Springer,
Signals,
Mexican
F.pp.
Systems
M.
Conference
311-320,
Martinez,
and jun,
Computational
on
H. Pattern
2014.
L. Rufiner
Recognition
&
Intelligence
J. Goddard;
(MCPR2014),
(fich.unl.edu.ar/sinc)
"Deep Learning
LNCS -for
In a subjective test of the database with 16 nonprofessional listeners (UPC engineering students), it
was found that over 80% of the sentences were
correctly classified initially, and given a second
choice, more than 90%. Each expression was
correctly classified by at least half of the listeners.
It is interesting to note that errors were generally
committed on the isolated words or short phrases,
while all sentences and longer texts were classified
correctly initially by all listeners. This subjective
test is useful because we can compare it to the
results obtained automatically by classifiers.
Emotional
M. Sanchez-Gutierrez,
sinc(z)
Speech
Research
Recognition"
E.
Center
M. Albornoz,
for
6th
Springer,
Signals,
Mexican
F.pp.
Systems
M.
Conference
311-320,
Martinez,
and jun,
Computational
on
H. Pattern
2014.
L. Rufiner
Recognition
&
Intelligence
J. Goddard;
(MCPR2014),
(fich.unl.edu.ar/sinc)
"Deep Learning
LNCS -for
Emotional
M. Sanchez-Gutierrez,
sinc(z)
Speech
Research
Recognition"
E.
Center
M. Albornoz,
for
6th
Springer,
Signals,
Mexican
F.pp.
Systems
M.
Conference
311-320,
Martinez,
and jun,
Computational
on
H. Pattern
2014.
L. Rufiner
Recognition
&
Intelligence
J. Goddard;
(MCPR2014),
(fich.unl.edu.ar/sinc)
"Deep Learning
LNCS -for
Emotional
M. Sanchez-Gutierrez,
sinc(z)
Speech
Research
Recognition"
E.
Center
M. Albornoz,
for
6th
Springer,
Signals,
Mexican
F.pp.
Systems
M.
Conference
311-320,
Martinez,
and jun,
Computational
on
H. Pattern
2014.
L. Rufiner
Recognition
&
Intelligence
J. Goddard;
(MCPR2014),
(fich.unl.edu.ar/sinc)
"Deep Learning
LNCS -for
6 Conclusions
References
Emotional
M. Sanchez-Gutierrez,
sinc(z)
Speech
Research
Recognition"
E.
Center
M. Albornoz,
for
6th
Springer,
Signals,
Mexican
F.pp.
Systems
M.
Conference
311-320,
Martinez,
and jun,
Computational
on
H. Pattern
2014.
L. Rufiner
Recognition
&
Intelligence
J. Goddard;
(MCPR2014),
(fich.unl.edu.ar/sinc)
"Deep Learning
LNCS -for