Vous êtes sur la page 1sur 8

From: AAAI-98 Proceedings. Copyright © 1998, AAAI (www.aaai.org). All rights reserved.

A Motivational System for Regulating Human-Robot Interaction

Cynthia Breazeal (Ferrell)


Massachusetts Institute of Technology
Artificial Intelligence Laboratory
545 Technology Square, l%oom 938
Cambridge, MA 02139 USA
email: ferrell@ai.mit.edu

Abstract satisfying its internal drives.


An infant’s emotions and drives play an important
This paper presents a motivational system for an
role in generating meaningful interactions with the
autonomous robot which is designed to regulate
human-robot interaction. The modeof social in- caretaker (Bullowa 1979). These interactions consti-
teraction is that of a caretaker-infant dyad where tute learning episodes for new communication behav-
a humanacts as the caretaker for the robot. An iors. In particular, the infant is strongly biased to learn
infant’s emotions and drives play a very impor- communication skills that result in having the care-
tant role in generating meaningful interactions taker satisfy the infant’s drives (Halliday 1975). The
with the caretaker, and regulating these inter- infant’s emotional responses provide important cues
actions to maintain an environment suitable for which the caretaker uses to assess how to satiate the
the learning process (Bullowa 1979). Similarly, infant’s drives, and how to carefully regulate the com-
the learning task for the robot is to apply vari- plexity of the interaction. The former is critical for the
ous communicationskills acquired during social
exchanges to manipulate the caretaker such that infant to learn how its actions affect the caretaker, and
its drives are satisfied. Towardthis goal, the mo- the later is critical for establishing and maintaining a
tivational system implements drives, emotions, suitable learning environment for the infant where he
and facial expressions. Although the details of is neither bored nor over-stimulated.
the learning itself are beyond the scope of this The robot’s motivational system is designed to gen-
paper, this work represents an important step to- erate an analogous interaction for a robot-human dyad
wardrealizing robots that can engage in mean- as for an infant-caretaker dyad. As such, the motiva-
ingful bi-directional social interactions with hu- tional system implements dr±yes, enlot±ons, and fa-
mans.
cial expressions. These components interact with one
another to maintain a mutually regulated interaction
Introduction with the human at an appropriate level of intensity.
This paper focuses on the details of how the motiva-
We want to build robots that engage in meaningful
tional system performs this regulatory function, the
social exchanges with humans. In contrast to current details of what is learned and how the learning occurs
work in robotics that focus on robot-robot interactions are left for future papers.
(Billard & Dautenhahn 1997), this work concentrates
on human-robot interactions. By doing so, it is pos- A picture of the robot is shownin figure 1. It consists
sible to have a socially sophisticated human assist the of two active stereo systems, vision and audio, embel-
robot in acquiring more sophicticated communication lished with facial features for emotive expression. Cur-
skills and help it learn the meaning these acts have for rently, these facial features include eyebrows, ears, eye-
others. Toward this end, our approach is inspired by balls, and eyelids (with a mouth soon to follow). The
the way infants learn how to communicate with adults. robot is able to show recognizable expressions analo-
This work represents the first stages of this long term gous to anger, fatigue, fear, disgust, excitement, hap-
endeavor. We present a motivational system for an piness, interest, saddness, and surprise.
autonomous robot specialized for learning in a social This paper is organized as follows: first we dis-
context. Specifically, the modeof social interaction is cuss the numerous roles motivations play in natural
that of a caretaker-infant dyad where a human acts as systems--particularly as it applies to behavior selec-
the caretaker for the robot. The communication skills tion, regulating the intensity of social interactions, and
targeted for learning are those exhibited by infants, learning in a social context. Next we present a frame-
i.e., turn taking, shared attention, vocalizations. The work (inspired by ideas from ethology, psychology, and
context for learning involves social exchanges where cognitive development) for the design of the motiva-
the robot learns how to manipulate the caretaker into tional system and its integration with behavior and
The Molivalional System__ i

TheEiehaviol"
~ystern
"t0i
m i

~J! I~.><,.,
,i
e-.~X’
Figure 1: At left, Kismetdisplays an angry expression.
At right, it displays a look of surprise. Kismethas an 4, =/
active stereo vision system with color CCDcameras
mountedinside the eyeballs. A small microphones is
mountedon each ear providing audio inputs. "i
i

Figure 2: This figure illustrates our framework for


expressive motor acts. After we illustrate these ideas building a motivational system and integrating it with
with a particular implementation on a physical robot, behavior in the world. The implementation used in our
we present the results of some early human-robot in- experiments is shownin figure 3
teraction experiments. Finally, we discuss planned ex-
tensions to the existing system.

A Framework for Designing itself through behavior. Each drive, emotion, behav-
ior, percept, and motor skill are modeledas a separate
Motivational Systems transducer process specifically tailored for its role in
A framework for how the motivational system inter- the overall system architecture. Details are presented
acts with and is expressed through behavior is shown in the following section.
in figure 2. The system architecture consists of four Drives: The robot’s drives serve three purposes.
subsystems: the motivation system, the behavior sys- First, they influence behavior selection by preferen-
tem, the perceptual system, and the motor system. The tially passing activation to somebehaviors over others.
motivation system consists of drives and emotions, Second, they influence the emotive state of the robot
the behavior system consists of various types of be- by passing activation energy to the emotive processes.
haviors as conceptualized by Tinbergen (1951) and Since the robot’s expresions reflect its emotive state,
Lorenz (1973), the perceptual system extracts salient the drives indirectly control the expressive cues the
features from the world, and the facial expressions are robot displays to the caretaker. Third, they provide a
implementedwithin the motor system along with other learning context - the robot learns skills that serve to
motor skills. The organization and operation of this satisfy its drives.
frameworkis heavily influenced by concepts from psy- The design of the robot’s drive subsystem is heav-
chology, ethology, and developmental psychology. ily inspired by ethological views (Lorenz 1973), (Tin-
Computational Substrate: The overall system is bergen 1951). One distinguishing feature of drives
implementedas an agent-based architecture similar to is their temporally cyclic behavior. That is, given no
that of (Blumberg1996) and (Maes 1990). For this stimulation, a drive will tend to increase in intensity
plementation, the basic computational process is mod- unless it is satiated. For instance, an animal’s hunger
eled as a transducer. Its activation energy x is com- level or need to sleep follows a cyclical pattern.
puted by the equation: z = (~j=l wj ¯ ij) + b for in- Another distinguishing feature of drives are their
teger values of inputs ij, weights wj, bias b where n homeostatic nature. For animals to survive, they must
is the number of inputs. The weights can be either maintain a variety of critical parameters (such as tem-
positive or negative; a positive weight corresponds to perature, energy level, amountof fluids, etc.) within
an excitatory connection and a negative weight cor- bounded range. As such, the drives keep changing in
responds to an inhibitory connection. The process intensity to reflect the ongoingneeds of the robot and
is active whenits activation level exceeds an activa- the urgency for tending to them. There is a desired
tion threshold. Whenactive, the process may perform operational point for each driveand an acceptable
some special computation, send output messages to bounds of operation around that point. Wecall this
connected processes, spread someof its activation en- range the homeostatic regime. As long as a drive is
ergy to connected units (Maes 1990), and/or express within the homeostatic regime, the robot’s "needs" are
being adequately met. Emotions: For the robot, emotionsof the robot
serve two functions. First, they influence the emotive
For my robot, each drive is modeled as a separate expression of the robot by passing activation energy
process with a temporal input to implementits cyclic to the face motor processes. Second, they play an im-
behavior. The activation energy of each drive ranges portant role in regulating face to face exchanges with
between [-max,-kma~c], where the magnitude of the the caretaker. The drives play an important role in
drive represents its intensity. For a given drive level, establishing the emotional state of the robot, which is
a large positive magnitudecorresponds to being under- reflected by its facial expression, hence emotions play
stimulated by the environment, whereas a large nega-
an important role in communicating the state of the
tive magnitudecorresponds to being overstimulated by robot’s "needs" to the caretaker and the urgency for
the environment. In general, each drive is partitioned tending to them. The emotions also play an important
into three regimes: an under-whelmedregime, an over- role in learning during face to face exchanges with the
whelmedregime, and the homeostatic regime.
caretaker, but we leave the details of this to another
Behaviors: Drives, however, cannot satiate them- paper.
selves. They becomesatiated whenever the robot is The organization and operation of the emotion sub-
able to evoke the corresponding consummatorybehav- system is strongly inspired by various theories of emo-
ior. For instance, with respect to animals, eating sa-
tiates the hunger drive; sleeping satiates the fatigue tions in humans (Ekman & Davidson 1994), (Izard
1993), and most closely resembles the frameworkpre-
drive, and so on. At any point in time, the robot is sented in (Velasqucz 1996). The robot has several
motivated to engage in behaviors that maintain the emotion processes. Although they are quite differ-
drives within their homeostatic regime. Furthermore, ent from emotions in humans, they are designed to
whenevera drive movesfarther from its desired oper- be rough analogs -- especially with respect to the ac-
ation point, the robot becomesmore predisposed to en- companyingfacial expressions. As such, each emotion
gage in behaviors that serve to satiate that drive -- as is distinct from the others and consists of a family of
the drive activation level increases, it passes more of similar emotions which are graded in intensity. For
its activation energy to the corresponding consumma- instance, happiness can range from being content (a
tory behavior. As long as the consummatorybehavior baseline activation level) to ecstatic (a high activa-
is active, the intensity of the drive is reduced toward tion level). Numerically, the activation level of each
the homeostatic regime. Whenthis occurs, the drive emotion can range between [0, max] where max is an
becomessatiated, and the amountof activation energy integer value determined empirically. Although the
it passes to the consummatorybehavior decreases until emotions are always active, their intensity must ex-
the consummatorybehavior is eventually released. ceed a threshold level before they are expressed exter-
For each consummatorybehavior, there mayalso be nally. Whenthis occurs, the corresponding facial ex-
one or more affiliated appetitive behaviors. One can pression reflects the level of activation of the emotion.
view each appetitive behavior as a separate behavioral Once an emotion rises above its activation threshold,
strategy for bringing the robot to a state whereit can it decays over time back toward the base line level (un-
directly activate the desired consummatorybehavior. less it continues to receive inputs from other processes
For instance, the case mayarise where a given drive or events). Hence, unlike drives, emotions have an
stongly potentiates its consummatorybehavior, but intense expression followed by a fleeing nature. Ongo-
environmental circumstances prevent it from becom- ing events that maintain the activation level slightly
ing active. In this case, the robot maybe able to ac- above threshold correspond to moods in this imple-
tivate an affiliated appetitive behavior instead, which mentation. Tempermanents are established by setting
will eventually enable the consummatorybehavior to the bias term. Blends of emotions occur when several
be activated. compatible emotions are expressed simultaneously. To
avoid having conflicting emotions active at the same
In this implementation, every behavior is modeled time, mutually inhibitory connections exist between
as a separate goal-directed process. In general, both confliting emotions.
internal and external factors are used to computetheir
relevance (whether or not they should be activated). Facial Expressions: For each emotion there is an
The activation level of each behavior can range be- accompanying facial expression. These are imple-
tween [0, max] where max is an integer value deter- mented in the motor system amongvarious motor pro-
mined empirically. The most significant inputs come cesses. The robot’s facii features moveanalgously to
from the drive they act to satiate and from the en- howhumansadjust their facial features to express dif-
vironment. Whena consummatorybehavior is active, ferent emotions, and the robot’s ears moveanalogously
its output acts to reduce the activation energy of the to how dogs to move theirs to express motivational
drive it is associated with. Whenan appetitive be- state.
havior is active, it serves to bring the robot into an
environmentalstate suitable for activating the affili-
ated consummatory behavior.
Design of the Motivational System confused. This occurs whenthe robot receives more
The robot’s motivational system is composedof three stimulation than it can effectively assimilate, and
predisposes the robot to reduce its interaction with
inter-related subsystems. One subsystem implements
the robot’s drives, another implements its emotions, the enviroment, perhaps by closing its eyes, turning
and the last implements its facial expressions. Al- its head away from the stimulus, and so forth.
though the expressive skills are implemented in the ¯ Security Drive. Muchof what the robot learns are
motor system, here we consider them as part of the anticipatory modelsof the effects of its actions on the
motivational system. Wealso present relevant aspects world. If these models hold true, the implication is
of the behavior system. Wepresent the design spec- that the the robot can use these expectations to be-
ification of each subsystem in the remainder of this have adeptly within the environment. This drive
section. plays an important role in regulating the robot’s in-
Motivations establish the nature of a creature by teraction with its environment where many(but not
defining its needs and influencing howand whenit acts all) of these modelsare effective in guiding behavior.
to satisfy them. The "nature" of myrobot is to learn in By doing so, the robot maintains an environment
a social environment. All drives, emotions, and be- where it is competentyet slightly challenged, i.e. it
haviors are organized such that the robot is in a state of needs to modifyits existing modelsto better suit its
homeostatic balance whenit is functioning adeptly and environment or learn new ones. As time passes and if
is in an environmentthat affords high learning poten- left unsatiated, the drive tends toward the secure
tial. This entails that the robot be motivated to engage end of the spectrum. This implies that the robot’s
in appropriate interactions with its environment (i.e. expectations hold true for its interactions with the
the caretaker), and that it is neither under-whelmed environment. If this is not true, its consummatory
or over-whelmedby these interactions. behavior moves the drive toward the insecure end.
The Drive Subsystem: For an animal, adequately ¯ Fatigue drive. This drive is unlike the others in
satisfying its drives is paramountto survival. Similarly, that its purpose is to allow the robot to shut out
for myrobot, maintaining all its drives within their the external world instead of trying to regulate its
homeostatic regime is a never-ending, all important interaction with it. While the creature is "awake",
process. it receives repeated stimulation and learns new pre-
So far, the robot has four basic drives. They are as dictive modelsfor howits actions affect the world.
follows: As time passes (and as the numberof learned events
¯ Social drive: One drive is to be social, i.e. to increases) this drive approaches the exhaused end
of the spectrum. Oncethe intensity level exceeds a
be in the presence of people and to be stimulated
by people. This is important for biasing the robot certain threshold, it is time for the robot to "sleep".
to learn in a social context. On the under-whelmed This is the time for the robot to do "internal house-
extreme the robot is lonely, i.e., it is predisposed keeping", i.e. try to consolidate its learned anticipa-
to act in ways to get into face to face contact with tory models and integrate them with the rest of the
people. If left unsatiated, this drive will continue to internal control structure. While the robot "sleeps",
intensify toward the lonely end of the spectrum. On the drive returns to the homeostatic regime, the
robot awakens and is ready to exercise its newly
the over-whelmedextreme, the robot is asocial, i.e.
it is predisposed to act in ways to disengage people modified control structure.
from face to face contact. The robot tends toward The Behavior Subsystem: For each drive there
the asocial end of the spectrum when a person is is an accompanying consummatorybehavior. Ideally,
over-stimulating the robot. This may occur when a it becomes active when the drive enters the under-
person is movingto much,is too close to the camera, whelmedregime and remains active until it returns to
an so on. the homeostatic regime. The consummatorybehaviors
¯ Stimulation drive:Anotherdriveis to be stim- are as follows:
ulated, where the stimulus can either be generated ¯ PlaywithPeopleactsto movethe socialdrive
externally by the environment or internally through back toward the asocial end of the spectrum. It is
spontaneous self-play. On the under-whelmedend of potentiated more strongly as the social drive ap-
this spectrum, the creature is bored. This occurs if proaches the lonely end of the spectrum. Its activa-
the creature has been inactive or unstimulated over a tion level increases above threshold whenthe robot
period of time. With respect to learning, this drive can engagein face to face interaction with a person,
also tends toward the bored end of the spectrum if and it remainsactive for as long as this interaction is
the current interaction becomesvery predictable for maintained. Only when active does it act to reduce
the robot. This biases the robot to engage in new the intensity of the drive.
kinds of activities and encourages the caretaker to
challenge the robot with new interactions. On the ¯ Play with Toys acts to move the stimulation
over-whelmedpart of the spectrum, the creature is drive back toward the confused end of the spec-
trum. It is potentiated more strongly as the driveis no longer potentiating the consummatorybe-
stimulation driveapproaches the boredend of havior; the enviromental input alone is strong enough
the spectrum. The activation level increases above to keep it active. Whenthe drive enters the over-
threshold when the robot can engage in some sort whelmedregime, the system is strongly motivated to
of stimulating interaction, either with the environ- engage in behaviors that act to stop the stimulation.
mentsuch as visually tracking an object or with itself For instance, if the caretaker is interacting with the
such as playing with its voice. It remains active for robot too intensely, the social drive may moveinto
as long as the robot maintains the interaction, and the asocial regime. Whenthis occurs, the robot dis-
while active it continues to movethe drive toward plays an expression of displeasure, which is a cue for
the over-whelmedend of the spectrum. the caretaker to back off a bit.
¯ Expectation Violation actsto movethe security The Emotion Subsystem: So far, there are eight
drive toward the insecure end of the specturm. It emotions implemented in this system, each as a sep-
is potentiated more strongly as the security drive arate process. The overall framework of the emo-
approaches the secure end of the specturm (imply- tion system shares strong commonality with that of
ing the robot is becoming"bored" with its interac- (Velasquez 1996), although its function is specifically
tions). Its activation level increases wheneverthe targeted for social exchanges and learning. Of the
robot’s current expectations are violated. Whenthe robot’s emotions, anger, disgust, fear, happiness,
activation level rises above threshold, it movesthe and sadness are analogs of the primary emotions in
security drive toward the over-whelmed side of humans. The last three emotions are somewhat con-
the spectrum. troversal in classification, but they play in an impor-
tant role in learning and social interaction between
¯ Sleep acts to satiate the fatique drive. Whenthe caretaker and infant so they are included in the sys-
fatigue drivereaches a specified level, the sleep tem: suprise, interest, excitement. Many exper-
consummatorybehavior turns on and remains active imentsin developmental psychology have shown that
until the fatigue drive is restored to the homeo- infants show suprise when witnessing an unexpected
static regime. Whenthis occurs, it is released and or novel outcome to a familiar event (Carey &Gelman
the robot "wakes up". 1991). Furthermore, parents use their infant’s display
Sleep also serves a special "motivation reboot" func- of excitement or interest as cues to regulate their in-
tion for the robot. Whenactive, it not only restores teraction with them (Wood, Bruner & Ross 1976).
the fatige drive to the homeostatic regime, but all In humans,four factors serve to elicit emotions, i.e.
the other drives as well. If any drive movesfar from neurochemical, sensorimotor, motivational, and cogni-
its homeostatic regime, the robot displays stronger and tive (Izard 1993). In this system, emphasis has been
stronger signs of distress, which eventually culminates placed on how drives, other emotions and pain con-
in extreme anger if left uncorrected. This expressive tribute to a given emotion’s level of activation. The
display is a strong sign to the caretaker to intervene active emotions and accompanyingfacial expressions
and help the robot correct its drive imbalance. If provide the caretaker with cues as to the motivational
the caretaker fails to act appropriately and the drive state of the robot and howthe caretaker should act to
reaches an extreme, a protective mechanismkicks in help satiate the robot’s drives.
where the robot shuts itself downby going to sleep. ¯ Pain: Pain information comes from perceptual pro-
This is a last ditch methodfor the robot to restore all cessing whenthe intensity of the signal is too strong.
its drives by itself. A similar behavior is observed in Perhaps a bright light is shining in the camerawhich
infants. Whenthey are in extreme distress, perhaps "blinds" the robot, or perhaps a sound is so loud
throwing a tantrum, they may fall into a disturbed that the robot cannot hear anything else, etc. In
sleep. This is a self regulation tactic they use in ex- this case, the pain signals serve to increase the level
treme cases (Bullowa 1979). of angerand sadness so the robot exhibits signs of
Three of the four consummatory behaviors cannot distress. This maybe accompaniedby other protec-
be activated by the intensity of the drive alone. In- tive responses such as closing its eyes, rotating its
stead, they require a special sort of environmentalin- ears away from the loud sound source, etc. Nom-
teraction to becomeactive. For instance, Play with inally, the caretaker wouldinterpret these cues as
People cannot become active without the participa- "discomfort" for the robot and seek out the source.
tion of a person. Analogouscases hold for Play with
Toys and Expectation Violation. Furthermore, it ¯ Other Emotions: The influence from other emotions
is possible for these behaviors to becomeactive by the serve to prevent conflicting emotions from becoming
environmentalone if the interaction is strong enough. active at the sametime. To implementthis, conflict-
This has an important consequence for regulating ing emotions have mutually inhibitory connections
the intensity of interaction. For instance, if the na- between them. For instance, inhibitory connections
ture of the interaction is too intense, the drive may exist between happiness and sadness, between
moveinto the over-whelmedregime. In this case, the disgust and happines, and between happiness and
anger, commonfacial expressions. On top of the motor skills
are the face expression processes. These direct all fa-
¯ Drives: Recall that each drive is partitioned into
cial features to showa particular expression. For each
three regimes: homeostatic, over-whelmedor under- expression, the facial features moveto a characteristic
whelmed.This establishes the drive context for the
configuration, howeverthe intensity can vary depend-
system. For a given drive, each region potentiates ing on the intensity of the emotionevoking the expres-
a different emotionand hence a different facial ex- sion. In general, the more intense the expression, the
pression. In this way the facial expressions provide facial features movemore quickly to more extreme po-
cues as to what drive is out of balance and howthe
caretaker should respond to correct for it. sitions. Blended expressions are computed by taking
a weighted average of the facial configurations corre-
In general, when a drive is in its homeostatic sponding to each evoked emotion.
regime, it potentiates positive emotions such as
happiness or interest. The accompanying expres- Experiments and Results
sion tells the caretaker that the interaction is going
well and the robot is poised to play and learn. When
a drive is not within the homeostatic regime, nega-
tive emotions are potentiated (such as anger, disgust, .
or sadness) which produces signs of distress on the
robot’s face. The particular sign of distress pro-
vides the caretaker with additional cues as to what
is "wrong"and howshe might correct for it. With re-
spect to learning, one could easily envision a scenario
i
i-
where a look of suprise appears on the robot’s face
whenever an unexpected event occurs. This would be
a cue to the caretaker that the robot does not have
t
an anticipatory modelfor this event, in which case the
caretaker maychooserepeat the event to help the robot
learn a suitable expectation.
Note that the same sort of interaction can have a Figure 3: Diagramof the motivational system used in
very different "emotional" affect on the robot depend-
the following experiments. Double headed arrows rep-
ing on the drive context. For instance, playing with resent mutually inhibitory connections between nodes.
the robot while all drives are within the homeostatic
regime elicits happiness. This tells the caretaker that A series of early experiments were performed with
playing with the robot is a good interation to be hav- the robot using the motivational system shownin fig-
ing at this time. However,if the fatigue drive is deep ure 3. The system consists of two drives (fatigue
into the exhausted end of the spectrum, then playing
and social), two consummatorybehaviors (sleep and
with the robot actually prevents the robot from going play), one external input stimulus, and a number of
to sleep. As a result, the fatigue drivecontinues to emotions and corresponding facial expressions. The
increase in intensity. Whenhigh enough, the fatigue external input is provided by a humanthrough a GUI
drive begins to potentiate anger. The caretaker may interface and represents the intensity of interaction.
interpret this as the robot acting "cranky" because it The robot’s face changes expression over time as the
is "tired". In the extreme case, fatigue maypotenti- humaninteracts with it through the slider, reflecting
ate anger so strongly that the robot displays "fury". its ongoing motivational state and providing the hu-
The caretaker may construe this as the robot throwing man with visual cues as to how to modify the inter-
a "tantrum". Nominally, the caretaker would back off action to keep the robot’s drives within homeostatic
before this point and allow the sleep behavior to be ranges.
activated. In general, as long as the robot’s drives remain
The Motor Subsystem: For each emotion there is within their homeostatic ranges, the robot displays "in-
an accompanyingfacial expression. These are imple- terest". If the humaninteracts with the robot while
mented in the motor system where there are various in the drives are within their homeostatic regime, the
motor processes. The low level face motor primitives robot displays "happiness". However, once a drive
are separate processes that control the position and leaves its homeostatic range, the robot’s "interest" and
velocity of each degree of freedom. The motor skill "happiness" waneas it grows increasingly distressed.
processes are one level above the primitives. They im- As this occurs, the robot’s expression reflects its dis-
plement coordinated control of the facial features such tressed state. This visual cue tells the humanthat all is
as wiggling the ears or eyebrows independently, arch- not well with the robot, and whether the humanshould
ing both browsinward, raising the brows, and so forth. intensify the interaction, diminish it, or maintain it at
Generally, they are the coordinated motions used in its current level.
i lOOO -~

o
i
~N
!
t
Figure 4: Changesin state of the motivational and behavior systems in response to the social drive and various
intensities of humaninteraction. The left figure (a) corresponds to a short period of insufficient humaninteraction,
the middle figure (b) corresponds to a short period of overly intense humaninteraction, and the right figure (c)
corresponds to an extended period of overly intense interaction.

Figures 4(a), (b), and (c) illustrate the influence extremely anxious state for a prolonged time. Analo-
the social drive on the robot’s motivational and be- gously for the robot, if the interaction is over-whelming
havioral state when interacting with a human. The for long period of time, the robot will first showin-
activation level of the robot’s play behavior cannot creasing signs of "disgust", eventually blending with
exceed the activation threshold unless the humanin- increasingly intense signs of anger, as the social drive
teracts with the robot with sufficient intensity - low continues to movetoward the over-whelmedend of the
intenisty interaction will not trigger the play behav- spectrum. If still no relief is encounteredand the drive
ior even if highly potentiated by the social drive. If hits its outer limit, the robot goes into an emergency
the interaction is intense, even too intense, the robot’s sleep mode.As discussed previously, sleeping serves as
play behavior remains active until the humaneither a sort of "motivational reboot" for the robot by restor-
stops the activity, or the robot takes action to end it. ing all drives to their homeostatic ranges. Hence, upon
Due to a low intensity of humaninteraction, figure "awakening", the robot is in a balanced, "interested"
4(a) showsthe robot becomingincreasingly "sad" over state.
time as the social drive tends toward the "lonely" end ~ Figures 5(a) and (b) illustrate the influence of
of the spectrum. The robot’s expression of sadness con- fatigue drive on the robot’s motivational and behav-
tinues to increase, until the humanfinally responds by ioral state wheninteracting with a human.Over time,
intensifying the interaction. Consequently, the human the fatigue drive increases toward the "exhaused"
sees the robot’s "sadness" decaying over time which end of the spectrum. As the robot’s level of "fatigue"
indicates that the robot’s social drive is returning to increases, the robot displays stronger signs of being
the homeostatic regime. Whenthe robot displays an "tired".
expression of interest again, its social drive is within Figure 5(a) shows that the robot will activate it’s
homeostatic bounds. sleep behavior when its fatigue drive moves above
In contrast, figure 4(b) shows the robot acquir- the threshold value of 1600, provided no one is engag-
ing more "asocial" tendencies when the interaction is ing the robot/ The robot remains "asleep" until all
too intense. If the interaction is over-whelming, the drives are restored to their homeostatic ranges. Once
social drive tends toward the "asocial" end of the this occurs, the activation level of the "sleep" behavior
spectrum. As this drive leaves the homeostatic range, decays until the behavior is no longer active and the
the robot becomesincreasingly "disgusted" and its ex- robot "wakes up" in an "interested" state.
pression of disgust intensifies over time. Whenthe Figure 5(b) shows what happens if a human con-
social drive reaches a fairly large negative value of tinues to interact with the robot dispite its "fatigued"
-1200, the robot displays a fairly intense look of dis- state. The robot cannot fall asleep as long as a person
gust, and the humanbacks off the interaction. This interacts with it because its play behavior remains ac-
causes the social drive to return to the homeostatic tive (note the mutually inhibitory connections in fig-
range and the robot re-establishes an "interested", ure 3). If the fatigue drive exceeds threshold and
"happy" emotional state. the robot cannot fall "asleep", the robot begins to
Figure 4(c) illustrates howthe robot can terminate show signs of "anger". Eventually the robot’s level
the interaction when the humanrefuses engage the of "anger" reaches an intense level of 1100, and the
robot appropriately. As discussed in previous sections, robot appears rageful - akin to throwing a "tantrum".
infants fall into a disturbed sleep when put into an Still the humanpersists with the interaction, but even-
Figure 5: Changes in state of the motivational and behavior systems in response to the fatigue drive and various
intensities of human interaction. The left figure (a) corresponds to minimal human interaction, and the right figure
(b) corresponds to an overly extended period of human interaction.

tually the robot’s fatigue level reaches near maximum appropriately regulate the intensity of interaction to
and emergeny actions are taken by the robot to force benefit the learning process.
an end to the interaction. The robot falls into a dis-
tressed sleep to restore its drives. Acknowledgments
The experimental results described above character- Support for this research was provided by a MUR,I
izes the robot’s behavior when interacting with a hu- grant under the Office of Naval l~esearch contract
man. It demonstrates how the robot’s "emotive" cues N00014-95-1-0600 and the Santa Fe Institute.
are used to regulate the nature and intensity of the
interaction, and how the nature of the interaction in- References
fluences the robot’s behavior. The result is an ongoing
Billard, A. & Dautenhahn, K. (1997), Grounding Commu-
"dance" between robot and human aimed at maintain- nication in Situated, Social Robots, Technical report, Uni-
ing the robot’s drives within homeostatic bounds. If versity of Manchester.
the robot and human are good partners, the robot re-
Blumberg, B. (1996), Old Tricks, NewDogs: Ethology and
mains "intersted" and/or "happy" most of the time. Interactive Creatures, PhDthesis, MIT.
These expressions indicate that the interaction is of
Bullowa, M. (1979), Be]ore Speech: The Beginning o] In-
appropriate intensity for learning and the robot dis- terpersonal Communicaion, Cambridge University Press,
plays a look of readiness to learn. Cambridge, London.
Carey, S. & Gelman, R. (1991), The Epigenesis o] Mind,
Summary Lawrence Erlbaum Associates, Hillsdale, NJ.
We have presented a framework (heavily inspired from Ekman, P. & Davidson, R. (1994), The Nature o] Emo-
work in ethology, psychology, and cognitive devel- tion: Fundamental Questions, Oxford University Press,
opment) for designing motivational systems for au- New York.
tonomous robots specifically geared to regulate human- Halliday, M. (1975), Learning Howto Mean: Explorations
robot interaction. We have shown how the dr±yes, in the Development o] Language, Elsevier, NewYork, NY.
emotions, behaviors, and facial expressions influence Izard, C. (1993), Four Systems for Emotion Activation:
each other to establish and maintain social interactions Cognitive and Noncognitive Processes, in ’Psychological
that can provide suitable learning episodes, i.e., where Review’, Vol. 100, pp. 68-90.
the robot is proficient yet slightly challenged, and Lorenz, K. (1973), Foundations o] Ethology, Springer-
where the robot is neither under-stimulated nor over- Verlag, NewYork, NY.
stimulated by its interaction with the human. With a Maes, P. (1990), ’Learning Behavior Networks from Expe-
specific implementation, we demonstrated how the sys- rience’, ECAL90.
tem engages in a mutually regulatory interaction with Tinbergen, N. (1951), The Study o] Instinct, Oxford Uni-
a human. versity Press, NewYork.
In these early experiments, the human’s input is re- Velasquez, J. (1996), Cathexis, A Computational Model
stricted to GUIsliders. The next step is to incorporate for the Generation of Emotions and their Influence in the
visual and auditory inputs. Furthermore, the specifics Behavior of AutonomousAgents, Master’s thesis, MIT.
of learning in a social context (what is learned and how Wood, D., Brunet, J. S. & Ross, G. (1976), ’The role
it is learned) was not addressed in this paper. That is tutoring in problem-solving’, Journal o] Child Psychology
the subject of work soon to follow, which will include and Psychiatry 17, 89-100.
tuning and adjusting this early motivation system to

Vous aimerez peut-être aussi