Académique Documents
Professionnel Documents
Culture Documents
Topic 6
3 why learn?
3 general model of learning agents
3 inductive learning
3 learning decision trees
Reading: Russell & Norvig, Chapter 18
CITS4211
Slide 142
1. Why Learn?
So far, all intelligence comes from the designer:
time consuming for designer
restricts capabilities of agent
Learning agents can:
act autonomously
deal with unknown environments
synthesise rules/patterns from large volumes of data
handle complex data
improve their own performance
CITS4211
Slide 143
CITS4211
Slide 144
Sensors
Critic
Environment
feedback
changes
Learning
element
knowledge
Performance
element
learning
goals
Problem
generator
Agent
Effectors
CITS4211
Slide 145
CITS4211
Slide 146
CITS4211
Slide 147
CITS4211
Slide 148
CITS4211
Slide 149
3. Inductive Learning
Assume f is function to be learned.
Learning algorithm supplied with sample inputs and corresponding outputs supervised learning
Define example (or sample): pair (x, f (x))
Task:
given a set of examples of f , return a function h that approximates f .
h is called a hypothesis
Task is called pure inductive inference or induction.
Many choices for h
o
o
o
(a)
o
o
o
o
o
o
(b)
(c)
(d)
CITS4211
Slide 150
3. Inductive Learning
Example: Simple reflex agent
Sample is a pair (percept,action)
eg. percept chess board position
action best move supplied by friendly grandmaster
Algorithm:
global examples {}
function Reflex-Performance-Element(percept) returns an
action
if (percept, a) in examples then return a
else
h Induce(examples)
return h(percept)
procedure Reflex-Learning-Element(percept, action)
inputs: percept, feedback percept
action, feedback action
examples examples {(percept, action)}
CITS4211
Slide 151
3. Inductive Learning
Many variations possible, eg.
incremental learning learning element updates h with each
new sample
reinforcement agent receives feedback on quality of action
Many possible representations of h choice of representation is
critical
type of learning algorithm that can be used
whether learning problem is feasible
expressiveness vs tractability
CITS4211
Slide 152
CITS4211
Slide 153
CITS4211
Slide 154
Patrons?
None
Some
No
Full
Yes
WaitEstimate?
>60
3060
No
1030
Alternate?
No
No
Bar?
No
No
Hungry?
Yes
Reservation?
Yes
Yes
010
No
Fri/Sat?
No
No
Yes
Yes
Yes
Yes
Yes
Alternate?
No
Yes
Yes
Yes
Raining?
No
Yes
No
Yes
Yes
Choice of attributes is critical determines whether an appropriate function can be learned (garbage-in, garbage-out)
No matter how good a learning algorithm is, it will fail if appropriate features cannot be identified (cf . neural nets)
In real world problems feature selection is often the hardest task!
(black art?)
c CSSE. Includes material
c S. Russell & P. Norvig 1995,2003 with permission.
CITS4211
Slide 155
2n combinations of inputs
n
22 different functions
eg. 6 attributes, approx. 2 1019 functions to choose from
lots of work for learning element!
CITS4211
Slide 156
Example
X1
X2
X3
X4
X5
X6
X7
X8
X9
X10
X11
X12
Alt
Bar
Fri
Hun
Pat
Yes
Yes
No
Yes
Yes
No
No
No
No
Yes
No
Yes
No
No
Yes
No
No
Yes
Yes
No
Yes
Yes
No
Yes
No
No
No
Yes
Yes
No
No
No
Yes
Yes
No
Yes
Yes
Yes
No
Yes
No
Yes
No
Yes
No
Yes
No
Yes
Some
Full
Some
Full
Full
Some
None
Some
Full
Full
None
Full
Goal
Price Rain
$$$
$
$
$
$$$
$$
$
$$
$
$$$
$
$
No
No
No
No
No
Yes
Yes
Yes
Yes
No
No
No
Res
Type
Est
WillWait
Yes
No
No
No
Yes
Yes
No
Yes
No
Yes
No
No
French
Thai
Burger
Thai
French
Italian
Burger
Thai
Burger
Italian
Thai
Burger
010
3060
010
1030
>60
010
010
010
>60
1030
010
3060
Yes
No
Yes
Yes
No
Yes
No
Yes
No
No
No
Yes
Goal: find a decision tree that agrees with all the examples in
the training set
CITS4211
Slide 157
CITS4211
Slide 158
(a)
Patrons?
None
+:
: X7,X11
Some
Full
+: X1,X3,X6,X8
:
+: X4,X12
: X2,X5,X9,X10
+: X1,X3,X4,X6,X8,X12
: X2,X5,X7,X9,X10,X11
(b)
Type?
French
+: X1
: X5
Italian
+: X6
: X10
Thai
Burger
+: X4,X8
: X2,X11
+: X3,X12
: X7,X9
+: X1,X3,X4,X6,X8,X12
: X2,X5,X7,X9,X10,X11
(c)
Patrons?
None
+:
: X7,X11
Yes
Some
+: X1,X3,X6,X8
:
Full
+: X4,X12
: X2,X5,X9,X10
No
Hungry?
Y
+: X4,X12
: X2,X10
c CSSE. Includes material
c S. Russell & P. Norvig 1995,2003 with permission.
CITS4211
+:
: X5,X9
Agents that Learn
Slide 159
CITS4211
Slide 160
CITS4211
Slide 161
No
Some
Full
Hungry?
Yes
No
Type?
French
Yes
Italian
Yes
No
Thai
Burger
Yes
Fri/Sat?
No
No
No
Yes
Yes
CITS4211
Slide 162
CITS4211
Slide 163
0.9
0.8
0.7
0.6
0.5
0.4
0
20
40
60
Training set size
CITS4211
80
100
Slide 164
CITS4211
Slide 165
CITS4211
Slide 166
The End
CITS4211
Slide 167