Académique Documents
Professionnel Documents
Culture Documents
Learning
The changes in behaviour
that accumulate over a
lifetime are stored in the
central nervous system.
Classical conditioning
Operant conditioning
Ivan Pavlov’s
Before Learning
“Psychic Secretions”
Meat Powder Salivation
Unconditioned Unconditioned
Ivan Pavlov
Stimulus (UCS) Response (UCR)
1849-1936
1
Before Learning During Learning
Bell No response
Neutral
Stimulus Repeatedly pair
UCS and Neutral
Stimulus
After Learning
The term ‘psychic secretions’ was
Bell Salivation eventually replaced with the term,
conditioning, and eventually became
known as Classical Conditioning or
Pavlovian Conditioning.
Conditioned Conditioned
Stimulus (CS) Response (CR)
A. neutral stimulus
B. unconditioned stimulus
C. unconditioned response
D. conditioned stimulus
E. conditioned response Pavlov’s dog, cat and
Cousin Earl
2
John Watson and Behaviorism Black Box Psychology
• “Psychology should be a purely objective
experimental branch of natural science”
• believed that psychology had failed as a
science
• disliked introspection and hypothetical THE
concepts (such as “decisions”, “knowledge”, Stimulus MIND Response
“perception”, “consciousness”)
Do NOT Open
John B. Watson
1878-1958
Little Albert
Little Albert’s fear of the white rat
generalizes to the furry rabbit
video video
3
Classical Conditioning Concepts Does Learning Generalize?
• Generalization and discrimination “He who is bitten by the snake fears the lizard.”
• Higher-order conditioning -- Bugandan proverb
• Extinction
• Spontaneous recovery In fact, the phenomenon of
• Practical applications generalization was one of the
– aversive conditioning
– compensatory reaction hypothesis
mechanisms that Pavlov hoped
would permit conditioning to explain
all kinds of complex behaviour.
20 Generalization 20 Discrimination
What will happen if you
Generalization only give food after a
Drops of saliva
Drops of saliva
10,000 Hz tone but not an
gradient 8,000 or 12,000 Hz tone?
10 10
CS used in
training
0 0
6000 8000 10,000 12,000 14,000 6000 8000 10,000 12,000 14,000
4
Extinction and Recovery Counterconditioning
Why didn’t Vietnam vets crave heroin Why do diabetics blood sugar levels Drug
when they returned to the U.S.? go UP just before they use insulin? Drug Drug Drug Drug Drug
NOT
Administered Expected Administered Expected Administered
Expected
Drug Effect
Time
How is it that some drug addicts can overdose Compensatory Reaction Tolerance
when they’ve only taken their typical dosage? • can lead to cravings • need more drug to get same effect
Compensatory-Reaction Hypothesis
HABITUAL USER
New Location
Large Dose
(e.g., blood sugar levels in diabetic)
Drug
Drug
NOT
Administered
Drug Effect
Expected
Overdose
• without compensatory
reaction, drug is too strong
5
Thorndike’s Puzzle Box Thorndike’s Cat’s Learning Curve
Edward Thorndike
(1874-1949)
Reward
that situation.
lights
to
peck
video
6
Shaping Behaviour Shaping
Reward animal for successive approximations of
Skinner believed that almost any behaviour desired behavior
could be ‘shaped’ by operant methods. 1) reward rat for approaching lever
2) reward rat for facing lever
Pigeons, for example, could be trained to 3) reward rat for touching lever
walk in circles, play ping-pong, or even play 4) reward rat for pressing lever
a tune on a small piano, if successive
approximations to the desired behavior • Reward must promptly follow behavior
were reinforced. • Can get superstitious behavior if not careful
Skinner believed the same idea could be
used to explain how children learned to talk.
Operant Conditioning
Animals can • The animal is reinforced for certain operations on
develop astounding the environment
skills through • also called Instrumental or Skinnerian Conditioning
shaping
Reinforcement
• an outcome that affects the likelihood of an
operant response re-occuring
Partial Reinforcement
• Response is not rewarded on every trial
• Makes behavior more resistant to extinction
• Can get extensive efforts for small rewards
7
Partial Reinforcement Schedules Try these…
• Classify the following situations based on the
1. Fixed ratio: reinforced after a fixed reinforcement schedule:
number of responses. – You get 1 air mile for every $20 you spend at Shell
– You may win a large amount playing video poker
– You study for the tests that your professor gives you once per month
2. Variable ratio: reinforced after a – You study regularly because your professor periodically gives “pop”
quizzes
variable number of responses. – You get one Coca-cola for every $1 of change you put in the vending
machine
– You check your snail mail which comes once per day
3. Fixed Interval: reinforced after a given – Your boss pays you $10/hour
amount of time has elapsed. – Your boss pays you 5 cents for every letter you engrave
– Your boss pays you a commission for each sale you make
8
Differences Between Classical and 2. Nature of responses that can be
Operant Conditioning. trained
1. Timing of events
In classical conditioning the responses are
ones that are generally referred to as
In classical conditioning the delivery of the
“involuntary” (eye blinks, heart rate, etc.)
reinforcement (UCS) is controlled by the
experimenter. In operant conditioning, the responses are
In operant conditioning the delivery of the ones that are generally described as
reinforcement is controlled by the animal or “voluntary” (pecking, hand movements,
human subject. speech, etc).
*a tarragon-
flavoured white sauce
9
Problems with this explanation Bright, Noisy, Tasty Water Expts.
What about the long gap between the CS • Are organisms evolutionarily
and the UCS? prepared to make particular
associations?
Why is food the CS? • The “Bright, Noisy, Tasty Water”
Why not Tristan und Isolde or
Experiments
Wagnerian opera? John Garcia
Song-learning in birds
10