Psychophysics of Remembering

JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR 1999, 71, 91–113 NUMBER 1 (JANUARY)
K. GEOFFREY WHITE AND JOHN T. WIXTED
UNIVERSITY OF OTAGO, NEW ZEALAND AND
UNIVERSITY OF CALIFORNIA, SAN DIEGO
We present a new model of remembering in the context of conditional discrimination. For proce-
dures such as delayed matching to sample, the effect of the sample stimuli at the time of remem-
bering is represented by a pair of Thurstonian (normal) distributions of effective stimulus values.
The critical assumption of the model is that, based on prior experience, each effective stimulus value
is associated with a ratio of reinforcers obtained for previous correct choices of the comparison
stimuli. That ratio determines the choice that is made on the basis of the matching law. The standard
deviations of the distributions are assumed to increase with increasing retention-interval duration,
and the distance between their means is assumed to be a function of other factors that influence
overall difficulty of the discrimination. It is a behavioral model in that choice is determined by its
reinforcement history. The model predicts that the biasing effects of the reinforcer differential
increase with decreasing discriminability and with increasing retention-interval duration. Data from
several conditions using a delayed matching-to-sample procedure with pigeons support the predic-
tions.
Key words: remembering, conditional discrimination, forgetting functions, discriminability, rein-
forcer probability, delayed matching to sample, pigeon
Earlier versions of the model described in this paper were presented to the meetings of the Society for Quantitative Analyses of Behavior, Washington, D.C., June, 1995, and the Behavior Symposium, Christchurch, New Zealand, August, 1996. Preparation of the manuscript was facilitated by the generous hospitality provided by Peter Killeen during the first author’s leave at Arizona State University. Killeen’s constructive comments considerably improved the manuscript. We are grateful to Angela Ruske for her outstanding assistance during the conduct of the experiments reported here, to members of our laboratory groups, especially Emily Cooney and Deirdra Dougherty, for their helpful discussions, and to Barry Dingwall, for his essential technical expertise.
Requests for reprints should be addressed to the first author at the Department of Psychology, University of Otago, Dunedin, New Zealand (E-mail: kgwhite@otago.ac.nz or jwixted@ucsd.edu).

The foundation for a psychophysical analysis of remembering was laid over a century ago, with Fechner’s quantitative analysis of sensation and Ebbinghaus’ experimental analysis of memory. Fechner (1860) had proposed that sensory experience is a logarithmic function of stimulus intensity. Although the function relating experience and environment has since been interpreted as following a power law (Stevens, 1961), Fechner’s fundamental contribution was to render sensory experience amenable to scientific analysis. Fechner had also described a theory for discrimination that predated the more recent development of signal-detection theory (Link, 1994). The field of psychophysics now offers well-established methods for quantifying the changes in behavior that result from measurable changes in the physical environment.

Ebbinghaus (1885/1964) showed that changes in remembering were amenable to quantification. His measure of retention, the percentage savings in time to relearn a list of nonsense words, decreased logarithmically as retention interval increased over a period of several weeks. It has since been shown that Ebbinghaus’ data are better described by a power function (Anderson & Schooler, 1991; Wixted & Ebbesen, 1991). Ebbinghaus’ early attempt to quantify memory nevertheless established the possibility that remembering may be amenable to analysis in the same terms as sensing and perceiving.

The aim of the present paper is to offer an analysis of remembering that follows the general approach of signal-detection theory and to apply it to remembering in nonhuman animals. The analysis assumes that the effect of the stimulus can be represented in terms of discriminal processes of the kind suggested by Thurstone (1927). Unlike decision-theoretic approaches, however, a decision criterion is not assumed; instead, the animal’s choice is assumed to be influenced directly by the payoff ratio. This approach makes the interesting prediction that the effect of payoffs in biasing an animal’s choice depends on
the discriminability of the stimuli to be discriminated.

Detection Models of Recognition

The treatment of remembering in the same terms as sensory discrimination was not developed until 80 years after Ebbinghaus, when Murdock (1965) suggested that remembering was a matter of discriminating familiar from novel events (see also Banks, 1970; Lockhart & Murdock, 1970; Parks, 1966). An important assumption was that familiar and novel items were assumed to vary in “memory strength” or “familiarity,” with the mean of the distribution of novel events at zero and the mean of the distribution of familiar events at a value of memory strength greater than zero. The familiarity of an event increases with training or practice. The increase in familiarity (or item strength) tends to follow a power function of the number of training trials (Anderson, 1995). According to the theory of signal detection (Green & Swets, 1966), the task of deciding whether a particular event is familiar is accomplished by using a decision rule: Events of memory strength greater than a criterion value are categorized as familiar and hence are responded to as remembered, whereas those of weaker strength than the criterion are categorized as novel.

Discriminability and Bias

The signal-detection approach to recognition generated an extensive empirical literature that has benefited from perhaps the most influential theoretical assumption of psychophysics: the independence of discriminability and response bias. The approach yields a measure of discriminability or memorability, measured by the distance d′ between the means of the distributions of novel and familiar events. Memorability is independent of the location of the criterion value for deciding between familiar and novel. The location of the criterion value can be used as a measure of response bias, that is, the tendency to report familiar versus novel independently of the discriminability of familiar from novel. A major determinant of the location of the criterion is relative payoff, although specific rules relating criterion value to relative payoff have yet to be developed (Macmillan & Creelman, 1991). If the payoff favors reporting an event as familiar, the criterion is adjusted so that more events are reported as familiar. In principle, and in the absence of criterion variance, the analysis prescribed by the signal-detection approach leaves the measure of discriminability untainted by errant changes in response tendencies, and reveals pure memorability.

Criterion Location

Egan (1975) and Macmillan and Creelman (1991) have summarized possibilities for choice of a decision rule for models that rely on the assumption of a decision criterion. (a) A probability matching rule was proposed by Parks (1966) and elaborated by Thomas and Legge (1970) and Thomas (1975). The probability matching rule is assumed to apply in cases in which observers have incomplete information about underlying distributions in contrast to the “ideal observer” who has complete information and is thus able to utilize a likelihood ratio criterion. For symmetrical payoff matrices, the probability matching rule assumes that the observer reports occurrence of the target items with a probability that matches their a priori occurrence. Thus, for example, if Stimulus A occurs on 80% of the trials and Stimulus B occurs on 20% of the trials, the criterion will be placed on the decision axis in such a way that the probability of reporting Stimulus A is 80%. Creelman and Donaldson (1968) showed that for judgments of line length, changes in the prior probability of the stimuli did not affect discriminability (i.e., the distance between the distributions) but did affect the placement of the decision criterion such that the highly trained subjects matched response proportions to relative stimulus probability. Because correct responses produced monetary rewards, it is also possible that response proportions were sensitive to the relative monetary payoff. In other studies, response proportions undermatched prior stimulus probability (Dusoir, 1975). (b) An alternative decision rule is one that maximizes expected value. If expected value is to be maximized, the rule takes account of both the prior probabilities of occurrence of the stimuli and the payoff matrix. It assumes that the decision rule is based on the likelihood ratio, that is, the ratio of probabilities of occurrence of one sample versus the other (Egan, 1975). Healy and Kubovy (1981) orthogonally varied payoff and prior probability in a numerical categorization task. They reported that prior probabilities had larger effects on the likelihood ratio criteria than did payoff, but that a probability matching rule that included a constant based on the payoff matrix could account for their data.

Conditional Discrimination

The separation of discriminability from bias in psychophysics has a parallel in the study of conditional discrimination learning, where the effects of the discriminative stimuli may be separated from the effects of the differential reinforcer probabilities that maintain the discrimination (Nevin, 1981; White, 1986; Williams, 1988). In the context of choice procedures, differential reinforcement biases the choice towards one alternative versus another. Variation in the reinforcer probabilities for two choices results in a power function relation between the ratios of the choices and the reinforcer ratios (Baum, 1974). Variation in the disparity of the discriminative stimuli of a conditional discrimination allows an assessment of the extent to which stimulus disparity limits the discrimination (Nevin, 1969). In conditional discriminations, however, the effects of the reinforcer differential are modulated by stimulus disparity. When choice relies on both stimulus and reinforcer differences, large stimulus differences attenuate the effect of the reinforcer differential and small stimulus differences amplify the reinforcer effect (White, 1986). Signal-detection procedures, which are also classed as conditional discriminations (McCarthy & White, 1987), are associated with a similar problem. The extent of discriminability may modulate the effects of the reinforcers that otherwise bias choice towards one or the other alternative. In situations in which discriminability is expected to vary, the effect of the reinforcer differential may interact with the effects of the discriminative stimuli.

Discriminability Decrement in Remembering

The interaction of reinforcer effects with stimulus effects is especially evident in remembering by virtue of its primary characteristic, the systematic decrement in discriminability with increasing retention-interval duration. Remembering (or forgetting) involves continual changes in discriminability or stimulus control. At very short retention intervals, remembering is easy and the biasing effects of differential payoffs are attenuated, whereas at very long retention intervals, remembering is difficult and the effects of differential payoffs are magnified (Wixted, 1989). The model we describe here addresses this issue and at the same time deals with another more fundamental difficulty associated with the earlier signal-detection approach to remembering, namely the problem of specifying how the subject’s decision rule is determined by the location of the criterion on the memory strength continuum.

The model we describe does not rely on the assumption of a criterion. It combines the time-honored assumption that stimulus effects are represented by random values drawn from normal density functions, as in Thurstone’s (1927) discriminal processes, with the more recent generalization that the preference or choice between alternatives is a function of the payoffs they produce (Baum, 1974; Herrnstein, 1970).

Interaction of Reinforcer and Stimulus Control

Quantification of the effects of payoffs on remembering is easily achieved in terms of the matching law. Jones and White (1992) examined performance in a standard delayed matching-to-sample procedure in which five responses to a red or green sample stimulus initiated a delay interval of variable duration, followed by a choice between red and green comparison stimuli. The probability of reinforcers for correct (matching) choices following red and green sample stimuli was varied over several conditions. When the probability of a reinforcer for a correct choice of red was higher than for a correct choice of green, the tendency to choose red was greater than the tendency to choose green. More generally, the ratio of red to green choices following red and green samples was a power function of the ratio of reinforcers obtained by correct red versus green choices.

The result reported by Jones and White (1992) is summarized in Figure 1, for 1 of their birds, X1. Panel A shows the standard decrement in matching accuracy with increasing delay-interval duration. Matching accuracy was assessed in terms of a measure of
discriminability (see below). Panel B shows that the tendency to choose red versus green is a function of the ratio of reinforcers obtained by red versus green choices, as expected from the power function version of the matching law (Baum, 1974). The slopes of the functions in Panel B are a measure of the exponent of the power function relating the ratio of choices to the ratio of reinforcers. The exponent is usually interpreted as an index of sensitivity to reinforcement (Davison & McCarthy, 1988). Panel C summarizes these slopes and, more importantly, indicates that sensitivity to reinforcement is a function of delay-interval duration. The slopes of the functions in Panel B gradually increase with increasing retention-interval duration.

Fig. 1. Data for Bird X1 replotted from results reported in Appendix A of Jones and White (1992). In (A), the forgetting function is defined by the decrease in discriminability as a function of delay-interval duration. In (B), the slope of the function relating the log ratio of red to green choices to the log ratio of red to green reinforcers steepens with increasing delay, as summarized in (C).

The result that the biasing effect of the reinforcer ratio is weak at short delays when discriminability is high and is strong at long delays when discriminability is weak has a parallel in an earlier result for conditional discriminations (White, 1986; White, Pipe, & McLean, 1985). Here, reinforcer control of the choice was weak for an easy line-tilt discrimination and strong for a difficult discrimination. Nevin, Cate, and Alsop (1993) have also reported a stronger effect of varying reinforcer probability for a smaller disparity between discriminative stimuli in a discrete-trials conditional discrimination. That is, there is a parallel between reinforcer sensitivity effects for perceptual and memorial procedures. If a discrimination is easy because of large stimulus disparity or a short retention interval, there is little effect of varying the reinforcer ratio. If a discrimination is difficult, as with small stimulus disparity or long retention intervals, behavior is very sensitive to variation in the reinforcer ratio. This result is consistent with the model of delayed matching-to-sample performance described by Wixted (1989). According to this model, the influence of sample stimuli and the ratio of reinforcers obtained by correct choices have separate effects that depend on the delay interval. At short delays the conditional discrimination is influential and the reinforcer effect is weak. At long delays when the conditional discrimination is weaker, the reinforcer effect is strong.

Response Measures

The measures of discriminability and bias that we adopt are, respectively, the geometric mean of the ratio of correct to error responses following each sample and the geometric mean of the ratio of red to green choices following each sample. These measures can be derived from different theoretical assumptions, but can otherwise be treated as theory-free measures of performance (as is our preference here). The measures were originally proposed by Luce (1963) in the context of choice theory, and as anticipated by Nevin (1969), Davison and Tustin (1978), and Nevin (1981) in the context of behavioral detection theory. From a theory-free perspective, the measures reflect the likelihood that the subject makes correct responses versus errors, or reports one alternative versus the other. By taking ratios, the measurement scale is not constrained in the way that, for example, a proportion scale is bounded by 1.0. The discriminability measure (log d, or log a) is linearly related to the discriminability measure d′ derived from signal detection theory, and satisfies the requirement that both hits and false alarms contribute to the measure of discriminability (Macmillan & Creelman, 1991, pp. 11–13). The discriminability measure is calculated by taking the logarithm (base 10) of the ratio of correct (c) to error (e) responses following each sample (subscripts r and g for red and green samples) and averaging them according to

$$\text{discriminability, } \log d = 0.5 \cdot \log\left[(c_r/e_r)\cdot(c_g/e_g)\right]. \tag{1}$$

The tendency or bias to choose red versus green is calculated by averaging the logarithms of the ratios of choices of red to choices of green following each sample. In Panel B of Figure 1, it is plotted as a function of the red to green reinforcer ratio, according to

$$\log(\text{red/green choices}) = 0.5 \cdot \log\left[(c_r/c_g)\cdot(e_r/e_g)\right] = a \log(R_r/R_g) + c. \tag{2}$$

Consistent with the generalized matching law, the log ratio of red to green choices is a linear function of the log ratio of reinforcers obtained by correct red versus green choices ($R_r$, $R_g$). The slope of the function, a, measures sensitivity of choice to variation in the reinforcer ratio, and c is a constant describing overall (unexplained) preference for one or the other choice alternative.

The result that the biasing effects of the reinforcer ratio depend on the delay suggests that discriminability and bias may not be independent, as otherwise assumed by the standard signal-detection approach or early versions of the behavioral detection approach (e.g., Davison & Tustin, 1978; but see Alsop & Davison, 1991). Both approaches assume separate and independent influences of the discriminative stimuli and factors that generate bias. In the signal-detection model, the location of a decision criterion along the evidence variable (the determinant of bias) is independent of the distance between signal and noise distributions (the determinant of discriminability). In the behavioral detection model, the biasing effects of stimulus disparity on choice are independent of the biasing effects of the reinforcers obtained by the choices. In a more recent version of the behavioral detection model (Alsop & Davison, 1991; see also Nevin et al., 1993, especially Equation 8), although the two free parameters describing stimulus and reinforcer effects are said to be independent, sensitivity to reinforcement, as measured by Equation 2, reflects the joint effects of stimulus difference and differential reinforcement under some conditions. An interesting issue for the behavioral detection model is that it assumes that the choice between two alternatives following one sample is determined by the ratio of reinforcers obtained by correct choices following both samples. That is, it treats the multiple concurrent schedule as if it were a concurrent schedule. McLean and White (1983) have claimed that in multiple concurrent schedules (including detection procedures), the reinforcers obtained by correct choices following one sample do not influence choice following the other sample (with the exception of reallocation of extraneous reinforcers from one time to another). An alternative assumption was made by Nevin (1981) and is consistent with Luce’s (1963) choice theory: The choice following one sample is influenced by the reinforcers obtained by correct choices following that sample relative to the generalized effects of reinforcers obtained by correct choices following the other sample.
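As an illustration of the response measures defined under Response Measures, the following minimal Python sketch computes log d (Equation 1) and the log ratio of red to green choices from the four response counts of a delayed matching-to-sample session, and estimates the slope a and intercept c of Equation 2 by ordinary least squares. The function names and the count arguments (for example `red_after_red`, the number of red choices following a red sample) are hypothetical labels introduced here, not notation from the paper. The bias function follows the verbal definition in the text (the mean of the log red-to-green choice ratios following each sample), and the least-squares fit is an assumed estimation method, since the paper does not state how slopes were fitted.

```python
import math


def log_d(red_after_red, green_after_red, green_after_green, red_after_green):
    """Equation 1: mean log10 ratio of correct to error responses."""
    correct_to_error_red = red_after_red / green_after_red
    correct_to_error_green = green_after_green / red_after_green
    return 0.5 * math.log10(correct_to_error_red * correct_to_error_green)


def log_bias(red_after_red, green_after_red, green_after_green, red_after_green):
    """Mean log10 ratio of red to green choices over the two samples."""
    red_to_green_after_red = red_after_red / green_after_red
    red_to_green_after_green = red_after_green / green_after_green
    return 0.5 * math.log10(red_to_green_after_red * red_to_green_after_green)


def fit_sensitivity(log_reinforcer_ratios, log_choice_ratios):
    """Least-squares estimates of slope a and intercept c in Equation 2."""
    n = len(log_reinforcer_ratios)
    mean_x = sum(log_reinforcer_ratios) / n
    mean_y = sum(log_choice_ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in log_reinforcer_ratios)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(log_reinforcer_ratios, log_choice_ratios))
    a = sxy / sxx
    c = mean_y - a * mean_x
    return a, c


# Hypothetical counts: 80 correct and 20 errors after each sample.
# Note: a zero count in any cell would raise ZeroDivisionError or a math
# domain error; how such cells are handled is left open in this sketch.
print(log_d(80, 20, 80, 20), log_bias(80, 20, 80, 20))
```

With symmetric counts such as these, the two correct-to-error ratios are both 4, so log d equals log 4, while the red-to-green ratios (4 and 0.25) cancel and the bias measure is zero. Fitting `fit_sensitivity` separately at each delay would give the slopes summarized in Panel C of Figure 1.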
A Criterion-Free Model of Remembering
We model the interaction between discrim-
inability and the biasing effects of reinforcers
by assuming that the individual chooses be-
tween available response options on the basis
of which is more likely to be reinforced in a
given instance. The model may be character-
ized as a behavioral theory of remembering,
in that choice is directly determined by the
reinforcer ratio and no decision criterion is
assumed.
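A minimal Python sketch of this criterion-free choice rule is given here as an illustration only. It borrows details that the paper introduces in the subsections that follow: normal discriminal processes with means of −D/2 (green) and +D/2 (red), reinforcer probabilities for correct red and green choices, and a choice probability equal to the proportion of the two reinforcer-weighted heights at the sampled value x. The function names, the argument names (`D`, `s_r`, `s_g`, `p_r`, `p_g`), the seeded random generator, and the handling of numerical underflow are assumptions of this sketch. The paper reports using logistic approximations to the normal in practice, whereas the sketch uses the normal density directly.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Height of a normal density with the given mean and standard deviation."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def p_choose_red(x, D, s_r, s_g, p_r, p_g):
    """Matching rule: share of the reinforcer-weighted height held by red at x."""
    red = p_r * normal_pdf(x, D / 2.0, s_r)
    green = p_g * normal_pdf(x, -D / 2.0, s_g)
    total = red + green
    if total == 0.0:
        # Both heights underflowed (extreme x); indifference is an assumption.
        return 0.5
    return red / total


def simulate(n_trials, D, s, p_r, p_g, seed=0):
    """Count choices over n_trials with equal standard deviation s for both samples."""
    rng = random.Random(seed)
    counts = {"red_after_red": 0, "green_after_red": 0,
              "red_after_green": 0, "green_after_green": 0}
    for _ in range(n_trials):
        sample = "red" if rng.random() < 0.5 else "green"
        mean = D / 2.0 if sample == "red" else -D / 2.0
        x = s * rng.gauss(0.0, 1.0) + mean  # stimulus effect on this trial
        # No criterion: the choice is graded, with probability set by matching.
        chose_red = rng.random() < p_choose_red(x, D, s, s, p_r, p_g)
        choice = "red" if chose_red else "green"
        counts[choice + "_after_" + sample] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical parameter values, chosen only for illustration.
    print(simulate(5000, D=2.0, s=1.0, p_r=0.3, p_g=0.7))
```

Because the choice probability changes continuously with x, no cut point on the stimulus-effect dimension is needed. Rerunning `simulate` with a larger `s` increases the overlap of the two distributions, which is how the model represents a longer retention interval.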
Distributions of stimulus effect. The effect of the sample stimulus presumably varies from trial to trial. The stimulus effect at the time the choice is required is assumed to be a random value drawn from a normal density function distributed along a dimension of stimulus effect. This dimension may be interpreted as Thurstone’s (1927) psychological continuum (Luce, 1994), and the distributions correspond to Thurstone’s discriminal processes. Consecutive occurrences of a stimulus always involve some variation in their effect on behavior (Green & Swets, 1966), and we assume that this is particularly the case when the stimulus is temporally separated from the behavior. At present we are not committed to the view that the variation in stimulus effect is the result of a psychological process; more simply, the stimulus effect is a reflection of the environment (Fetterman, 1996; White, 1991). The distribution of stimulus effect is specified at the time of remembering. The passage of time weakens the discrimination and contributes to the trial-by-trial variability in stimulus effect. Thus the discriminal processes represent the potential stimulus effect at a given delay interval. This issue is discussed further in the General Discussion section below.

Fig. 2. (A) assumed distributions of stimulus effect for red and green sample stimuli (discriminal processes). (B) decrement in discriminability with increasing delay interval that characterizes the forgetting function. (C) greater variance (and hence greater overlap) of the discriminal processes is illustrated for a long delay interval compared to a short delay.

This assumption is illustrated in Figure 2 (Panel A). The distance D between the means of the discriminal processes is directly related to stimulus disparity. Overall higher levels of discriminability are reflected in larger values of D. The variances of the distributions are usually assumed to be equal, but unequal variances may result from specific manipulations of the attributes of the to-be-remembered events. In studies of human recognition memory, for example, receiver-operating characteristics that relate hit rates to false alarm rates are typically asymmetrical. That is, there are different variances for the normal distributions for new and old items on the familiarity or memory strength dimension (Ratcliff, McKoon, & Tindall, 1994; Yonelinas, 1994).

The assumption depicted in Figure 2 (Panel A) follows the standard detection approach (cf. White & Cooney, 1996; Wixted, 1993), in that the discriminal processes have equal variance and are separated by a distance that is related to stimulus disparity. A specific assumption we make, however, is that the variance of the distributions increases monotonically with increasing delay-interval duration. This assumption is illustrated in Figure 2 (Panel C). Two pairs of discriminal processes are shown, both with the same distance D between their means. The pair with less overlap illustrates the case at a short retention interval. The pair with the greater overlap, owing
to larger standard deviations, illustrates the case at a longer retention interval. Thus the reduction in discriminability with increasing retention-interval duration (Figure 2, Panel B) is the result of a continuous increase in the standard deviations of the discriminal processes as the retention interval lengthens. At this stage it is not necessary to specify the function relating variance to delay interval, because the main aim of the model is to predict an inverse relation between discriminability and sensitivity to reinforcement. That is, the model predicts that reinforcer sensitivity increases as discriminability decreases. The reduction in discriminability can be achieved by increasing the variances of the distributions or by decreasing the distance between their means (or both).

Distributions of reinforcer probability. In experimental procedures, reinforcing consequences follow accurate remembering with a defined probability. Each distribution on the continuum of stimulus effect is associated with a given reinforcer probability. For example, assume that the left distribution in Figure 3 (Panel A) describes the variation in stimulus effect at a given delay interval for the green sample, and assume that the right distribution describes the variation in stimulus effect at the same delay for the red sample. (Note that the abscissa is scaled in z-score units, although the variance of the distributions may differ from 1.0.) Also assume that the reinforcer probability for correctly reporting green is .7 and is .3 for correctly reporting red. The distributions of stimulus effect in Panel A may be translated into distributions of reinforcer probability shown in Figure 3 (Panel B) simply by multiplying the values for the green distribution by .7 and the values of the red distribution by .3. The resulting distributions are distributions of reinforcer probabilities along the dimension of stimulus effect. (Because the probabilities are not required to sum to 1.0, the distributions are not probability density functions.) Each of the possible values of stimulus effect x is therefore associated with a pair of reinforcer probabilities, one for the green distribution, $p(G_x)$, and one for the red distribution, $p(R_x)$.

Fig. 3. Probability distributions of stimulus effect (Panel A) multiplied by reinforcer probabilities of .7 (green) or .3 (red) (Panel B).

Choice determinants. We assume that at a given instant or on a particular trial, the effect of a prior event is represented by a stimulus value x randomly determined by either of the discriminal processes in Panel A of Figure 3. The individual’s task is to choose green or red on each trial. Assuming that the individual has prior experience in the procedure, for a particular value of x choices of red have been reinforced with probability $p(R_x)$, and choices of green have been reinforced with probability $p(G_x)$. We suppose that on each trial (i.e., for each x) the relative tendency to choose red versus green is directly determined by the ratio of probabilities of reinforcers, $p(R_x)/p(G_x)$. This assumption follows the matching law, according to which choice proportions match reinforcer proportions (Herrnstein, 1970). Our application of the matching assumption differs from the usual use of the matching law, however, in that we predict the tendency to choose one of two alternatives on a single trial on the basis of relative reinforcers accumulated over a large number of prior trials.

It should be noted that unlike various versions of detection theory, our model does not incorporate the notion of a decision criterion, and, indeed, could be said to be free of decision rules in that the response tendency is determined directly by relative reinforcer frequency. In signal-detection models, the x axis can be interpreted as a ratio of likelihoods that either signal may have been presented (Macmillan & Creelman, 1991). A decision criterion is established at a specific likelihood ratio, which remains constant over all trials
(within the bounds of criterion variance). For values of x greater than the decision criterion, one response alternative is chosen, and for values of x smaller than the criterion, the other alternative is chosen. In the present model, choice is not related to the likelihood ratio because it depends on the current ratio of reinforcer probabilities that change with x. The probability of choosing one versus the other alternative varies over the x axis and is not all or none, as in the detection models. Whereas the adoption of a constant decision criterion in the signal-detection models results in the derivation of independent measures of discriminability and bias, the assumption that choice is determined by the reinforcer ratio for a given value of x in the present model allows the prediction that bias and discriminability interact.

Separate measures of discriminability and bias can be derived from the model, which simulates events in a delayed matching-to-sample procedure on a trial-by-trial basis. The model generates frequencies of correct and error responses separately for trials with red and green prior events as a function of the relative reinforcer frequencies that vary over trials as a result of varying the values of stimulus effect.

Predictions from the Model

Predictions from the model were generated by running many simulations, each for 5,000 trials of a delayed matching-to-sample procedure. The discriminal processes are defined by normal (or, in practice, logistic approximations to normal) distributions with variances $S_r^2$ and $S_g^2$ and means of $-0.5D$ (green) and $+0.5D$ (red). The parameter D is disparity on the stimulus effect dimension. To model an experimental procedure, initially a red or green sample stimulus is chosen with a particular probability (usually .5). For the chosen sample, say red, a value x is randomly selected from the associated distribution. In the simulation, x is specified in z-score units, so that $x = S_r \cdot z_i + D/2$, or if the green distribution is selected, $x = S_g \cdot z_i - D/2$, where $z_i$ is a randomly selected z score, that is, a value randomly selected from a normal distribution with a mean of 0 and a standard deviation of 1.

To summarize thus far, on each trial a red or green sample is chosen with a signal presentation probability of .5, and a value x is randomly sampled from the stimulus effect dimension according to the probability density function defined for each sample stimulus. The reinforcer distributions are defined by multiplying the discriminal processes by the reinforcer probabilities associated with the distributions. For a given value of x, the proportion of the heights of the resulting distributions (at x) directly predicts the probability of choosing red versus green on that trial. The heights (or probabilities) are given by $p(R_x) = p_r \cdot N(D/2, S_r)$ and $p(G_x) = p_g \cdot N(-D/2, S_g)$. These distributions represent the probabilities that a reinforcer is arranged on the red and green choice alternatives for a given value of x. For example, assume that $p_r$ (the probability of reinforcement for a correct red response) and $p_g$ (the probability of reinforcement for a correct green response) are both 1.0 (cf. Figure 3, Panel A). Further assume that x on a given red trial happened to equal 0 (i.e., x on this trial happened to fall at the intersection of the two distributions). In the past for that value of x, a reinforcer was as likely to be set up on red as on green, so an unbiased bird would be equally likely to choose either alternative. On the next red trial, imagine that x equaled D/2 (i.e., x fell at the mean x value for red trials). Note that, at that point, the height of the red distribution exceeds that of the green distribution by about 6 to 1. That is, in the past, this value of x has occurred on red trials about six times as often as green trials. Because $p_r$ and $p_g$ are both 1.0, this means that, for this particular value of x, a reinforcer is six times as likely to be arranged on the red choice alternative. Thus, according to the matching law, an unbiased bird would be about six times as likely to choose red over green given this value of x.

The process works the same way when $p_r$ and $p_g$ are unequal. For example, assume that $p_r = .3$ and $p_g = .7$. Multiplying both distributions by these values yields the distributions shown in Figure 3 (Panel B). Assume once again that on a given red trial the value of x happened to equal 0. Although that value is as likely to occur on a red trial as on a green trial (as before), it is no longer the case that a reinforcer is as likely to be arranged on the red choice alternative as on the green choice alternative. Instead, as illustrated in Figure 3, a reinforcer is now about twice as likely to be arranged on the green alternative as on the red alternative. This can be most easily appreciated by noting that the height of the green distribution at 0 is about twice the height of the red distribution (Figure 3, Panel B). According to the matching law, an unbiased bird will, under these conditions, be twice as likely to choose green as red. Note that the indifference point (i.e., the value of x that yields indifference between red and green) has shifted to the right. A bird that was previously indifferent between red and green at an x of 0 (when $p_r$ and $p_g$ both
equaled 1.0) and that is now indifferent between red and green when x is about +0.5
(the indifference point when pr and pg equal
.3 and .7, respectively) has exhibited some sensitivity to reinforcement. As described in more detail later, the degree to which that indifference point shifts with changes in the scheduled probabilities of reinforcement is directly related to D and the standard deviations of the distributions. In general, the greater the overlap between the distributions by virtue of smaller D or larger standard deviations, the more a given change in pr and pg changes the indifference point. This is another way of saying that the model predicts that sensitivity to reinforcement will be inversely related to discriminability. The experiments described later provide a test of that prediction.
For the first 100 trials of the simulation, the reinforcer frequencies associated with that value of x are given by the programmed probabilities. However, for later trials, the obtained reinforcer proportions (in the simulation) were used to compute reinforcer probabilities. That is, for a particular value of x, the choice of red or green on that trial was predicted by the ratio of obtained reinforcers for that value of x on previous trials. If, for example, x equals 1, and the number of obtained reinforcers for choosing red and green in the past for an x of 1 equaled 20 and 40, respectively, then on the current trial the probability of choosing red would be 20/(20 + 40), or .33. That is, the simulated birds were assumed to match.
Once the choice response, red or green, is selected, reinforcer occurrence is determined by whether the sample stimulus on that trial was red or green and whether a reinforcer had been set up on a probabilistic basis. Accordingly, the result of a particular trial would include the sample selected, the choice response made, and whether a reinforcer was obtained, just as in a standard experimental procedure. The values of discriminability, log d, obtained from the simulations for different values of the standard deviations with D = 1 (top panel) and for different values of the disparity parameter, D, with Sr = Sg = 1 (bottom panel) are shown in Figure 4. Whereas the change in predicted discriminability with increasing standard deviation tends to mimic a forgetting function, the increase in discriminability with increasing D is virtually linear.

Fig. 4. Values of discriminability predicted by the model as a function of increasing standard deviation with disparity D set at 1 (top panel) and as a function of increasing disparity with standard deviations set at 1 (bottom panel).

Figure 5 shows the results of two sets of simulations, conducted as described above, and just as if an actual delayed matching-to-sample procedure was being conducted. For one set, the disparity parameter was set at D = 4, as might be the case for an overall easier discrimination. For the other set, D was set at 1, as might be the case for a harder discrimination. The standard deviation values were Sr = Sg = 1. For both sets, 12 pairs of reinforcer probabilities for correct red and green choices were varied over values ranging from .9, .1 to .1, .9. From the 5,000 trials of each simulation, frequencies of reinforcers for correct red and green choices were used to calculate log ratios of red to green reinforcers. Corresponding frequencies of red and green choices on red-sample and green-sample trials were used to calculate the mean of the log ratio of red to green choices following the red sample and the log ratio of red to green choices following the green sample. This measure is the same as the bias measure proposed by Luce (1963) and Davison and Tustin (1978), and is described by Equation 2 above. The log ratio of choices from the simulations was satisfactorily described by a linear function of the log ratio of reinforcers, consistent with the generalized matching law (Baum, 1974). That is, the present model predicts a power function relation between choice and reinforcer ratios.

Fig. 5. Predictions of the model for low (D = 1) and high (D = 4) levels of disparity and constant standard deviations, for log ratios of red to green choices as a function of log ratios of red to green reinforcers obtained by the correct choices.

Three aspects of the predictions in Figure 5 are noteworthy. First, the present model predicts undermatching in the conditional discrimination, on the basis of the assumption that the simulated bird matches choice proportions to reinforcer proportions at each value of x on the stimulus effect continuum. Second, with decreasing distance between the discriminal processes (decreasing D), the reinforcer ratios are predicted to cover a wider range (despite entering the same values of programmed reinforcer probabilities for each D). Third, the function for the easy discrimination is flatter than the function for the hard discrimination. That is, sensitivity of the choice ratio to changes in the reinforcer ratio, as given by the slope of the straight line, is smaller for the easy discrimination.

Fig. 6. The relation between sensitivity to reinforcement predicted by the model and discriminability predicted by the model, for instances when changes in both were produced by varying the standard deviation with D = 1 (top panel) and varying the disparity parameter D, with SD = 1 (bottom panel).

Predicting the Interaction Between Discriminability and Bias
Figure 6 shows predictions from the model for sensitivity to reinforcement and discriminability, one plotted as a function of the other. These results were obtained in the same way as for Figure 5 but with two pairs of reinforcer probabilities (pr and pg) for each instance of standard deviation and D (pr/pg = .2/.8 or .8/.2). The top panel shows the inverse relation between reinforcer sensitivity and discriminability when standard deviations were varied over 14 values with D = 1. The bottom panel shows the inverse relation obtained by varying D over 17 values with the standard deviations of the discriminal processes set at 1. Each 5,000-trial simulation generated a matrix of obtained response and reinforcer frequencies, which were used to calculate log ratios of red to green reinforcers and log ratios of red to green choice responses. Reinforcer sensitivity was estimated from the slope of the function relating ratios of
Fig. 7. The relation between sensitivity to reinforcement and discriminability at different delay intervals for in-
dividual birds (shown by different symbols) in the study by Jones and White (1992) and the regression line for the
predictions from the present model (from Figure 6, top panel).
choices to ratios of reinforcers (cf. Figure 5). Discriminability was calculated from the log ratios of correct to error responses (Davison & Tustin, 1978; Luce, 1963; McMillan & Creelman, 1991). Figure 6 shows the inverse and virtually linear relation between reinforcer sensitivity and discriminability predicted by the model. The results in the top panel, obtained when the standard deviations were increased, illustrate the pattern expected when retention-interval duration is increased. This is the interaction evident in the data reported by Jones and White (1992). The results in the bottom panel are what might be expected when other factors that influence overall task difficulty are manipulated.
Figure 7 shows the data for individual birds from Jones and White (1992) along with the regression line that best fits the predictions from our simulation for standard deviation varied with the disparity parameter arbitrarily set at 1 (shown in the top panel of Figure 6). The predictions are generally consistent with the data.

Sources of Discriminability
An important assumption of the model is that the distance D between the means of the discriminal processes is influenced by several sources of discriminability. These include the wavelength disparity (for example) of the sample stimuli. Retention-interval duration also influences predicted discriminability, but by increasing the variances of the distributions. Other factors that influence discriminability are the fixed-ratio requirement for sample-key responding and sample-presentation duration (White, 1985). We now report a set of experimental conditions for pigeons working in a delayed matching-to-sample procedure in order to examine further the inverse relation between discriminability and the biasing effects of the reinforcer differential. In all cases, retention-interval duration was varied in order to generate further evidence for the inverse relation predicted when standard deviations of the discriminal processes are increased. In other conditions, factors influencing discrimination difficulty were included in order to assess the relation predicted when the distance between the means of the discriminal processes changes. Reinforcement rate was determined by the independent probability that a correct choice was followed by a reinforcer. Correct choices that were not followed by reinforcers had no other consequence, consistent with the typical method for arranging delayed matching-to-sample procedures and successive discriminations. In the latter, for example, when responses in one component are reinforced at variable intervals, nonreinforced responses have no other consequence that distinguishes them from the nonreinforced responses in the other component in which extinction may be arranged. In some very few studies in which all correct choices in delayed matching to sample are followed by hopper illumination and only some also produce grain (McCarthy & Davison, 1991), hopper illumination could serve as a conditional reinforcer, and its probability could be confounded with the probability of the food reinforcer.

EXPERIMENT 1: ABSOLUTE RATE OF REINFORCEMENT
In the signaled magnitude effect described by Nevin and Grosch (1990), accuracy is overall higher on trials in which larger reinforcers follow correct choices than on trials in which small reinforcers follow correct choices. The two types of trials are differentially signaled. The result was confirmed by McCarthy and Voss (1995) and Jones, White, and Alsop (1995). Our generalization of the result is that accuracy may be overall higher when higher absolute rates of reinforcement are arranged in delayed matching to sample.
We compared the effects of overall rich versus overall lean reinforcement rate in a delayed matching-to-sample procedure. We asked whether we could replicate the effect reported by Jones and White (1992) and predicted by our model, in which sensitivity to reinforcement increased with increasing delay-interval duration, and whether the inverse relation between reinforcer sensitivity and discriminability was generalizable to the different performance levels generated by the manipulation of absolute reinforcer rate.

METHOD
Subjects
Three adult homing pigeons with prior experience in delayed matching-to-sample procedures were maintained within 12 g of 80% of their free-feeding body weights by supplementary feeding with mixed grain following experimental sessions. Water and grit were freely available in the home cages. The colony room was naturally illuminated with a photoperiod of approximately 14:10 hr.

Apparatus
A light- and sound-attenuating experimental chamber, 33 cm wide, 33 cm deep, and 34 cm high, was painted matte black. A ventilation fan provided masking noise. Three translucent response keys, each 2.9 cm in diameter, were mounted on one wall 9.2 cm center to center and 25.6 cm above the grid floor. Each key could be lit red or green and could be operated by a minimum force of 0.15 N. A central hopper opening below the center key and 5 cm from the grid floor allowed 3-s access to wheat. The only illumination in the chamber was provided by red or green on center or side keys or by a white hopper light during 3-s wheat presentations.

Procedure
Sessions were conducted 7 days per week. Each session consisted of 80 trials, separated by 15-s intertrial intervals (ITIs) during which the chamber was dark and responses were ineffective. On each trial, a fixed ratio (FR) of five responses to a red or green sample stimulus presented on the center key terminated the sample and initiated a dark delay period. The delay lasted for 0.2 s, 1 s, 4 s, or 12 s, with delay durations mixed within sessions. Presentation of red and green comparison stimuli on side keys followed the delay. A single choice response to one of the side keys darkened both keys. Correct (matching) choices produced 3-s access to grain with a given probability. Unreinforced matching responses and incorrect choices produced 3-s blackout periods, followed by the dark ITI. The order of red and green sample stimuli, whether the red and green comparison stimuli appeared on the left or the right, and the order of delay intervals were random within each session, with the constraint that each combination of sample, comparison-stimulus location, and delay occurred equally often and with no more than three consecutive trials with the same sample stimulus.
Reinforcer probabilities were fixed at certain values for each of 20 sessions per condition. In five conditions, the probabilities were overall rich, and in five conditions they
Table 1
Reinforcer probabilities for correct choices of red and green comparison stimuli. Twenty sessions were conducted for each condition.

Condition   Order   Red     Green
Rich FR 5 15-s ITI
    1         1     .3      .9
    2         2     .9      .3
    3         3     .6      .6
    4         9     .96     .24
    5        10     .24     .96
Lean FR 5 15-s ITI
    6         4     .1      .3
    7         5     .3      .1
    8         6     .2      .2
    9         7     .075    .3
   10         8     .3      .075
FR 1 15-s ITI
   11        11     .3      .075
   12        12     .075    .3
   13        13     .3      .1
   14        15     .1      .3
   15        14     .2      .2
FR 1 1-s ITI
   16        16     .3      .075
   17        17     .075    .3
   18        18     .3      .1
   19        19     .1      .3

Fig. 8. Discriminability as a function of delay-interval duration for rich and lean reinforcement-rate conditions in Experiment 1.
were overall lean. The probabilities are shown in Table 1, which indicates that in some conditions, correct red choices were reinforced with higher probabilities than were correct green choices, and vice versa for other conditions. Table 1 shows the reinforcer probabilities for all experiments reported in the present paper; Conditions 1 to 10 contributed to Experiment 1.

RESULTS
Data analyses were based on frequencies of responses and reinforcers obtained at each delay summed over the last eight sessions per condition. Eight sessions allowed a maximum response frequency of 80 correct responses in the cells of the 2 × 2 matrix for red and green responses following red and green sample stimuli at each delay, for each reinforcer-ratio condition. Figure 8 shows that when responses were pooled over the five different reinforcer conditions, discriminability (log d, calculated according to Equation 1) decreased with increasing delay, and was overall higher when the overall reinforcer rate was rich.
The functions in Figure 8 are standard forgetting functions, with a clear reduction in discriminability with increasing delay duration (White, 1985). For the pooled data for each bird at each delay, the frequencies of correct red and correct green responses were each higher in the rich reinforcement conditions than in lean, and the frequencies of red and green error responses were each lower in rich than in lean conditions (except for Bird S3 at the two shortest delays). Accordingly, for each bird, the forgetting function for the lean reinforcement conditions was consistently lower than that for the rich conditions, although the difference was small. A similarly small improvement in overall accuracy as a result of increasing overall reinforcement probability has been reported by Blough (1998) for a matching-to-sample procedure with no delays and categories of hues as samples.
The effect of variation in the reinforcer probability ratio is shown in Figure 9 (for the rich reinforcement conditions) and Figure 10 (for the lean reinforcement conditions). Here, the log ratio of red to green choices provides a measure of the tendency to choose red versus green, and was calculated according to Equation 2. For both rich and lean reinforcer probabilities, the same effect as was reported by Jones and White (1992) is clear in Figures 9 and 10. At the short delays, the slopes of the matching lines are relatively flat, and at the long delays they are steep.

Fig. 9. Log ratio of red to green choices as a function of log ratio of red to green reinforcers for the rich reinforcer-rate conditions of Experiment 1, as a function of delay interval. Values of the parameter estimates for the slope (reinforcer sensitivity) and intercept are shown.

Fig. 10. Log ratio of red to green choices as a function of log ratio of red to green reinforcers for the lean reinforcer-rate conditions of Experiment 1, as a function of delay interval. Values of the parameter estimates for the slope (reinforcer sensitivity) and intercept are shown.

The slopes, which provide an index of sensitivity to reinforcement, are compared in Figure 11 for rich and lean reinforcement conditions. With the exception of the lean
condition for Bird S1, as the delay lengthens,
reinforcer sensitivity increases. This result is
consistent with an inverse relation between
reinforcer sensitivity and discriminability.
There was no consistent difference in rein-
forcer sensitivity between rich and lean con-
ditions, as indicated by the overlapping stan-
dard error bars in Figure 11. In the context
of the inverse relation between reinforcer
sensitivity and discriminability that is being
explored here, this result is perhaps not sur-
prising in view of the small difference in dis-
criminability between the rich and lean con-
ditions. In addition, although the general
outcome of the model is an inverse relation
between reinforcer discriminability and sen-
sitivity, the model predicts that variation in
the absolute reinforcer rate should have no
effect on sensitivity because the same propor-
tions of obtained reinforcers are maintained
for rich and lean conditions.
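The two dependent measures used in these analyses can be sketched in a few lines. The sketch below is a minimal illustration, not the analysis code used for the experiments. Equations 1 and 2 are not reproduced in this section, so the functions assume the standard Davison and Tustin (1978) forms: log d as half the log of the product of the two correct-to-error ratios, and the log choice ratio as the mean of the log red-to-green choice ratios following red and green samples. The function names, the argument order, and the use of an ordinary least-squares fit for the slope are assumptions made for illustration, and all four cell frequencies are assumed to be nonzero.

```python
import math

def log_d(red_correct, red_error, green_correct, green_error):
    """Discriminability: half the log10 of the product of correct/error ratios."""
    return 0.5 * math.log10(
        (red_correct * green_correct) / (red_error * green_error)
    )

def log_choice_ratio(red_correct, red_error, green_correct, green_error):
    """Mean log10 ratio of red to green choices over the two sample types.

    After a red sample, red choices are correct and green choices are errors;
    after a green sample, red choices are errors and green choices are correct.
    """
    after_red = math.log10(red_correct / red_error)
    after_green = math.log10(green_error / green_correct)
    return 0.5 * (after_red + after_green)

def fit_line(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept).

    With xs as log reinforcer ratios and ys as log choice ratios, the slope
    is the estimate of reinforcer sensitivity.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x
```

Applied to the 2 × 2 response matrix at each delay, one condition contributes one point (log reinforcer ratio, log choice ratio), and fitting a line across the conditions gives a slope and intercept of the kind reported in Figures 9 and 10.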
Fig. 11. Sensitivity to reinforcement (values of the slopes in Figures 9 and 10) as a function of delay-interval duration for rich and lean reinforcer-rate conditions of Experiment 1.

EXPERIMENT 2: SAMPLE RATIO REQUIREMENT
In Experiment 2 we examined the interaction between reinforcer sensitivity and the change in discriminability caused by decreasing the sample-key ratio requirement. A standard result is that discriminability is overall higher at all delay intervals when more responses are required to the sample stimulus (Roberts, 1972; White, 1985). The same 3 birds and the same procedure as for the lean conditions in Experiment 1 were used. An additional five reinforcer probability conditions were conducted, but with a ratio requirement of FR 1 for sample-key responding. Compared to the function for the FR 5 sample requirement in Experiment 1, an overall lower level of discriminability was expected for the FR 1 conditions, together with an overall higher level of reinforcer sensitivity.

METHOD
The 3 birds of Experiment 1 continued under the same procedural conditions as in the lean reinforcer probability conditions of Experiment 1, but with just a single response to the red or green sample stimulus being required to initiate the delay interval. Table 1 shows the order in which Conditions 11 to 15 were conducted. As for the earlier conditions, each was conducted for 20 sessions. Analyses were based on response and reinforcer frequencies summed for the last eight sessions of each condition.

RESULTS
Figure 12 shows the forgetting functions for the FR 1 and FR 5 conditions, for data pooled over the five reinforcer conditions in each set for each bird. The result shows the typical overall reduction in discriminability that results from decreasing the sample-key ratio requirement (Roberts, 1972; White, 1985). Figure 13 shows that the matching law functions for the FR 1 conditions behaved in very much the same way as for the FR 5 lean conditions in Experiment 1 (Figure 10). The slopes of the functions tended to increase with increasing delay-interval duration.
Figure 14 summarizes the change in reinforcer sensitivity over delay-interval duration for the FR 1 and FR 5 conditions. With the exception of the 0.1-s and 4-s delays for Bird S1 and the 12-s delay for Bird S2, reinforcer
sensitivity at the different delays is lower for

the FR 5 conditions than for the lower dis-
criminability FR 1 conditions. The tendency
for reinforcer sensitivity to increase with in-
creasing delay duration, which was reported
by Jones and White (1992) and was observed
in the two sets of conditions in Experiment
1, is just as apparent for the FR 1 conditions
of Experiment 2 for each of the 3 birds.
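The direction of this trend follows from the model if longer delays are represented by larger standard deviations of the discriminal processes. The sketch below is a simplified illustration under stated assumptions, not the trial-by-trial simulation described earlier: it applies the matching law at each value of x to the arranged (programmed) reinforcer ratio rather than to obtained reinforcer counts, replaces random sampling over 5,000 trials with numerical integration over x, fixes D at 1, and measures sensitivity against the arranged reinforcer ratio for the .8/.2 and .2/.8 pairs. All function names and the integration settings are hypothetical.

```python
import math

def normal_pdf(x, mean, sd):
    """Height of a normal distribution with the given mean and SD at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def choice_probs(D, sd, p_red, p_green, n=4001):
    """Return P(choose red | red sample) and P(choose red | green sample)."""
    lo, hi = -D / 2 - 8 * sd, D / 2 + 8 * sd  # assumed integration range
    step = (hi - lo) / (n - 1)
    after_red = after_green = 0.0
    for i in range(n):
        x = lo + i * step
        f_red = normal_pdf(x, D / 2, sd)
        f_green = normal_pdf(x, -D / 2, sd)
        # Matching law at this x, using the arranged reinforcer ratio
        # (a simplification of the obtained-reinforcer rule in the text).
        choose_red = p_red * f_red / (p_red * f_red + p_green * f_green)
        weight = step * (0.5 if i in (0, n - 1) else 1.0)  # trapezoid rule
        after_red += weight * f_red * choose_red
        after_green += weight * f_green * choose_red
    return after_red, after_green

def log_odds(p):
    """Log10 odds of a probability."""
    return math.log10(p / (1.0 - p))

def log_d(D, sd):
    """Discriminability with equal reinforcer probabilities for red and green."""
    a, b = choice_probs(D, sd, 1.0, 1.0)
    return 0.5 * (log_odds(a) - log_odds(b))

def sensitivity(D, sd, p_hi=0.8, p_lo=0.2):
    """Slope of log choice ratio against log arranged reinforcer ratio."""
    def log_choice(p_red, p_green):
        a, b = choice_probs(D, sd, p_red, p_green)
        return 0.5 * (log_odds(a) + log_odds(b))
    rise = log_choice(p_hi, p_lo) - log_choice(p_lo, p_hi)
    run = math.log10(p_hi / p_lo) - math.log10(p_lo / p_hi)
    return rise / run

# Larger SD stands in for a longer retention interval (an assumption here).
for sd in (0.5, 1.0, 2.0, 3.0):
    print(sd, round(log_d(1.0, sd), 3), round(sensitivity(1.0, sd), 3))
```

Under these assumptions, discriminability falls and sensitivity rises as the standard deviation grows, which is the qualitative pattern described for increasing delay. Because the sketch uses arranged rather than obtained reinforcer ratios, its numerical values should not be expected to match those of the simulations reported in Figure 6.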
EXPERIMENT 3:
INTERTRIAL INTERVAL
DURATION
An inconsistency between our result and
model predictions and some data reported by
McCarthy and Davison (1991) and McCarthy
and Voss (1995) has left us with an interesting
puzzle. They reported that reinforcer sensitiv-
ity decreased with increasing delay-interval du-
ration. McCarthy and Davison varied rein-
forcer probability over three values in a
delayed matching procedure in which the
samples were different brightnesses. For delays of 0 s, 1 s, 3 s, and 25 s, respectively, reinforcer sensitivities averaged .75, .52, .56, and .37 (with standard errors of about .09). This result is not consistent with those reported by Jones and White (1992) and in the present Experiments 1 and 2, and is inconsistent with the expectation that when control by the conditional discrimination is minimal, choice should be governed by the reinforcer ratio (cf. Wixted, 1989).

Fig. 12. Discriminability as a function of delay-interval duration for conditions with sample ratio requirements of FR 5 (from Experiment 1) and FR 1 (Experiment 2).

Fig. 13. Log ratio of red to green choices as a function of log ratio of red to green reinforcers for the FR 1 ratio-requirement condition of Experiment 2, as a function of delay interval. Values of the parameter estimates for the slope (reinforcer sensitivity) and intercept are shown.

Fig. 14. Sensitivity to reinforcement (values of the slopes in Figures 10 and 13) as a function of delay-interval duration for FR 1 and FR 5 sample ratio-requirement conditions in Experiments 1 and 2.

In the study by McCarthy and Davison (1991), each condition was conducted with just one delay interval in each session (whereas delays were mixed within sessions in the present study). In addition, they employed a reinforcement procedure in which a reinforcer for a correct choice on one alternative could not be obtained until the reinforcer that had been set up for a correct choice on the other alternative had been obtained in a previous trial. This procedure resulted in a reduction in total obtained reinforcers with increasing delay, where total reinforcers obtained at the 25-s delay were about half those at the three shorter delays. Furthermore, Jones and White (1992) have shown that the reinforcement scheduling procedure, perhaps along with the use of just one delay per session, results in a large response bias for choice between left and right keys.
The procedure used by McCarthy and Voss (1995) was more similar to that used by Jones and White (1992), except for (a) use of the procedure to control the ratio of obtained reinforcers, (b) presentation of the hopper light following every correct response, and (c) the use of a short ITI, among other differences. Their general result was consistent with that reported by McCarthy and Davison (1991), namely a decrease in reinforcer sensitivity with increasing delay. In some instances for long delays when there was zero discriminability, there was also zero reinforcer sensitivity. That is, variation in the reinforcer ratio for correct choices had no effect on the choice, a puzzling result. In the experiments by McCarthy and her colleagues, there were generally low levels of discriminability, compared to those reported by Jones and White (1992). In the experiments by McCarthy and her colleagues, short intertrial intervals were used, which are known to lower discriminability (Edhouse & White, 1988; Roberts, 1972; White, 1985). Therefore, in Experiment 3 we compared the effect of a 1-s ITI to the effect of a 15-s ITI using the same procedure as for Experiment 2.

METHOD
The subjects, apparatus, and procedure were the same as those in Experiment 2 with the FR 1 sample ratio requirement except that the intertrial interval was reduced to 1 s. Four ratios of reinforcer probabilities were used. The reinforcer probabilities and the order of conditions are given in Table 1. Twenty sessions were conducted for each condition. Analyses were based on response and reinforcer frequencies summed over the last eight sessions for each condition.

RESULTS
Figure 15 shows that the usual ITI effect on the forgetting functions was obtained. The functions with the 1-s ITI exhibit overall low discriminability, compared to the functions for the 15-s ITI from Conditions 11 through 15 of Experiment 2. The matching law functions in Figure 16 are systematic, but compared to those in Experiments 1 and 2, exhibit steep slopes at all delays.
Figure 17 compares the change in reinforcer sensitivity with increasing delay-interval duration for the 1-s ITI conditions to the function for the corresponding 15-s ITI conditions. Whereas the function for the 15-s ITI
increases, the function for the 1-s ITI decreas-

es. We therefore suppose that short ITIs may
be responsible for the high reinforcer sensi-
tivity at short delays seen here and in the data
of McCarthy and Voss (1995) and McCarthy
and Davison (1991). Note, however, that at
longer delays reinforcer sensitivity remains
high, whereas in the data reported by Mc-
Carthy and Voss and McCarthy and Davison
it drops to near-zero levels. That is, even with
a short ITI we were unable to produce the
result seen in the studies by McCarthy and
Voss and McCarthy and Davison.
The overall high levels of reinforcer sensi-
tivity in Experiment 3 are predicted by the
present model in that discriminability was
overall very low. But in order to predict the
high sensitivity at the shortest delays, an ad-
ditional assumption is needed. One possibil-
ity is that with short delays, short ITIs, and
very low discriminability levels, there may be
a stronger proactive effect of the reinforced
response from the preceding trial. This situation was modeled by running simulations as above in order to generate a set of reinforcer sensitivities that corresponded to the similarly small discriminability values obtained for the 1-s ITI condition. The additional assumption was incorporated by amplifying the tendency to make a choice on trials preceded by a reinforcer, in the direction of the previously reinforced response, by an arbitrarily chosen factor of five. (By comparison, the level of left-right bias at long delays in the study by McCarthy & Davison, 1991, ranged from 1.26 to 8.91, and in the study by McCarthy & Voss, 1995, they ranged from 1.15 to 5.99.) The resulting predictions are shown as triangles along with the mean data in Figure 17. The predicted values are satisfactorily close to the obtained values for reinforcer sensitivity for the 1-s ITI condition. Our model therefore copes well with the present data.

Fig. 15. Discriminability as a function of delay-interval duration for ITIs of 15 s (Experiment 2 data) and 1 s (Experiment 3 data).

Fig. 16. Log ratio of red to green choices as a function of the log ratio of red to green reinforcers obtained for correct choices as a function of delay, for the 1-s ITI condition of Experiment 3.

Fig. 17. Sensitivity to reinforcement as a function of delay interval for conditions with 15-s ITIs (Experiment 2) and 1-s ITIs (Experiment 3). Triangles in the panel for the mean data are the predictions from the model.

SUMMARY OF RESULTS
The main approach that we took to evaluating the present model was to vary the level of discriminability in delayed matching to sample by varying delay-interval duration and other factors, and to ask whether there was an inverse relation between reinforcer sensitivity and discriminability as predicted by the model. In order to summarize the relation between reinforcer sensitivity and discriminability for Experiments 1 and 2, in which there was clear variation in discriminability, Figure 18 shows values of reinforcer sensitivity for each delay and each of the three sets of conditions, plotted against the corresponding values of discriminability (12 points per bird). The predicted straight-line function included in each panel is exactly the same function as in Figure 7, in which the data from the study by Jones and White (1992) were plotted. That is, the same predicted function is shown with the data for the different birds and the three different procedures in the present study. Using the straight line in Figure 18 to predict values for reinforcer sensitivity, the slopes of the regression (through the origin) relating obtained to predicted values were 1.03, 0.90, and 1.07 for Birds S1, S2, and S3, respectively. The results of Experiment 3 were not included in Figure 18 because the lack of variation in discriminability did not contribute to assessment of the inverse relation predicted by the model. Nevertheless, the overall high levels of reinforcer sensitivity in Experiment 3 (Figure 17) are consistent with the present model in that they are expected at low discriminability levels. When the data from Experiment 3 are plotted in Figure 18, they cluster at the lowest levels of discriminability, and around the line predicted by the model. Because the same predicted function as in Figure 7, generated by varying standard deviations with D arbitrarily fixed at 1, is drawn for the different birds in Figure 18, the correspondence of the data to the predicted function provides convincing evidence that our model successfully predicts the interaction between reinforcer sensitivity and discriminability.

Fig. 18. The relation between reinforcer sensitivity and discriminability in Experiments 1 and 2. For each bird, individual data points are shown for the four delay intervals for each of four sets of conditions combining FR 1 or FR 5 with rich or lean reinforcer rates and 1-s or 15-s ITIs. The straight line is the prediction of the model taken from Figure 6 (top panel) and also drawn in Figure 7.

GENERAL DISCUSSION
To summarize the main point of the model we propose here, the individual’s trial-by-trial

inforcement history. Thus, instead of two physical stimuli, the model takes account of a whole range of effective stimuli that happen to be normally distributed. But the model is otherwise just the familiar matching law. The model predicts that the effect of the reinforcer ratio in biasing choices is amplified by decreasing the level of discriminability that occurs with longer retention intervals. For example, if the distributions overlap extensively, variations in the arranged reinforcer ratio result in large changes in obtained reinforcer ratios and hence choice ratios across the range of stimulus effect values. But with virtually nonoverlapping distributions, obtained reinforcer ratios and hence choice ratios uniformly favor red for x values under the red distribution and uniformly favor green under the green distribution, relatively independently of variation in the arranged reinforcer ratios. In the experiments reported here, there was clear evidence for this prediction, and also for the more general prediction that factors that decrease overall levels of discriminability increase sensitivity to reinforcement.
Our behavioral theory of remembering
tendency to choose one or the other com- bears obvious similarities to signal-detection
parison stimulus in a delayed matching pro- theory, but it differs in one critical respect.
cedure is determined by the relative proba- Detection theory assumes that the individual
bility of previously obtained reinforcers arrives at a decision by setting a criterion
associated with the value x of the stimulus ef- somewhere along the stimulus effect contin-
fect on the current trial. The value x is the uum. If x on a given trial exceeds that crite-
stimulus effect at the time of remembering. rion, the pigeon chooses red; otherwise,
The variation in stimulus effect is defined in green is selected. According to this account,
terms of a pair of hypothetical discriminal every trial involves a decision on the part of
processes or distributions. With increasing re- the bird (namely, does x exceed the criterion
tention-interval duration, the variances of the or not?). The decision criterion plays no role
distributions (discriminal dispersions) in- in the present nonmediational theory of re-
crease. Discriminability is predicted to de- membering. Instead, on each trial, the bird’s
crease as a result of the increased overlap be- behavior is governed by the history of rein-
tween the distributions. Discriminability may forcement for choosing red or green under
also decrease as a result of other factors that the prevailing conditions. If, given a value of
lead to a reduction in the separation of the x, a response to green has been reinforced
means of the distributions. more often than a response to red, the bird
The model can be characterized in terms will be more likely to choose green than red
of a conditional discrimination in which according to the matching law.
there is a continuum of values of the discrim- The version of the model described above
inative stimuli. That is, although only two has two parameters, one for the distance be-
stimuli are actually used (red and green), tween the discriminal processes and another
their effects vary from trial to trial, thereby for their variance. Further possibilities in-
creating a distribution of effective stimulus clude different variances for the two distri-
values for each stimulus. Each effective stim- butions, as may occur in delayed matching
ulus value has associated with it a unique re- for asymmetrical sample stimuli (Wixted &
Dougherty, 1996). Bias and undermatching in the effect of the reinforcer ratio on the trial-by-trial choice probability may also be included. These factors have not been incorporated here because they involve adding free parameters to an otherwise simple model. Quantitative fits of the model to data would be enhanced, however, by inclusion of the additional parameters.

The present model bears some apparent similarity to detection models in that both assume distributions of stimulus effect along a psychological continuum, following Thurstone’s notion of discriminal processes (Luce, 1994). The question of how to interpret this continuum is of importance to the issue of how the present model differs from other models of recognition memory.

Signal-detection theory was applied to recognition memory by Parks (1966) as suggested by Murdock (1965) and later by Banks (1970) and Lockhart and Murdock (1970). Snodgrass and Corwin (1988) give a clear summary of these models and discuss the problem of measuring response bias in recognition memory. In the model described by Parks, old and new items vary in their degree of “familiarity” or, in other models, “memory strength.” Familiarity or memory strength varies from item to item according to a normal distribution. The mean of the probability distribution for old items is greater in familiarity or memory strength than that for new items, if old is discriminable from new. The dimension along which item strength varies is a psychological continuum (Luce, 1994), in that it reflects variation in how stimuli may be represented in the nervous system or stored in long-term memory. As applied to the matching-to-sample paradigm or similar two-alternative choice procedures, however, there is little sense in defining a zero point for memory strength or familiarity, unless the stimulus effect continuum is interpreted as extending from “greenness” to “redness.”

In detection models, the dimension is portrayed as a decision axis (Green & Swets, 1966). A decision based on a criterion value on the dimension provides the main basis for the recognition response. But in the present model, there is some difficulty in identifying the continuum as a decision axis, because no criteria are assumed. One possibility is to define the stimulus effect dimension in terms of a composite of the factors that influence the overall level of performance. That is, the disparity parameter D may be related to wavelength disparity (for a hue discrimination) in addition to factors such as the FR sample requirement or ITI duration. A related issue is whether the stimulus effect dimension should be construed in sensory or memorial terms, an issue related to our notion that stimulus effect is defined at the time of remembering. Although a memorial interpretation is an obvious possibility, the notion that the effect of the temporally distant sample stimulus may be direct is consistent with a sensory interpretation (White, 1991). That is, if all of the variation in stimulus effect can be attributed to factors in the environment, a “direct remembering” approach can be sustained. It may be more plausible, however, to adopt the view that organismic processes contribute to stimulus variation. In either case, the stimulus effect dimension seems to be related to hue (e.g., for red and green sample stimuli), in which case the means of the stimulus effect distributions should not drift towards each other with increasing delay, as was assumed by White and Cooney (1996). For this reason, it was assumed in the present model that distribution variances increased with increasing delay, with the location of the means remaining constant. A mediational view consistent with signal-detection interpretations of the stimulus effect dimension is that a value x on a given trial provides “evidence” on the basis of which the choice response is made. For example, for red versus triangle samples (for stimuli on different dimensions), the value of x provides evidence for the redness versus triangularity of the previously presented sample. As it happens, the present model is silent on these alternatives, but does emphasize that the stimulus effect value on which the choice is conditional in the model is defined at the time of remembering, consistent with the treatment of remembering as delayed stimulus control.

As a final note, we offer a comment on the scope of the model. The aim was to present a model that incorporated the effects of the reinforcer ratio in delayed matching-to-sample procedures, or other remembering procedures in which the choice alternatives were explicit. The general prediction that the reinforcer effect becomes more influential as
the delay interval lengthens has been noted before (Wixted, 1989) and was given a quantitative basis in the present model. Although the present paper has focused on remembering, the model may be extended in future applications to account for stimulus–reinforcer interactions in signal-detection procedures and may be compared to other possible models in relation to the prediction of such interactions (Nevin et al., 1993). It is of interest to note that the model described by Alsop and Davison (1991, Equation 3) predicts that reinforcer sensitivity decreases with increasing stimulus discriminability when discrimination between the choice alternatives is imperfect. Future research involving manipulation of comparison–stimulus disparity in delayed matching and signal-detection procedures (cf. Nevin et al., 1993; White, 1986) might assist in determining whether the present model and that described by Alsop and Davison are equally successful in accounting for the inverse relation between reinforcer sensitivity and discriminability described in the present paper.

Given programmed reinforcer probabilities, the present model predicts obtained reinforcer frequencies as well as response frequencies, and on the basis of an assumption of strict matching of the ratio of choice probabilities on a given trial to reinforcer probabilities (given a value of x), the model predicts the power function relation between response and reinforcer ratios. Also to be explored in future analyses, the model predicts change in performance from the early trials in which the reinforcer effects are variable owing to few obtained reinforcers, to the “steady-state” performance shown here. The main prediction offered by the model, as emphasized in the present paper, is the inverse relation between discriminability and reinforcer sensitivity. Although it is possible to incorporate in the model factors such as constant response bias and proactive effects of reinforcers on prior trials, the simplest version of the model has two free parameters, one for the distance between the distribution means and one for the standard deviation of the distributions. Although there is no free parameter that describes the reinforcer effect, the model predicts systematic variation in reinforcer effect on the basis of factors affecting discriminability.

REFERENCES

Alsop, B., & Davison, M. (1991). Effects of varying stimulus disparity and the reinforcer ratio in concurrent-schedule and signal-detection procedures. Journal of the Experimental Analysis of Behavior, 56, 67–80.
Anderson, J. R. (1995). Learning and memory: An integrated approach. New York: Wiley.
Anderson, J. R., & Schooler, L. J. (1991). Reflections of the environment in memory. Psychological Science, 2, 396–408.
Banks, W. P. (1970). Signal detection theory and human memory. Psychological Bulletin, 74, 81–99.
Baum, W. M. (1974). On two types of deviation from the matching law: Bias and undermatching. Journal of the Experimental Analysis of Behavior, 22, 231–242.
Blough, D. S. (1998). Context reinforcement degrades discriminative control: A memory approach. Journal of Experimental Psychology: Animal Behavior Processes, 24, 185–199.
Creelman, C. D., & Donaldson, W. (1968). ROC curves for discrimination of linear extent. Journal of Experimental Psychology, 77, 514–516.
Davison, M., & McCarthy, D. (1988). The matching law: A research review. Hillsdale, NJ: Erlbaum.
Davison, M. C., & Tustin, D. (1978). The relation between the generalized matching law and signal-detection theory. Journal of the Experimental Analysis of Behavior, 29, 331–336.
Dusoir, A. E. (1975). Treatments of bias in detection and recognition models: A review. Perception & Psychophysics, 17, 167–178.
Ebbinghaus, H. (1964). Memory: A contribution to experimental psychology (H. A. Ruger, C. E. Bussenius, & E. R. Hilgard, Trans.). New York: Dover. (Original work published 1885)
Edhouse, W. V., & White, K. G. (1988). Sources of proactive interference in animal memory. Journal of Experimental Psychology: Animal Behavior Processes, 14, 56–71.
Egan, J. P. (1975). Signal detection theory and ROC analysis. New York: Academic Press.
Fechner, G. T. (1860). Elemente der psychophysik. Leipzig: Breitkopf & Hartel.
Fetterman, J. G. (1996). Dimensions of stimulus complexity. Journal of Experimental Psychology: Animal Behavior Processes, 22, 3–18.
Green, D. M., & Swets, J. A. (1966). Signal detection theory and psychophysics. New York: Wiley.
Healy, A. F., & Kubovy, M. (1981). Probability matching and the formation of conservative decision rules in a numerical analog of signal detection. Journal of Experimental Psychology: Human Learning and Memory, 7, 344–354.
Herrnstein, R. J. (1970). On the law of effect. Journal of the Experimental Analysis of Behavior, 13, 243–266.
Jones, B. M., & White, K. G. (1992). Stimulus discriminability and sensitivity to reinforcement in delayed matching to sample. Journal of the Experimental Analysis of Behavior, 58, 159–172.
Jones, B. M., White, K. G., & Alsop, B. (1995). On two effects of signaling the consequences for remembering. Animal Learning & Behavior, 23, 256–272.
Link, S. W. (1994). Rediscovering the past: Gustav Fechner and signal detection theory. Psychological Science, 5, 335–340.
Lockhart, R. S., & Murdock, B. B. (1970). Memory and the theory of signal detection. Psychological Review, 74, 100–109.
Luce, R. D. (1963). Detection and recognition. In R. D. Luce, R. R. Bush, & E. Galanter (Eds.), Handbook of mathematical psychology (Vol. 1, pp. 103–189). New York: Wiley.
Luce, R. D. (1994). Thurstone and sensory scaling: Then and now. Psychological Review, 101, 271–277.
Macmillan, N. A., & Creelman, C. D. (1991). Detection theory: A user’s guide. New York: Cambridge University Press.
McCarthy, D., & Davison, M. (1991). The interaction between stimulus and reinforcer control on remembering. Journal of the Experimental Analysis of Behavior, 56, 51–66.
McCarthy, D., & Voss, P. (1995). Delayed matching-to-sample performance: Effects of relative reinforcer frequency and of signaled versus unsignaled reinforcer frequencies. Journal of the Experimental Analysis of Behavior, 63, 33–52.
McCarthy, D., & White, K. G. (1987). Behavioral models of delayed detection and their application to memory. In M. L. Commons, J. Mazur, J. A. Nevin, & H. C. Rachlin (Eds.), Quantitative analyses of behavior: Vol. 5. The effect of delay and intervening events on reinforcement value (pp. 29–54). New York: Erlbaum.
McLean, A. P., & White, K. G. (1983). Temporal constraint on choice: Sensitivity and bias in multiple schedules. Journal of the Experimental Analysis of Behavior, 39, 405–426.
Murdock, B. B. (1965). Signal-detection theory and short-term memory. Journal of Experimental Psychology, 70, 443–447.
Nevin, J. A. (1969). Signal detection theory and operant behavior: A review of David M. Green and John A. Swets’ Signal Detection Theory and Psychophysics. Journal of the Experimental Analysis of Behavior, 12, 475–480.
Nevin, J. A. (1981). Psychophysics and reinforcement schedules. In M. L. Commons, J. E. Mazur, J. A. Nevin, & H. Rachlin (Eds.), Quantitative analyses of behavior: Vol. 1. Discriminative properties of reinforcement schedules (pp. 3–27). Hillsdale, NJ: Erlbaum.
Nevin, J. A., Cate, H., & Alsop, B. (1993). Effects of differences between stimuli, responses, and reinforcer rates on conditional discrimination performance. Journal of the Experimental Analysis of Behavior, 59, 147–161.
Nevin, J. A., & Grosch, J. (1990). Effects of signaled reinforcer magnitude on delayed matching-to-sample performance. Journal of Experimental Psychology: Animal Behavior Processes, 16, 298–305.
Parks, T. E. (1966). Signal-detectability theory of recognition-memory performance. Psychological Review, 73, 44–58.
Ratcliff, R., McKoon, G., & Tindall, M. (1994). Empirical generality of data from recognition memory receiver-operating characteristic functions and implications for the global memory models. Journal of Experimental Psychology: Learning, Memory, and Cognition, 20, 763–785.
Roberts, W. A. (1972). Short-term memory in the pigeon: Effects of repetition and spacing. Journal of Experimental Psychology, 94, 74–83.
Snodgrass, J. G., & Corwin, J. (1988). Pragmatics of measuring recognition memory: Applications to dementia and amnesia. Journal of Experimental Psychology: General, 117, 34–50.
Stevens, S. S. (1961). To honor Fechner and repeal his law. Science, 133, 80–86.
Thomas, E. A. C. (1975). Criterion adjustment and probability matching. Perception & Psychophysics, 18, 158–162.
Thomas, E. A. C., & Legge, D. (1970). Probability matching as a basis for detection and recognition decisions. Psychological Review, 77, 65–72.
Thurstone, L. L. (1927). A law of comparative judgment. Psychological Review, 34, 273–286.
White, K. G. (1985). Characteristics of forgetting functions in delayed matching to sample. Journal of the Experimental Analysis of Behavior, 44, 15–34.
White, K. G. (1986). Conjoint control of performance in conditional discrimination by successive and simultaneous stimuli. Journal of the Experimental Analysis of Behavior, 45, 161–174.
White, K. G. (1991). Psychophysics of direct remembering. In J. A. Commons, M. C. Davison, & J. A. Nevin (Eds.), Models of behavior: Signal detection (pp. 221–237). New York: Erlbaum.
White, K. G., & Cooney, E. B. (1996). The consequences of remembering: Independence of performance at different retention intervals. Journal of Experimental Psychology: Animal Behavior Processes, 22, 51–59.
White, K. G., Pipe, M.-E., & McLean, A. P. (1985). A note on the measurement of stimulus discriminability in conditional discriminations. Bulletin of the Psychonomic Society, 23, 153–155.
Williams, B. A. (1988). Reinforcement, choice, and response strength. In R. C. Atkinson, R. J. Herrnstein, G. Lindzey, & R. D. Luce (Eds.), Stevens’ handbook of experimental psychology: Vol. 2. Learning and cognition (pp. 167–244). New York: Wiley.
Wixted, J. T. (1989). Nonhuman short-term memory: A quantitative reanalysis of selected findings. Journal of the Experimental Analysis of Behavior, 52, 409–426.
Wixted, J. T. (1993). A signal detection analysis of memory for nonoccurrence in pigeons. Journal of Experimental Psychology: Animal Behavior Processes, 19, 400–411.
Wixted, J. T., & Dougherty, D. H. (1996). Memory for asymmetric events. In D. Medin (Ed.), The psychology of learning and motivation (Vol. 35, pp. 89–126). San Diego: Academic Press.
Wixted, J. T., & Ebbesen, E. B. (1991). On the form of forgetting. Psychological Science, 6, 409–415.
Yonelinas, A. P. (1994). Receiver-operating characteristics in recognition memory: Evidence for a dual-process model. Journal of Experimental Psychology: Learning, Memory, and Cognition, 20, 1341–1354.

Received April 29, 1998
Final acceptance August 13, 1998
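
As an illustration of the model summarized in the General Discussion, the following Python sketch computes discriminability and reinforcer sensitivity from a pair of equal-variance normal distributions of stimulus effect. It is a minimal sketch under stated assumptions, not the authors' fitting procedure. The assumptions, none of which is spelled out in this section, are: the two samples are presented equally often; the probability of choosing red at each value of x strictly matches the arranged reinforcer probabilities weighted by the two densities at x; discriminability and bias are computed as log d and log b from the resulting choice ratios; and the reinforcer probabilities 0.8 and 0.2 are illustrative values only. All function and parameter names are hypothetical.

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def choice_probabilities(d, sd, p_red, p_green, n=4001):
    """Return P(choose red | red sample) and P(choose red | green sample).

    d is the distance between the distribution means, sd their common
    standard deviation, and p_red, p_green the arranged probabilities of
    reinforcement for correct red and correct green choices.
    """
    mu_red, mu_green = d / 2.0, -d / 2.0
    lo, hi = mu_green - 8.0 * sd, mu_red + 8.0 * sd
    step = (hi - lo) / (n - 1)
    red_given_red = 0.0
    red_given_green = 0.0
    for i in range(n):
        x = lo + i * step
        f_red = normal_pdf(x, mu_red, sd)
        f_green = normal_pdf(x, mu_green, sd)
        # Assumed reinforcer weights for each choice at this value of x.
        r_red = p_red * f_red
        r_green = p_green * f_green
        total = r_red + r_green
        # Strict matching of choice probability to the reinforcer weights.
        p_choose_red = r_red / total if total > 0.0 else 0.5
        # Average the choice probability over each sample's distribution.
        red_given_red += f_red * p_choose_red * step
        red_given_green += f_green * p_choose_red * step
    return red_given_red, red_given_green


def log_d_and_log_b(d, sd, p_red, p_green):
    """Discriminability (log d) and bias toward red (log b)."""
    rr, rg = choice_probabilities(d, sd, p_red, p_green)
    red_sample_ratio = rr / (1.0 - rr)      # red vs. green choices after red
    green_sample_ratio = (1.0 - rg) / rg    # green vs. red choices after green
    log_d = 0.5 * math.log10(red_sample_ratio * green_sample_ratio)
    log_b = 0.5 * math.log10(red_sample_ratio / green_sample_ratio)
    return log_d, log_b


def reinforcer_sensitivity(d, sd, p_rich=0.8, p_lean=0.2):
    """Return log d and the slope of log b against the log reinforcer ratio."""
    log_d, log_b_red_rich = log_d_and_log_b(d, sd, p_rich, p_lean)
    _, log_b_green_rich = log_d_and_log_b(d, sd, p_lean, p_rich)
    log_ratio_span = 2.0 * math.log10(p_rich / p_lean)
    return log_d, (log_b_red_rich - log_b_green_rich) / log_ratio_span


# Small sd stands in for a short delay, large sd for a long delay (D fixed at 1).
log_d_easy, sens_easy = reinforcer_sensitivity(d=1.0, sd=0.5)
log_d_hard, sens_hard = reinforcer_sensitivity(d=1.0, sd=2.0)
print(f"sd=0.5: log d = {log_d_easy:.2f}, sensitivity = {sens_easy:.2f}")
print(f"sd=2.0: log d = {log_d_hard:.2f}, sensitivity = {sens_hard:.2f}")
```

Under these assumptions, increasing the standard deviation with D fixed at 1 lowers log d and raises the estimated reinforcer sensitivity, which is the inverse relation emphasized in the text. Integrating numerically on a grid keeps the sketch dependency-free; a closed-form or library-based integration would serve equally well.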
