Académique Documents
Professionnel Documents
Culture Documents
PSYCHOPHYSICS OF REMEMBERING
K. G EOFFREY W HITE AND J OHN T. W IXTED
UNIVERSIT Y OF OTAGO, NEW ZEALAND AND
UNIVERSIT Y OF CALIFORNIA, SAN DIEGO
We present a new model of remembering in the context of conditional discrimination. For proce-
dures such as delayed matching to sample, the effect of the sample stimuli at the time of remem-
bering is represented by a pair of Thurstonian (normal) distributions of effective stimulus values.
The critical assumption of the model is that, based on prior experience, each effective stimulus value
is associated with a ratio of reinforcers obtained for previous correct choices of the comparison
stimuli. That ratio determines the choice that is made on the basis of the matching law. The standard
deviations of the distributions are assumed to increase with increasing retention-interval duration,
and the distance between their means is assumed to be a function of other factors that influence
overall difficulty of the discrimination. It is a behavioral model in that choice is determined by its
reinforcement history. The model predicts that the biasing effects of the reinforcer differential
increase with decreasing discriminability and with increasing retention-interval duration. Data from
several conditions using a delayed matching-to-sample procedure with pigeons support the predic-
tions.
Key words: remembering, conditional discrimination, forgetting functions, discriminability, rein-
forcer probability, delayed matching to sample, pigeon
The foundation for a psychophysical anal- offers well-established methods for quantify-
ysis of remembering was laid over a century ing the changes in behavior that result from
ago, with Fechner’s quantitative analysis of measurable changes in the physical environ-
sensation and Ebbinghaus’ experimental ment.
analysis of memory. Fechner (1860) had pro- Ebbinghaus (1885/1964) showed that
posed that sensory experience is a logarith- changes in remembering were amenable to
mic function of stimulus intensity. Although quantification. His measure of retention, the
the function relating experience and environ- percentage savings in time to relearn a list of
ment has since been interpreted as following nonsense words, decreased logarithmically as
a power law (Stevens, 1961) Fechner’s fun- retention interval increased over a period of
damental contribution was to render sensory several weeks. It has since been shown that
experience amenable to scientific analysis. Ebbinghaus’ data are better described by a
Fechner had also described a theory for dis- power function (Anderson & Schooler, 1991;
crimination that predated the more recent Wixted & Ebbesen, 1991). Ebbinghaus’ early
development of signal-detection theor y attempt to quantify memory nevertheless es-
(Link, 1994). The field of psychophysics now tablished the possibility that remembering
may be amenable to analysis in the same
Earlier versions of the model described in this paper terms as sensing and perceiving.
were presented to the meetings of the Society for Quan- The aim of the present paper is to offer an
titative Analyses of Behavior, Washington, D.C., June, analysis of remembering that follows the gen-
1995, and the Behavior Symposium, Christchurch, New eral approach of signal-detection theory and
Zealand, August, 1996. Preparation of the manuscript
was facilitated by the generous hospitality provided by
to apply it to remembering in nonhuman an-
Peter Killeen during the first author’s leave at Arizona imals. The analysis assumes that the effect of
State University. Killeen’s constructive comments consid- the stimulus can be represented in terms of
erably improved the manuscript. We are grateful to An- discriminal processes of the kind suggested
gela Ruske for her outstanding assistance during the con-
duct of the experiments reported here, to members of
by Thurstone (1927). Unlike decision-theo-
our laboratory groups, especially Emily Cooney and Deir- retic approaches, however, a decision criteri-
dra Dougherty, for their helpful discussions, and to Barry on is not assumed; instead, the animal’s
Dingwall, for his essential technical expertise. choice is assumed to be influenced directly
Requests for reprints should be addressed to the first
author at the Department of Psychology, University of
by the payoff ratio. This approach makes the
Otago, Dunedin, New Zealand (E-mail: kgwhite@otago. interesting prediction that the effect of pay-
ac.nz or jwixted@ucsd.edu). offs in biasing an animal’s choice depends on
91
92 K. GEOFFREY WHITE and JOHN T. WIXTED
the discriminability of the stimuli to be dis- adjusted so that more events are reported as
criminated. familiar. In principle, and in the absence of
criterion variance, the analysis prescribed by
Detection Models of Recognition the signal-detection approach leaves the mea-
The treatment of remembering in the sure of discriminability untainted by errant
same terms as sensory discrimination was not changes in response tendencies, and reveals
developed until 80 years after Ebbinghaus, pure memorability.
when Murdock (1965) suggested that remem-
bering was a matter of discriminating familiar Criterion Location
from novel events (see also Banks, 1970; Egan (1975) and Macmillan and Creelman
Lockhart & Murdock, 1970; Parks, 1966). An (1991) have summarized possibilities for
important assumption was that familar and choice of a decision rule for models that rely
novel items were assumed to vary in ‘‘mem- on the assumption of a decision criterion. (a)
ory strength’’ or ‘‘familiarity,’’ with the mean A probability matching rule was proposed by
of the distribution of novel events at zero and Parks (1966) and elaborated by Thomas and
the mean of the distribution of familiar Legge (1970) and Thomas (1975). The prob-
events at a value of memory strength greater ability matching rule is assumed to apply in
than zero. The familiarity of an event increas- cases in which observers have incomplete in-
es with training or practice. The increase in formation about underlying distributions in
familiarity (or item strength) tends to follow contrast to the ‘‘ideal observer’’ who has com-
a power function of the number of training plete information and is thus able to utilize a
trials (Anderson, 1995). According to the the- likelihood ratio criterion. For symmetrical
ory of signal detection (Green & Swets, payoff matrices, the probability matching rule
1966), the task of deciding whether a partic- assumes that the observer reports occurrence
ular event is familiar is accomplished by using of the target items with a probability that
a decision rule: Events of memory strength matches their a priori occurrence. Thus, for
greater than a criterion value are categorized example, if Stimulus A occurs on 80% of the
as familiar and hence are responded to as re- trials and Stimulus B occurs on 20% of the
membered, whereas those of weaker strength trials, the criterion will be placed on the de-
than the criterion are categorized as novel. cision axis in such a way that the probability
of reporting Stimulus A is 80%. Creelman
Discriminability and Bias and Donaldson (1968) showed that for judg-
The signal-detection approach to recogni- ments of line length, changes in the prior
tion generated an extensive empirical litera- probability of the stimuli did not affect dis-
ture that has benefited from perhaps the criminability (i.e., the distance between the
most influential theoretical assumption of distributions) but did affect the placement of
psychophysics: the independence of discrim- the decision criterion such that the highly
inability and response bias. The approach trained subjects matched response propor-
yields a measure of discriminability or mem- tions to relative stimulus probability. Because
orability, measured by the distance d9 be- correct responses produced monetary re-
tween the means of the distributions of novel wards, it is also possible that response pro-
and familiar events. Memorability is indepen- portions were sensitive to the relative mone-
dent of the location of the criterion value for tar y payoff. In other studies, response
deciding between familiar and novel. The lo- proportions undermatched prior stimulus
cation of the criterion value can be used as a probability (Dusoir, 1975). (b) An alternative
measure of response bias, that is, the tenden- decision rule is one that maximizes expected
cy to report familiar versus novel indepen- value. If expected value is to be maximized,
dently of the discriminability of familiar from the rule takes account of both the prior prob-
novel. A major determinant of the location of abilities of occurrence of the stimuli and the
the criterion is relative payoff, although spe- payoff matrix. It assumes that the decision
cific rules relating criterion value to relative rule is based on the likelihood ratio, that is,
payoff have yet to be developed (Macmillan the ratio of probabilities of occurrence of one
& Creelman, 1991). If the payoff favors re- sample versus the other (Egan, 1975). Healy
porting an event as familar, the criterion is and Kubovy (1981) orthogonally varied pay-
PSYCHOPHYSICS OF REMEMBERING 93
off and prior probability in a numerical cat- volves continual changes in discriminability
egorization task. They reported that prior or stimulus control. At very short retention
probabilities had larger effects on the likeli- intervals, remembering is easy and the bias-
hood ratio criteria than did payoff, but that ing effects of differential payoffs are attenu-
a probability matching rule that included a ated, whereas at very long retention intervals,
constant based on the payoff matrix could ac- remembering is difficult and the effects of
count for their data. differential payoffs are magnified (Wixted,
1989). The model we describe here addresses
Conditional Discrimination this issue and at the same time deals with an-
The separation of discriminability from other more fundamental difficulty associated
bias in psychophysics has a parallel in the with the earlier signal-detection approach to
study of conditional discrimination learning, remembering, namely the problem of speci-
where the effects of the discriminative stimuli fying how the subject’s decision rule is deter-
may be separated from the effects of the dif- mined by the location of the criterion on the
ferential reinforcer probabilities that main- memory strength continuum.
tain the discrimination (Nevin, 1981; White, The model we describe does not rely on
1986; Williams, 1988). In the context of the assumption of a criterion. It combines the
choice procedures, differential reinforce- time-honored assumption that stimulus ef-
ment biases the choice towards one alterna- fects are represented by random values drawn
tive versus another. Variation in the reinforc- from normal density functions, as in Thur-
er probabilities for two choices results in a stone’s (1927) discriminal processes, with the
power function relation between the ratios of more recent generalization that the prefer-
the choices and the reinforcer ratios (Baum, ence or choice between alternatives is a func-
1974). Variation in the disparity of the dis- tion of the payoffs they produce (Baum,
criminative stimuli of a conditional discrimi- 1974; Herrnstein, 1970).
nation allows an assessment of the extent to
which stimulus disparity limits the discrimi- Interaction of Reinforcer and
nation (Nevin, 1969). In conditional discrim- Stimulus Control
inations, however, the effects of the reinforc- Quantification of the effects of payoffs on
er differential are modulated by stimulus remembering is easily achieved in terms of
disparity. When choice relies on both stimu- the matching law. Jones and White (1992) ex-
lus and reinforcer differences, large stimulus amined performance in a standard delayed
differences attenuate the effect of the rein- matching-to-sample procedure in which five
forcer differential and small stimulus differ- responses to a red or green sample stimulus
ences amplify the reinforcer effect (White, initiated a delay interval of variable duration,
1986). Signal-detection procedures, which followed by a choice between red and green
are also classed as conditional discriminations comparison stimuli. The probability of rein-
(McCarthy & White, 1987), are associated forcers for correct (matching) choices follow-
with a similar problem. The extent of discrim- ing red and green sample stimuli was varied
inability may modulate the effects of the re- over several conditions. When the probability
inforcers that otherwise bias choice towards of a reinforcer for a correct choice of red was
one or the other alternative. In situations in higher than for a correct choice of green, the
which discriminability is expected to vary, the tendency to choose red was greater than the
effect of the reinforcer differential may inter- tendency to choose green. More generally,
act with the effects of the discriminative stim- the ratio of red to green choices following
uli. red and green samples was a power function
of the ratio of reinforcers obtained by correct
Discriminability Decrement in Remembering red versus green choices.
The interaction of reinforcer effects with The result reported by Jones and White
stimulus effects is especially evident in re- (1992) is summarized in Figure 1, for 1 of
membering by virtue of its primary charac- their birds, X1. Panel A shows the standard
teristic, the systematic decrement in discrim- decrement in matching accuracy with increas-
inability with increasing retention-interval ing delay-interval duration. Matching accu-
duration. Remembering (or forgetting) in- racy was assessed in terms of a measure of
94 K. GEOFFREY WHITE and JOHN T. WIXTED
Fig. 1. Data for Bird X1 replotted from results reported in Appendix A of Jones and White (1992). In (A), the
forgetting function is defined by the decrease in discriminability as a function of delay-interval duration. In (B), the
slope of the function relating the log ratio of red to green choices to the log ratio of red to green reinforcers steepens
with increasing delay, as summarized in (C).
discriminability (see below). Panel B shows McLean, 1985). Here, reinforcer control of
that the tendency to choose red versus green the choice was weak for an easy line-tilt dis-
is a function of the ratio of reinforcers ob- crimination and strong for a difficult discrim-
tained by red versus green choices, as ex- ination. Nevin, Cate, and Alsop (1993) have
pected from the power function version of also reported a stronger effect of varying re-
the matching law (Baum, 1974). The slopes inforcer probability for a smaller disparity be-
of the functions in Panel B are a measure of tween discriminative stimuli in a discrete-tri-
the exponent of the power function relating als conditional discrimination. That is, there
the ratio of choices to the ratio of reinforcers. is a parallel between reinforcer sensitivity ef-
The exponent is usually interpreted as an in- fects for perceptual and memorial proce-
dex of sensitivity to reinforcement (Davison dures. If a discrimination is easy because of
& McCarthy, 1988). Panel C summarizes large stimulus disparity or a short retention
these slopes and, more importantly, indicates interval, there is little effect of varying the
that sensitivity to reinforcement is a function reinforcer ratio. If a discrimination is diffi-
of delay-interval duration. The slopes of the cult, as with small stimulus disparity or long
functions in Panel B gradually increase with retention intervals, behavior is very sensitive
increasing retention-interval duration. to variation in the reinforcer ratio. This result
The result that the biasing effect of the re- is consistent with the model of delayed
inforcer ratio is weak at short delays when dis- matching-to-sample performance described
criminability is high and is strong at long de- by Wixted (1989). According to this model,
lays when discriminability is weak has a the influence of sample stimuli and the ratio
parallel in an earlier result for conditional of reinforcers obtained by correct choices
discriminations (White, 1986; White, Pipe, & have separate effects that depend on the de-
PSYCHOPHYSICS OF REMEMBERING 95
lay interval. At short delays the conditional Consistent with the generalized matching law,
discrimination is influential and the reinforc- the log ratio of red to green choices is a lin-
er effect is weak. At long delays when the con- ear function of the log ratio of reinforcers
ditional discrimination is weaker, the rein- obtained by correct red versus green choices
forcer effect is strong. (Rr, Rg). The slope of the function, a, mea-
sures sensitivity of choice to variation in the
Response Measures reinforcer ratio, and c is a constant describing
The measures of discriminability and bias overall (unexplained) preference for one or
that we adopt are, respectively, the geometric the other choice alternative.
mean of the ratio of correct to error respons- The result that the biasing effects of the
es following each sample and the geometric reinforcer ratio depend on the delay suggests
mean of the ratio of red to green choices fol- that discriminability and bias may not be in-
lowing each sample. These measures can be dependent, as otherwise assumed by the stan-
derived from different theoretical assump- dard signal-detection approach or early ver-
tions, but can otherwise be treated as theory- sions of the behavioral detection approach
free measures of performance (as is our pref- (e.g., Davison & Tustin, 1978; but see Alsop
erence here). The measures were originally & Davison, 1991). Both approaches assume
proposed by Luce (1963) in the context of separate and independent influences of the
choice theory, and as anticipated by Nevin discriminative stimuli and factors that gener-
(1969), Davison and Tustin (1978), and Nev- ate bias. In the signal-detection model, the
in (1981) in the context of behavioral detec- location of a decision criterion along the ev-
tion theory. From a theory-free perspective, idence variable (the determinant of bias) is
the measures reflect the likelihood that the independent of the distance between signal
subject makes correct responses versus errors, and noise distributions (the determinant of
or reports one alternative versus the other. By discriminability). In the behavioral detection
taking ratios, the measurement scale is not model, the biasing effects of stimulus dispar-
constrained in the way that, for example, a ity on choice are independent of the biasing
proportion scale is bounded by 1.0. The dis- effects of the reinforcers obtained by the
criminability measure (log d, or log a) is lin- choices. In a more recent version of the be-
early related to the discriminability measure havioral detection model (Alsop & Davison,
d9 derived from signal detection theory, and 1991; see also Nevin et al., 1993, especially
satisfies the requirement that both hits and Equation 8), although the two free parame-
false alarms contribute to the measure of dis- ters describing stimulus and reinforcer effects
criminability (Macmillan & Creelman, 1991, are said to be independent, sensitivity to re-
pp. 11–13). The discriminability measure is inforcement, as measured by Equation 2, re-
calculated by taking the logarithm (base 10) flects the joint effects of stimulus difference
of the ratio of correct (c) to error (e) respons- and differential reinforcement under some
es following each sample (subscripts r and g conditions. An interesting issue for the be-
for red and green samples) and averaging havioral detection model is that it assumes
them according to that the choice between two alternatives fol-
discriminability, lowing one sample is determined by the ratio
of reinforcers obtained by correct choices fol-
log d 5 .5·log[(c r /e r )·(c g /e g )]. (1) lowing both samples. That is, it treats the mul-
The tendency or bias to choose red versus tiple concurrent schedule as if it were a con-
green is calculated by averaging the loga- current schedule. McLean and White (1983)
rithms of the ratios of choices of red to choic- have claimed that in multiple concurrent
es of green following each sample. In Panel B schedules (including detection procedures),
of Figure 1, it is plotted as a function of the the reinforcers obtained by correct choices
red to green reinforcer ratio, according to following one sample do not influence choice
following the other sample (with the excep-
log red/green choices tion of reallocation of extraneous reinforcers
5 0.5·log[(c r /c g )·(e r /e g )] from one time to another). An alternative as-
sumption was made by Nevin (1981) and is
5 a log(R r /R g ) 1 c. (2) consistent with Luce’s (1963) choice theory:
96 K. GEOFFREY WHITE and JOHN T. WIXTED
(within the bounds of criterion variance). For sentation probability of .5, and a value x is
values of x greater than the decision criteri- randomly sampled from the stimulus effect
on, one response alternative is chosen, and dimension according to the probability den-
for values of x smaller than the criterion, the sity function defined for each sample stimu-
other alternative is chosen. In the present lus. The reinforcer distributions are defined
model, choice is not related to the likelihood by multiplying the discriminal processes by
ratio because it depends on the current ratio the reinforcer probabilities associated with
of reinforcer probabilities that change with x. the distributions. For a given value of x, the
The probability of choosing one versus the proportion of the heights of the resulting dis-
other alternative varies over the x axis and is tributions (at x) directly predicts the proba-
not all or none, as in the detection models. bility of choosing red versus green on that
Whereas the adoption of a constant decision trial. The heights (or probabilities) are given
criterion in the signal-detection models re- by p(Rx) 5 pr * N(D/2, Sr) and p(Gx) 5 pg *
sults in the derivation of independent mea- N(D/2, Sg). These distributions represent the
sures of discriminability and bias, the assump- probabilities that a reinforcer is arranged on
tion that choice is determined by the the red and green choice alternatives for a
reinforcer ratio for a given value of x in the given value of x. For example, assume that pr
present model allows the prediction that bias (the probability of reinforcement for a cor-
and discriminability interact. rect red response) and pg (the probability of
Separate measures of discriminability and reinforcement for a correct green response)
bias can be derived from the model, which are both 1.0 (cf. Figure 3, Panel A). Further
simulates events in a delayed matching-to- assume that x on a given red trial happened
sample procedure on a trial-by-trial basis. The to equal 0 (i.e., x on this trial happened to
model generates frequencies of correct and fall at the intersection of the two distribu-
error responses separately for trials with red tions). In the past for that value of x, a rein-
and green prior events as a function of the forcer was as likely to be set up on red as on
relative reinforcer frequencies that vary over green, so an unbiased bird would be equally
trials as a result of varying the values of stim- likely to choose either alternative. On the
ulus effect. next red trial, imagine that x equaled D/2
(i.e., x fell at the mean x value for red trials).
Predictions from the Model Note that, at that point, the height of the red
Predictions from the model were generat- distribution exceeds that of the green distri-
ed by running many simulations, each for bution by about 6 to 1. That is, in the past,
5,000 trials of a delayed matching-to-sample this value of x has occurred on red trials
procedure. The discriminal processes are de- about six times as often as green trials. Be-
fined by normal (or, in practice, logistic ap- cause pr and pg are both 1.0, this means that,
proximations to normal) distributions with for this particular value of x, a reinforcer is
variances Sr2 and Sg2 and means of 20.5D six times as likely to be arranged on the red
(green) and 10.5D (red). The parameter D choice alternative. Thus, according to the
is disparity on the stimulus effect dimension. matching law, an unbiased bird would be
To model an experimental procedure, initial- about six times as likely to choose red over
ly a red or green sample stimulus is chosen green given this value of x.
with a particular probability (usually .5). For The process works the same way when pr
the chosen sample, say red, a value x is ran- and pg are unequal. For example, assume that
domly selected from the associated distribu- pr 5 .3 and pg 5 .7. Multiplying both distri-
tion. In the simulation, x is specified in z- butions by these values yields the distribu-
score units, so that x 5 Sr * zi 1 D/2, or if the tions shown in Figure 3 (Panel B). Assume
green distribution is selected, x 5 Sg * zi 2 once again that on a given red trial the value
D/2, where zi is a randomly selected z score, of x happened to equal 0. Although that value
that is, a value randomly selected from a nor- is as likely to occur on a red trial as on a
mal distribution with a mean of 0 and a stan- green trial (as before), it is no longer the case
dard deviation of 1. that a reinforcer is as likely to be arranged
To summarize thus far, on each trial a red on the red choice alternative as on the green
or green sample is chosen with a signal pre- choice alternative. Instead, as illustrated in
PSYCHOPHYSICS OF REMEMBERING 99
culate log ratios of red to green reinforcers. Fig. 6. The relation between sensitivity to reinforce-
Corresponding frequencies of red and green ment predicted by the model and discriminability pre-
choices on red-sample and green-sample tri- dicted by the model, for instances when changes in both
als were used to calculate the mean of the log were produced by varying the standard deviation with D
5 1 (top panel) and varying the disparity parameter D,
ratio of red to green choices following the with SD 5 1 (bottom panel).
red sample and the log ratio of red to green
choices following the green sample. This
measure is the same as the bias measure pro- ratio, as given by the slope of the straight line,
posed by Luce (1963) and Davison and Tus- is smaller for the easy discrimination.
tin (1978), and is described by Equation 2
above. The log ratio of choices from the sim- Predicting the Interaction Between
ulations was satisfactorily described by a lin- Discriminability and Bias
ear function of the log ratio of reinforcers, Figure 6 shows predictions from the model
consistent with the generalized matching law for sensitivity to reinforcement and discrimi-
(Baum, 1974). That is, the present model nability, one plotted as a function of the oth-
predicts a power function relation between er. These results were obtained in the same
choice and reinforcer ratios. way as for Figure 5 but with two pairs of re-
Three aspects of the predictions in Figure inforcer probabilities (pr and pg) for each in-
5 are noteworthy. First, the present model stance of standard deviation and D (pr/pg 5
predicts undermatching in the conditional .2/.8 or .8/.2). The top panel shows the in-
discrimination, on the basis of the assump- verse relation between reinforcer sensitivity
tion that the simulated bird matches choice and discriminability when standard deviations
proportions to reinforcer proportions at each were varied over 14 values with D 5 1. The
value of x on the stimulus effect continuum. bottom panel shows the inverse relation ob-
Second, with decreasing distance between the tained by varying D over 17 values with the
discriminal processes (decreasing D), the re- standard deviations of the discriminal pro-
inforcer ratios are predicted to cover a wider cesses set at 1. Each 5,000-trial simulation
range (despite entering the same values of generated a matrix of obtained response and
programmed reinforcer probabilities for reinforcer frequencies, which were used to
each D). Third, the function for the easy dis- calculate log ratios of red to green reinforcers
crimination is flatter than the function for and log ratios of red to green choice respons-
the hard discrimination. That is, sensitivity of es. Reinforcer sensitivity was estimated from
the choice ratio to changes in the reinforcer the slope of the function relating ratios of
PSYCHOPHYSICS OF REMEMBERING 101
Fig. 7. The relation between sensitivity to reinforcement and discriminability at different delay intervals for in-
dividual birds (shown by different symbols) in the study by Jones and White (1992) and the regression line for the
predictions from the present model (from Figure 6, top panel).
choices to ratios of reinforcers (cf. Figure 5). discriminal processes is influenced by several
Discriminability was calculated from the log sources of discriminability. These include the
ratios of correct to error responses (Davison wavelength disparity (for example) of the
& Tustin, 1978; Luce, 1963; McMillan & sample stimuli. Retention-interval duration
Creelman, 1991). Figure 6 shows the inverse also influences predicted discriminability, but
and virtually linear relation between reinforc- by increasing the variances of the distribu-
er sensitivity and discriminability predicted by tions. Other factors that influence discrimin-
the model. The results in the top panel ob- abilty are the fixed-ratio requirement for sam-
tained when the the standard deviations were ple-key responding and sample-presentation
increased, illustrate the pattern expected duration (White, 1985). We now report a set
when retention-interval duration is increased. of experimental conditions for pigeons work-
This is the interaction evident in the data re- ing in a delayed matching-to-sample proce-
ported by Jones and White (1992). The re- dure in order to examine further the inverse
sults in the bottom panel are what might be relation between discriminability and the bi-
expected when other factors that influence asing effects of the reinforcer differential. In
overall task difficulty are manipulated. all cases, retention-interval duration was var-
Figure 7 shows the data for individual birds ied in order to generate further evidence for
from Jones and White (1992) along with the the inverse relation predicted when standard
regression line that best fits the predictions deviations of the discriminal processes are in-
from our simulation for standard devation creased. In other conditions, factors influenc-
varied with the disparity parameter arbitrarily ing discrimination difficulty were included in
set at 1 (shown in the top panel of Figure 6). order to assess the relation predicted when
The predictions are generally consistent with the distance between the means of the dis-
the data. criminal processes changes. Reinforcement
rate was determined by the independent
Sources of Discriminability probability that a correct choice was followed
An important assumption of the model is by a reinforcer. Correct choices that were not
that the distance D between the means of the followed by reinforcers had no other conse-
102 K. GEOFFREY WHITE and JOHN T. WIXTED
quence, consistent with the typical method experimental sessions. Water and grit were
for arranging delayed matching-to-sample freely available in the home cages. The colo-
procedures and successive discriminations. In ny room was naturally illuminated with a pho-
the latter, for example, when responses in toperiod of approximately 14:10 hr.
one component are reinforced at variable in-
tervals, nonreinforced responses have no oth- Apparatus
er consequence that distinguishes them from A light- and sound-attenuating experimen-
the nonreinforced responses in the other tal chamber, 33 cm wide, 33 cm deep, and 34
component in which extinction may be ar- cm high, was painted matte black. A ventila-
ranged. In some very few studies in which all tion fan provided masking noise. Three trans-
correct choices in delayed matching to sam- lucent response keys, each 2.9 cm in diame-
ple are followed by hopper illumination and ter, were mounted on one wall 9.2 cm center
only some also produce grain (McCarthy & to center and 25.6 cm above the grid floor.
Davison, 1991), hopper illumination could Each key could be lit red or green and could
serve as a conditional reinforcer, and its prob- be operated by a minimum force of 0.15 N.
ability could be confounded with the proba- A central hopper opening below the center
bility of the food reinforcer. key and 5 cm from the grid floor allowed 3-s
access to wheat. The only illumination in the
chamber was provided by red or green on
EXPERIMENT 1:
center or side keys or by a white hopper light
ABSOLUTE RATE OF
during 3-s wheat presentations.
REINFORCEMENT
In the signaled magnitude effect described Procedure
by Nevin and Grosch (1990), accuracy is over- Sessions were conducted 7 days per week.
all higher on trials in which larger reinforcers Each session consisted of 80 trials, separated
follow correct choices than on trials in which by 15-s intertrial intervals (ITIs) during which
small reinforcers follow correct choices. The the chamber was dark and responses were in-
two types of trials are differentially signaled. effective. On each trial, a fixed ratio (FR) of
The result was confirmed by McCarthy and five responses to a red or green sample stim-
Voss (1995) and Jones, White, and Alsop ulus presented on the center key terminated
(1995). Our generalization of the result is the sample and initiated a dark delay period.
that accuracy may be overall higher when The delay lasted for 0.2 s, 1 s, 4 s, or 12 s,
higher absolute rates of reinforcement are ar- with delay durations mixed within sessions.
ranged in delayed matching to sample. Presentation of red and green comparison
We compared the effects of overall rich ver- stimuli on side keys followed the delay. A sin-
sus overall lean reinforcement rate in a de- gle choice response to one of the side keys
layed matching-to-sample procedure. We darkened both keys. Correct (matching)
asked whether we could replicate the effect choices produced 3-s access to grain with a
reported by Jones and White (1992) and pre- given probability. Unreinforced matching re-
dicted by our model, in which sensitivity to sponses and incorrect choices produced 3-s
reinforcement increased with increasing de- blackout periods, followed by the dark ITI.
lay-interval duration, and whether the inverse The order of red and green sample stimuli,
relation between reinforcer sensitivity and whether the red and green comparison stim-
discriminability was generalizable to the dif- uli appeared on the left or the right, and the
ferent performance levels generated by the order of delay intervals were random within
manipulation of absolute reinforcer rate. each session, with the constraint that each
M ETHOD combination of sample, comparison-stimulus
location, and delay occurred equally often
Subjects and with no more than three consecutive tri-
Three adult homing pigeons with prior ex- als with the same sample stimulus.
perience in delayed matching-to-sample pro- Reinforcer probabilities were fixed at cer-
cedures were maintained within 12 g of 80% tain values for each of 20 sessions per con-
of their free-feeding body weights by supple- dition. In five conditions, the probabilities
mentary feeding with mixed grain following were overall rich, and in five conditions they
PSYCHOPHYSICS OF REMEMBERING 103
Table 1
Reinforcer probabilities for correct choices of red and
green comparison stimuli. 20 sessions were conducted
for each condition.
Fig. 9. Log ratio of red to green choices as a function of log ratio of red to green reinforcers for the rich
reinforcer-rate conditions of Experiment 1, as a function of delay interval. Values of the parameter estimates for the
slope (reinforcer sensitivity) and intercept are shown.
probability ratio is shown in Figure 9 (for the inforcer probabilities, the same effect as was
rich reinforcement conditions) and Figure 10 reported by Jones and White (1992) is clear
(for the lean reinforcement conditions). in Figures 9 and 10. At the short delays, the
Here, the log ratio of red to green choices slopes of the matching lines are relatively flat,
provides a measure of the tendency to choose and at the long delays they are steep.
red versus green, and was calculated accord- The slopes, which provide an index of sen-
ing to Equation 2. For both rich and lean re- sitivity to reinforcement, are compared in Fig-
Fig. 10. Log ratio of red to green choices as a function of log ratio of red to green reinforcers for the lean
reinforcer-rate conditions of Experiment 1, as a function of delay interval. Values of the parameter estimates for the
slope (reinforcer sensitivity) and intercept are shown.
PSYCHOPHYSICS OF REMEMBERING 105
EXPERIMENT 3:
INTERTRIAL INTERVAL
DURATION
An inconsistency between our result and
model predictions and some data reported by
McCarthy and Davison (1991) and McCarthy
and Voss (1995) has left us with an interesting
puzzle. They reported that reinforcer sensitiv-
ity decreased with increasing delay-interval du-
ration. McCarthy and Davison varied rein-
forcer probability over three values in a
delayed matching procedure in which the
samples were different brightnesses. For de-
lays of 0 s, 1 s, 3 s, and 25 s, respectively, re-
Fig. 12. Discriminability as a function of delay-interval inforcer sensitivities averaged .75, .52, .56,
duration for conditions with sample ratio requirements and .37 (with standard errors of about .09).
of FR 5 (from Experiment 1) and FR 1 (Experiment 2). This result is not consistent with those re-
ported by Jones and White (1992) and in the
present Experiments 1 and 2, and is incon-
sistent with the expectation that when control
Fig. 13. Log ratio of red to green choices as a function of log ratio of red to green reinforcers for the FR 1 ratio-
requirement condition of Experiment 1, as a function of delay interval. Values of the parameter estimates for the
slope (reinforcer sensitivity) and intercept are shown.
PSYCHOPHYSICS OF REMEMBERING 107
Fig. 16. Log ratio of red to green choices as a function of the log ratio of red to green reinforcers obtained for
correct choices as a function of delay, for the 1-s ITI condition of Experiment 3.
PSYCHOPHYSICS OF REMEMBERING 109
ues were 1.03, 0.90, and 1.07 for Birds S1, S2, inforcement history. Thus, instead of two
and S3, respectively. The results of Experi- physical stimuli, the model takes account of
ment 3 were not included in Figure 18 be- a whole range of effective stimuli that happen
cause the lack of variation in discriminability to be normally distributed. But the model is
did not contribute to assessment of the in- otherwise just the familiar matching law. The
verse relation predicted by the model. Nev- model predicts that the effect of the reinforc-
ertheless, the overall high levels of reinforcer er ratio in biasing choices is amplified by de-
sensitivity in Experiment 3 (Figure 17) are creasing the level of discriminability that oc-
consistent with the present model in that they curs with longer retention inter vals. For
are expected at low discriminability levels. example, if the distributions overlap exten-
When the data from Experiment 3 are plot- sively, variations in the arranged reinforcer
ted in Figure 18, they cluster at the lowest ratio result in large changes in obtained re-
levels of discriminability, and around the line inforcer ratios and hence choice ratios across
predicted by the model. Because the same the range of stimulus effect values. But with
predicted function as in Figure 7, generated virtually nonoverlapping distributions, ob-
by varying standard deviations with D arbi- tained reinforcer ratios and hence choice ra-
trarily fixed at 1, is drawn for the different tios uniformly favor red for x values under
birds in Figure 18, the correspondence of the the red distribution and uniformly favor
data to the predicted function provides con- green under the green distribution, relatively
vincing evidence that our model successfully independently of variation in the arranged
predicts the interaction between reinforcer reinforcer ratios. In the experiments report-
sensitivity and discriminability. ed here, there was clear evidence for this pre-
diction, and also for the more general pre-
diction that factors that decrease overall
GENERAL DISCUSSION levels of discriminability increase sensitivity to
To summarize the main point of the model reinforcement.
we propose here, the individual’s trial-by-trial Our behavioral theory of remembering
tendency to choose one or the other com- bears obvious similarities to signal-detection
parison stimulus in a delayed matching pro- theory, but it differs in one critical respect.
cedure is determined by the relative proba- Detection theory assumes that the individual
bility of previously obtained reinforcers arrives at a decision by setting a criterion
associated with the value x of the stimulus ef- somewhere along the stimulus effect contin-
fect on the current trial. The value x is the uum. If x on a given trial exceeds that crite-
stimulus effect at the time of remembering. rion, the pigeon chooses red; otherwise,
The variation in stimulus effect is defined in green is selected. According to this account,
terms of a pair of hypothetical discriminal every trial involves a decision on the part of
processes or distributions. With increasing re- the bird (namely, does x exceed the criterion
tention-interval duration, the variances of the or not?). The decision criterion plays no role
distributions (discriminal dispersions) in- in the present nonmediational theory of re-
crease. Discriminability is predicted to de- membering. Instead, on each trial, the bird’s
crease as a result of the increased overlap be- behavior is governed by the history of rein-
tween the distributions. Discriminability may forcement for choosing red or green under
also decrease as a result of other factors that the prevailing conditions. If, given a value of
lead to a reduction in the separation of the x, a response to green has been reinforced
means of the distributions. more often than a response to red, the bird
The model can be characterized in terms will be more likely to choose green than red
of a conditional discrimination in which according to the matching law.
there is a continuum of values of the discrim- The version of the model described above
inative stimuli. That is, although only two has two parameters, one for the distance be-
stimuli are actually used (red and green), tween the discriminal processes and another
their effects vary from trial to trial, thereby for their variance. Further possibilities in-
creating a distribution of effective stimulus clude different variances for the two distri-
values for each stimulus. Each effective stim- butions, as may occur in delayed matching
ulus value has associated with it a unique re- for asymmetrical sample stimuli (Wixted &
PSYCHOPHYSICS OF REMEMBERING 111
Dougherty, 1996). Bias and undermatching of a composite of the factors that influence
in the effect of the reinforcer ratio on the the overall level of performance. That is, the
trial-by-trial choice probability may also be in- disparity parameter D may be related to wave-
cluded. These factors have not been incor- length disparity (for a hue discrimination) in
porated here because they involve adding addition to factors such as the FR sample re-
free parameters to an otherwise simple mod- quirement or ITI duration. A related issue is
el. Quantitative fits of the model to data whether the stimulus effect dimension should
would be enhanced, however, by inclusion of be construed in sensory or memorial terms,
the additional parameters. an issue related to our notion that stimulus
The present model bears some apparent effect is defined at the time of remembering.
similarity to detection models in that both as- Although a memorial interpretation is an ob-
sume distributions of stimulus effect along a vious possibility, the notion that the effect of
psychological continuum, following Thur- the temporally distant sample stimulus may
stone’s notion of discriminal processes (Luce, be direct is consistent with a sensory inter-
1994). The question of how to interpret this pretation (White, 1991). That is, if all of the
continuum is of importance to the issue of variation in stimulus effect can be attributed
how the present model differs from other to factors in the environment, a ‘‘direct re-
models of recognition memory. membering’’ approach can be sustained. It
Signal-detection theory was applied to rec- may be more plausible, however, to adopt the
ognition memory by Parks (1966) as suggest- view that organismic processes contribute to
ed by Murdock (1965) and later by Banks stimulus variation. In either case, the stimulus
(1970) and Lockhart and Murdock (1970). effect dimension seems to be related to hue
Snodgrass and Corwin (1988) give a clear (e.g., for red and green sample stimuli), in
summary of these models and discuss the which case the means of the stimulus effect
problem of measuring response bias in rec- distributions should not drift towards each
ognition memory. In the model described by other with increasing delay, as was assumed
Parks, old and new items vary in their degree by White and Cooney (1996). For this reason,
of ‘‘familiarity’’ or, in other models, ‘‘memory it was assumed in the present model that dis-
strength.’’ Familiarity or memory strength tribution variances increased with increasing
varies from item to item according to a nor- delay, with the location of the means remain-
mal distribution. The mean of the probability ing constant. A mediational view consistent
distribution for old items is greater in famil- with signal-detection interpretations of the
iarity or memory strength than that for new stimulus effect dimension is that a value x on
items, if old is discriminable from new. The a given trial provides ‘‘evidence’’ on the basis
dimension along which item strength varies of which the choice response is made. For
is a psychological continuum (Luce, 1994), in example, for red versus triangle samples (for
that it reflects variation in how stimuli may be stimuli on different dimensions), the value of
represented in the nervous system or stored x provides evidence for the redness versus tri-
in long-term memory. As applied to the angularity of the previously presented sam-
matching-to-sample paradigm or similar two- ple. As it happens, the present model is silent
alternative choice procedures, however, there on these alternatives, but does emphasize
is little sense in defining a zero point for that the stimulus effect value on which the
memory strength or familiarity, unless the choice is conditional in the model is defined
stimulus effect continuum is interpreted as at the time of remembering, consistent with
extending from ‘‘greenness’’ to ‘‘redness.’’ the treatment of remembering as delayed
In detection models, the dimension is por- stimulus control.
trayed as a decision axis (Green & Swets, As a final note, we offer a comment on the
1966). A decision based on a criterion value scope of the model. The aim was to present
on the dimension provides the main basis for a model that incorporated the effects of the
the recognition response. But in the present reinforcer ratio in delayed matching-to-sam-
model, there is some difficulty in identifying ple procedures, or other remembering pro-
the continuum as a decision axis, because no cedures in which the choice alternatives were
criteria are assumed. One possibility is to de- explicit. The general prediction that the re-
fine the stimulus effect dimension in terms inforcer effect becomes more influential as
112 K. GEOFFREY WHITE and JOHN T. WIXTED
Lockhart, R. S., & Murdock, B. B. (1970). Memory and Roberts, W. A. (1972). Short-term memory in the pi-
the theory of signal detection. Psychological Review, 74, geon: Effects of repetition and spacing. Journal of Ex-
100–109. perimental Psychology, 94, 74–83.
Luce, R. D. (1963). Detection and recognition. In R. D. Snodgrass, J. G., & Corwin, J. (1988). Pragmatics of mea-
Luce, R. R. Bush, & E. Galanter (Eds.), Handbook of suring recognition memory: Applications to dementia
mathematical psychology (Vol. 1, pp. 103–189). New and amnesia. Journal of Experimental Psychology: General,
York: Wiley. 117, 34–50.
Luce, R. D. (1994). Thurstone and sensory scaling: Stevens, S. S. (1961). To honor Fechner and repeal his
Then and now. Psychological Review, 101, 271–277. law. Science, 133, 80–86.
Macmillan, N. A., & Creelman, C. D. (1991). Detection Thomas, E. A. C. (1975). Criterion adjustment and prob-
theory: A user’s guide. New York: Cambridge University ability matching. Perception & Psychophysics, 18, 158–
Press. 162.
McCarthy, D., & Davison, M. (1991). The interaction be- Thomas, E. A. C., & Legge, D. (1970). Probability match-
tween stimulus and reinforcer control on remember- ing as a basis for detection and recognition decisions.
ing. Journal of the Experimental Analysis of Behavior, 56, Psychological Review, 77, 65–72.
51–66. Thurstone, L. L. (1927). A law of comparative judgment.
McCarthy, D., & Voss, P. (1995). Delayed matching-to- Psychological Review, 34, 273–286.
sample performance: Effects of relative reinforcer fre- White, K. G. (1985). Characteristics of forgetting func-
quency and of signaled versus unsignaled reinforcer tions in delayed matching to sample. Journal of the Ex-
frequencies. Journal of the Experimental Analysis of Be- perimental Analysis of Behavior, 44, 15–34.
havior, 63, 33–52. White, K. G. (1986). Conjoint control of performance
McCarthy, D., & White, K. G. (1987). Behavioral models in conditional discrimination by successive and simul-
of delayed detection and their application to memory. taneous stimuli. Journal of the Experimental Analysis of
In M. L. Commons, J. Mazur, J. A. Nevin, & H. C. Behavior, 45, 161–174.
Rachlin (Eds.), Quantitative analyses of behavior: Vol. 5. White, K. G. (1991). Psychophysics of direct remember-
The effect of delay and intervening events on reinforcement ing. In J. A. Commons, M. C. Davison, & J. A. Nevin
value (pp. 29–54). New York: Erlbaum. (Eds.), Models of behavior: Signal detection (pp. 221–
McLean, A. P., & White, K. G. (1983). Temporal con- 237). New York: Erlbaum.
straint on choice: Sensitivity and bias in multiple White, K. G., & Cooney, E. B. (1996). The consequences
schedules. Journal of the Experimental Analysis of Behav- of remembering: Independence of performance at
ior, 39, 405–426. different retention intervals. Journal of Experimental
Murdock, B. B. (1965). Signal-detection theory and Psychology: Animal Behavior Processes, 22, 51–59.
short-term memory. Journal of Experimental Psychology, White, K. G., Pipe, M.-E., & McLean, A. P. (1985). A note
70, 443–447. on the measurement of stimulus discriminability in
Nevin, J. A. (1969). Signal detection theory and operant conditional discriminations. Bulletin of the Psychonomic
behavior: A review of David M. Green and John A. Society, 23, 153–155.
Swets’ Signal Detection Theory and Psychophysics. Journal Williams, B. A. (1988). Reinforcement, choice, and re-
of the Experimental Analysis of Behavior, 12, 475–480. sponse strength. In R. C. Atkinson, R. J. Herrnstein,
Nevin, J. A. (1981). Psychophysics and reinforcement G. Lindzey, & R. D. Luce (Eds.), Stevens’ handbook of
schedules. In M. L. Commons, J. E. Mazur, J. A. Nevin, experimental psychology: Vol. 2. Learning and cognition
& H. Rachlin (Eds.), Quantitative analyses of behavior: (pp. 167–244). New York: Wiley.
Vol. 1. Discriminative properties of reinforcement schedules Wixted, J. T. (1989). Nonhuman short-term memory: A
(pp. 3–27). Hillsdale, NJ: Erlbaum. quantitative reanalysis of selected findings. Journal of
Nevin, J. A., Cate, H., & Alsop, B. (1993). Effects of dif- the Experimental Analysis of Behavior, 52, 409–426.
ferences between stimuli, responses, and reinforcer Wixted, J. T. (1993). A signal detection analysis of mem-
rates on conditional discrimination performance. ory for nonoccurrence in pigeons. Journal of Experi-
Journal of the Experimental Analysis of Behavior, 59, 147– mental Psychology: Animal Behavior Processes, 19, 400–
161. 411.
Nevin, J. A., & Grosch, J. (1990). Effects of signaled re- Wixted, J. T., & Dougherty, D. H. (1996). Memory for
inforcer magnitude on delayed matching-to-sample asymmetric events. In D. Medin (Ed.), The psychology
performance. Journal of Experimental Psychology: Animal of learning and motivation (Vol. 35, pp. 89–126). San
Behavior Processes, 16, 298–305. Diego: Academic Press.
Parks, T. E. (1966). Signal-detectability theory of recog- Wixted, J. T., & Ebbesen, E. B. (1991). On the form of
nition-memory performance. Psychological Review, 73, forgetting. Psychological Science, 6, 409–415.
44–58. Yonelinas, A. P. (1994). Receiver-operating characteris-
Ratcliff, R., McKoon, G., & Tindall, M. (1994). Empirical tics in recognition memory: Evidence for a dual-pro-
generality of data from recognition memory receiver- cess model. Journal of Experimental Psychology: Learning,
operating characteristic functions and implications Memory, and Cognition, 20, 1341–1354.
for the global memory models. Journal of Experimental
Psychology: Learning, Memory, and Cognition, 20, 763– Received April 29, 1998
785. Final acceptance August 13, 1998