Académique Documents
Professionnel Documents
Culture Documents
Diagnostic Ability?
Robert D. Morgan
Oklahoma State University
Robert D. Morgan is now a postdoctoral fellow in forensic psychology in the Department of Psychiatry at the
University of Missouri-Kansas City.
Correspondence concerning this article should be addressed to Robert D. Morgan, Department of Forensic
Services, Western Missouri Mental Health Center, 600 E. 22nd Street, Kansas City, MO 64108; e-mail:
RobertDMorgan@juno.com
Introduction
erations of history, signs, and symptoms (Reid & Wise, 1989). The DSM-III-R (Ameri-
can Psychiatric Association, 1987) decision trees for differential diagnosis were developed
to assist professionals in understanding the organization and hierarchic structure of the
classification (p. 377). It does not seem unreasonable to suggest that increased under-
standing in this regard would result in increased diagnostic accuracy.
Millon (1983) described what are probably two typical but contrasting views of the
usefulness of the decision trees: on the one hand he commented that the decision trees are
likely to be considered an unnecessary encumbrance for routine diagnostic tasks, quite
impractical for everyday decision making and perhaps most relevantly, abhorrent to cli-
nicians accustomed to the diagnostic habit of intuitive synthesis (p. 809). Contrary to
Millons hypothesis, Timmermans and Vlek (1992) indicate that decision aids are designed
to solve difficult decision problems. The justification for using decision aids lies in the
shortcomings of human judgment (p. 50). Furthermore, complex problems require more
cognitive effort and often result in the use of simplified decision strategies and a less
complete evaluation of information (e.g., Olshavsky, 1979; Payne, 1976). Millon (1983)
also appears to recognize the potential of the decision trees as he observed that should
the method guarantee significantly greater diagnostic accuracy . . . then it might gain a
sufficient following to override the inertia of traditional practice (p. 809).
The DSM decision trees appear to be a novel experiment by the DSM-III task force
that has not been taken too seriously as there are no studies that have investigated their
use in diagnostic decision-making. In fact, no studies have investigated the use of the
DSM decision trees in diagnostic decision-making. The purpose of these experiments
was to evaluate the effects of the DSM decision trees on diagnostic accuracy. Experiment
1 examined whether the use of the decision trees increased the accuracy of DSM-III-R
diagnoses, and it was hypothesized that they would. The purpose of Experiment 2 was to:
(a) replicate and expand Experiment 1, and (b) examine whether the use of the DSM-
III-R decision trees increased the accuracy of diagnoses for those participants with less
DSM experience and also decreased the time of diagnoses. It was hypothesized that the
use of the decision trees would increase diagnostic accuracy across groups with various
levels of DSM-III-R experience and that the use of the decision trees would decrease the
diagnostic time across these groups.
Experiment 1
Method
(with and without the decision trees), and the order of presentation of the vignettes also
was assigned randomly. Thus, each vignette had an equal opportunity to be in either
condition and appear in any position (e.g., first, second, third, etc.).
The participants used the DSM-III-R and photocopies of the five DSM-III-R deci-
sion trees when making their diagnoses. An 11-item questionnaire concerning the use of
the decision trees also was administered to the participants. The first six questions inquired
about the participants previous use, understanding, and attitude about using the decision
trees, and possible future use of the decision trees. Participants responded to these ques-
tions by making ratings on Likert-type scales ranging from one (not at all) to five
(very much). The last five questions pertained to the method employed when using the
decision trees and demographics including sex, age, year in graduate school, and number
of weeks of DSM-III-R training.
Procedure. The participants first were informed that they would receive five case
vignettes and that they were to make an Axis I DSM-III-R diagnosis. They were instructed
to use the method that they usually used when making their diagnoses with the exception
that if they usually used the decision trees provided in the manual they were to refrain
from using them with these vignettes. The participants then were given five of the case
vignettes and instructed to make an Axis I diagnosis for each case. They were told that the
Axis I diagnosis was to come from one of 11 categories, all of which were represented in
the decision trees (e.g., schizophrenia, mood disorder, and anxiety disorder).
Upon completion of the five vignettes, the participants were informed that they would
be given five more case vignettes, and that again they were to make an Axis I DSM-III-R
diagnosis. This time however, they were to use the decision trees when making their
diagnosis. The experimenter then gave two suggestions on how to implement the decision
trees when making a diagnosis. Method one consisted of using the decision trees as a
check of their usual method of diagnosing (i.e., method used on the first five vignettes).
Method two consisted of locating the appropriate decision tree and then proceeding down
the tree, referring to the diagnostic criteria in the manual as needed, until the participants
arrived at a diagnosis.
The participants then were given the other five case vignettes (again representing the
five decision trees) and copies of the five DSM-III-R decision trees. They were instructed
to implement the decision trees to make an Axis I DSM-III-R diagnosis from one of the
11 categories using either one of the two methods suggested by the experimenter or any
other preferred method. When finished with these case vignettes, the participants were
asked to complete the questionnaire.
Figure 1. Diagnostic accuracy when using the decision trees by weeks of experience.
78 Journal of Clinical Psychology, January 2000
Table 1
Descriptive Statistics for Preference and Use Ratings
With Respect to the Decision Trees
Standard
Question Mean Deviation
Experiment 2
Experiment 2 expands upon Experiment 1 and addresses some of the limitations in the
original study. Lambert and Wertheimer (1988) found that diagnostic accuracy increases
significantly with relevant education and relevant experience. A study by Webb, Gold,
Johnstone, and Diclemente (1981) showed that after a two and one half day training
program, participants with no previous DSM-III exposure were able to agree with an
expert opinion on 74% of diagnoses of videotaped cases. While the training program
used by Webb et al. lasted two and one half days, Malt (1986) demonstrated that reliable
DSM-III diagnoses could be achieved with a training program that lasts only a few hours
and with reference to the decision trees and the diagnostic criteria alone. These studies
indicate that diagnostic accuracy can be improved with training.
A final note concerning the decision trees is their practicality for use in clinical
practice. Millon (1983) indicated that the decision trees would seem to increase the diag-
nostic time and would be an encumbrance for the diagnostic process; however, an
alternative hypothesis would be that the use of the decision trees may provide clinicians
with a more-effective diagnostic approachtherefore diagnostic time would decrease.
Experiment 2 improves upon Experiment 1 with an increased sample size, an a priori
distinction between participants level of DSM experience, and an assessment of diag-
nostic speed and diagnostic confidence.
Method
Conditions A and B then were assigned randomly to one of two treatment orders
(decision trees used on the first five vignettes or second five vignettes) for each partici-
pant. Thus, every participant received Condition A and Condition B, but half (10 from
each level of experience group) used the decision trees on their first five vignettes and
half (the remaining 10 from each level of experience group) used the decision trees on
their second five vignettes. The conditions (A & B) were assigned randomly to the order
of tree use, with the exception that the two conditions appeared in both orders of tree use
an equal number of times (30 times). Thus, of the 60 participants, 15 (five from each level
of experience group) received condition A first and used the decision trees when making
their diagnosis; 15 participants received condition A first and did not use the decision
trees when making their diagnosis; 15 participants received condition A second and used
the decision trees when making their diagnosis; and 15 participants received condition A
second and did not use the decision trees when making their diagnosis. The same proce-
dure was incorporated for condition B.
The participants were allowed to use the DSM-III-R when making their diagnoses.Also,
a 5-item questionnaire about the use of the decision trees was administered to the partici-
pants after diagnosing all of the vignettes. Participants responded to these questions by making
ratings on Likert-type scales with one being not at all and seven equaling very much.
Finally, a standard digital stopwatch was used for the timing of the diagnoses.
Procedure. The participants from the different groups were assigned to one of two
Orders of decision-tree use before they arrived to participate in this experiment. Partici-
pants in the first Order used the decision trees on the first five vignettes and did not use
the decision trees on the second five vignettes. Participants in Order 2 did not use the
decision trees on the first five vignettes and did use the decision trees on the second five
vignettes. Each Order of decision-tree use consisted of 10 participants from each expe-
rience group so that of the 20 participants from the experienced group, 10 were in Order 1
and 10 were in Order 2. A partially random assignment procedure was used in that after
10 participants from a particular experience group had been assigned randomly to one of
the two Orders, the rest of the participants in that group were assigned to the other Order.
The participants were informed that they would receive five case vignettes one at a
time and that they were to make an Axis I DSM-III-R diagnosis for each vignette. They
were informed that while they were to make an Axis I diagnosis, they did not need to
make detailed diagnostic specifications (e.g., severity). An example of an Axis I diagno-
sis without specifications was given to the participants. The participants were told that
the Axis I diagnosis was to come from one of 11 categories, all of which were represented
in the decision trees. A listing of these categories was given to the participants. The
participants also were informed that they would be timed on each vignette but that there
was no time limit and they could take as much time as they needed to complete each
vignette. They also were asked to make confidence ratings pertaining to each diagnosis.
The participants then were given five of the case vignettes one at a time and instructed to
make an Axis I diagnosis for each case vignette.
The participants were instructed to use the decision trees or to not use the decision
trees when making their diagnoses, depending upon order. Because some participants
might have been unfamiliar with the decision trees, the experimenter provided two sug-
gestions on how to implement the decision trees when making a diagnosis (see Experi-
ment 1). When using the decision trees the participants could use one of the two methods,
experiment with both methods, or use another preferred method. Finally, the participants
were timed for each vignette starting from the time they were given the vignette and
ending when they were completed with their diagnosis (i.e., when they were done writing
their diagnosis).
Do the DSM Decision Trees Improve Diagnostic Ability? 81
Upon completion of the first five vignettes, the participants were informed that they
would be given five more case vignettes, again one at a time, and that they were again to
make an Axis I DSM-III-R diagnosis. This time, however, participants in Order 1 were
told to use the decision trees and participants in Order 2 were told to not use the decision
trees on the five vignettes. The participants then were given the other five case vignettes
(again representing the five decision trees) one at a time. They again were instructed to
make an Axis I DSM-III-R diagnosis from one of the 11 categories without using the
decision trees or by using the decision trees, depending on order. The participants again
were timed on each vignette and asked to make confidence ratings. When finished with
these case vignettes, the participants were asked to complete the questionnaire.
Table 2
Means and Standard Deviations for Experience Level With and Without the Trees
on Diagnostic Accuracy, Time, and Confidence
Group
Dependent Variable M SD M SD M SD M SD M SD M SD
Specific accuracy
with tree 0.50 0.71 0.20 0.42 0.70 0.95 0.50 0.71 1.90 0.57 2.20 1.30
Specific accuracy
without tree 0.50 0.85 0.60 0.84 1.00 1.05 0.70 0.68 2.20 0.79 1.90 0.99
Class accuracy
with tree 2.10 1.29 2.40 1.17 2.20 0.79 3.10 0.99 2.50 0.85 3.60 0.84
Class accuracy
without tree 1.60 1.43 1.60 1.27 2.80 1.62 1.80 1.03 3.20 0.92 2.90 0.88
Time with tree
(per vignette) 6.27 2.14 5.95 1.23 7.45 2.54 5.11 1.48 7.03 2.23 5.94 0.99
Time without tree
(per vignette) 4.91 1.83 7.20 2.31 5.99 1.37 5.77 2.02 5.68 1.36 5.81 1.14
Confidence with tree
(per vignette) 4.24 0.84 5.04 0.84 3.92 0.68 3.96 0.91 5.54 0.71 4.74 0.72
Confidence without
tree (per vignette) 3.78 1.01 4.60 1.36 4.20 1.22 3.56 0.95 5.01 0.70 4.84 0.90
Note. Order 1 5 Trees used only on first 5 vignettes. Order 2 5 Trees used only on second 5 vignettes. Range of scores for
each dependent variable for each group can be determined from the raw data.
ever, on class diagnostic accuracy, the experienced participants were significantly more
accurate than the no-experience participants only. These results substantiated Lambert
and Wertheimers (1988) study in that experienced participants made more accurate
diagnosis.
In addition to the main effect of experience level, a significant Trees 3 Order inter-
action, F(1, 54) 5 8.06, p 5 .006, was found for class diagnostic accuracy (see Fig. 2). In
this interaction, t-tests indicated that when using decision trees, the participants were
Table 3
Multivariate Analysis of Variance
(Experience Level 3 Decision Tree 3 Order)
Figure 2. Class diagnostic accuracy when using the decision trees by order. For Order 1, when the decision
trees were used, it was on the first five vignettes; thus no decision trees were used on the second five
vignettes. For Order 2, when the decision trees were used, it was on the second five vignettes; thus no
trees were used on the first five vignettes.
significantly more accurate using the trees when they were accompanied by some prac-
tice (i.e., using the trees on the second five vignettes rather than on the first five), t 5
2.85, p 5 .005; however, when not using the trees, practice did not significantly affect
class diagnostic accuracy. In addition, on the first five vignettes, there was no significant
difference in class diagnostic accuracy between using the trees and not using the trees;
however, when using the trees on the second five vignettes there was a significant dif-
ference than when not using the trees on the first five vignettes, t 5 3.382, p , .001.
This Tree 3 Order interaction for class diagnostic accuracy showed that participants
were most accurate if they used the trees on the second five vignettes. Thus, as indicated
by the interaction, practice will help with diagnostic accuracy but practice and the use of
the decision trees combined led to the most accurate class diagnosis. This result clearly
indicated that using the trees alone, and practice alone, were not sufficient to improve
class diagnostic accuracy.
There were no other significant univariate main effects or interaction effects for
specific diagnostic accuracy or class diagnostic accuracy. Analysis for the diagnostic
time data revealed a significant Tree 3 Order interaction, F(1, 54) 5 17.06, p 5 .001, and
is shown in Figure 3. t-Tests showed that participants who used the trees on the first five
vignettes were significantly slower on these vignettes than when not using the trees on
the second five vignettes t 5 3.898, p , .001. There were no significant differences
between participants who did not use the trees on the first five vignettes and participants
who did use the trees on the second five vignettes. In addition, when using the trees,
participants took significantly less time to make their diagnosis if they used the trees on
the second five vignettes rather than on the first five vignettes t 5 2.63, p , .02. How-
84 Journal of Clinical Psychology, January 2000
Figure 3. Diagnostic time when using the decision trees by order. For Order 1, when the decision trees
were used, it was on the first five vignettes; thus no decision trees were used on the second five vignettes.
For Order 2, when the decision trees were used, it was on the second five vignettes; thus no trees were used
on the first five vignettes.
ever, when not using the trees, there were no significant differences between the two
orders (indicating that practice did not effect diagnostic time when not using the trees).
This interaction indicates that when using the decision trees, participants took less time to
make a diagnosis if they had some practice, and when not using the decision trees prac-
tice made no significant difference on diagnostic time. However, participants took less
time to make a diagnosis when they were not using the trees but did have some practice
than when using the trees but without practice. There were no other significant effects for
diagnostic time.
Finally, univariate analyses performed on the Diagnostic Confidence data showed
significant main effects for Experience Level, F(2, 54) 5 8.18, p 5 .001, and Trees
F(1, 54) 5 4.92, p 5 .031. Specific comparisons were made using the Scheffe procedure,
and results indicated that the experienced group was significantly ( p , .05) more confi-
dent in their diagnosis than were the less-experienced participants. However, they were
not significantly more confident than the no-experience participants. Also, the less-
experienced and no-experience participants did not differ significantly in their diagnostic
confidence. The main effect for the Trees variable indicates that when the participants
used the decision trees, they were more confident in their diagnosis than when they did
not use the decision trees. A significant Experience Level 3 Order interaction, F(2, 54) 5
Do the DSM Decision Trees Improve Diagnostic Ability? 85
Table 4
Frequency for Response on Decision-Tree Use
Question 1 2 3 4 5 6 7
accuracy. Thus, the great majority of participants perceived the decision trees to be at
least somewhat helpful. Similarly, 58 of the 60 participants responded that they did not
have the decision trees memorized prior to this study and that they used the trees essen-
tially as a check of their usual method of diagnosing or as a check of an initial diagnosis.
Some limitations with regard to the present study need to be addressed. First, this
study did not adequately assess specific diagnostic accuracy. As indicated above, those
participants with less DSM-III-R experience tended to make class diagnoses on some or
all of the vignettes, thus possibly distorting the accuracy of the specific diagnostic results.
In addition, the no-experience participants were undergraduate psychology majors, thus
their representation as DSM users and potential generalizability is limited. Another pos-
sible limitation of this study could have been subject familiarity with the case vignettes.
While an attempt was made to exclude those vignettes that may have been utilized pre-
viously in academic coursework, subjects were not asked about their familiarity with the
case vignettes. This poses a potential limitation to the internal validity of the study as
subjects diagnostic accuracy may have been affected by familiarity with any of the case
vignettes. Finally, to simplify diagnostic comparisons, only those diagnoses that are rep-
resented by the decision trees (i.e., 11 diagnostic classes) were selected for inclusion in
this study; however, this experimental control may limit the generalizability of the study.
In spite of these limitations, the findings of this study suggest that the decision trees
warrant consideration as a diagnostic tool for those clinicians with limited diagnostic
experience. As indicated in the results, the decision trees did have a significant affect on
class diagnostic accuracy and time depending on the order in which the trees were used.
Like the initial experiment, these results indicated that if the decision trees and some
practice are provided, the result will be improved diagnostic accuracy and decreased
diagnostic time. While the results from the two experiments are similar, the latter exper-
iment expanded the initial experiment by including diagnostic time. This experiment also
used actual experience-level differences whereas the first experiment included experience-
level differences found in post-hoc analysis. The present experiment used 60 participants
whereas the first experiment used only 15 participants. In addition, the present study had
equal sample sizes for the order of using the trees. That is, 30 participants (10 from each
group) used the decision trees on the first five vignettes and 30 used the trees on the
second five vignettes. In the first experiment all participants used the decision trees on
the second five vignettes, thus leaving open the possibility of practice effects. Thus, the
first experiment had several limitations (possibility of practice effects, groups selected on
post-hoc analysis) for which the latter experiment accounted.
From these findings, it is evident that the trees facilitate a more accurate class diag-
nosis as well as decreased diagnostic time when at least minimal practice is employed.
This finding suggests that teaching of the trees alone is not sufficient, but if the trees are
taught and practice is made available, then participants will tend to make a more accurate
class diagnosis and decrease their diagnostic time.
General Discussion
The results of these studies parallel those of Elsteins et al. (1978) results with medical
students by deemphasizing an over reliance on diagnostic intuition or insight by stressing
the inclusion of a systematized guide for determining diagnosis. More specifically, the
two experiments presented here demonstrate that the DSM decision trees can be a useful
tool, especially for practitioners with minimal DSM experience. It was demonstrated that
for those clinicians with less experience, the decision trees, when provided with minimal
Do the DSM Decision Trees Improve Diagnostic Ability? 87
practice, facilitated more accurate diagnoses and also decreased participants diagnostic
time. Thus, Millons (1983) initial assessment of the trees as unnecessary and an encum-
brance was premature. These results suggest that neophyte clinicians may benefit from
instruction using the trees and that with minimal practice the trees can be a useful tool in
the neophytes clinical repertoire. Finally, this study provides support for the inclusion of
the decision trees in the DSM classification system. While these experiments imple-
mented the use of the DSM-III-R, it does not seem unreasonable to suggest that similar
findings would be found with the DSM-IV.
While the results presented here provide preliminary evidence for the usefulness of
the decision trees, further studies are warranted. Future studies need to account for the
limitations of these studies and need to investigate how the trees may be implemented
more effectively as a tool for clinicians in the real world. First, a similar study needs to
incorporate a more representative sample between participants experience and training
level (e.g., doctoral level practitioners, masters degree level clinicians, advanced gradu-
ate students, and intermediate graduate students). Additionally, future studies should
replicate these studies with professionals from disciplines who engage in diagnostic
decision-making (e.g., social workers and psychiatrists). In addition, further studies need
to incorporate the full range of diagnosis and not limit the diagnostic options to only
those covered by the decision trees. Such studies would increase positively the general-
izability of the results to real world practitioners.
In conclusion, while these studies provide evidence for the utility of the DSM deci-
sion trees, a caution is necessary. While inexperienced participants were used in this
study, and the use of the decision trees with some practice on vignettes appeared to
facilitate a more accurate class diagnosis as well as decreased diagnostic time, the use of
the decision trees should not be construed as a replacement for experience or clinical
intuition; rather, the decision trees may serve as an adjunct to the diagnostic decision-
making process.
References
Adams, H.E., & Cassidy, J.F. (1993). The classification of abnormal behavior. In P.B. Sutker &
H.E. Adams (Eds.), Comprehensive handbook of psychopathology (pp. 325). New York:
Plenum Press.
American Psychiatric Association. (1952). Diagnostic and statistical manual of mental disorders.
Washington, D.C.: Author.
American Psychiatric Association. (1980). Diagnostic and statistical manual of mental disorders
(3rd ed.). Washington, D.C.: Author.
American Psychiatric Association. (1987). Diagnostic and statistical manual of mental disorders
(3rd ed., rev.). Washington, D.C.: Author.
American Psychiatric Association. (1989). Treatments of psychiatric disorders. Washington, D.C.:
Author.
Ash, P. (1949). The reliability of psychiatric diagnoses. Journal of Abnormal and Social Psychol-
ogy, 44, 272276.
Carson, R.C., Butcher, J.N., & Coleman, J.C. (1988). Abnormal psychology and modern life (8th
ed.). Glenview, IL: Scott, Foresman and Company.
Elstein, A.S., Shulman, L.S., & Sprafka, S.A. (1978). Medical problem solving: An analysis of
clinical reasoning. Cambridge, MA: Harvard University Press.
Lambert, L.E., & Wertheimer, M. (1988). Is diagnostic ability related to relevant training and
experience? Professional Psychology: Research and Practice, 19, 5052.
LeLaurin, K. (1990). Judgment-based assessment: Making the implicit explicit. TECSE, 10, 96110.
88 Journal of Clinical Psychology, January 2000
Malt, U.F. (1986). Teaching DSM-III to clinicians. Acta Psychiatrica Scandinavica Supplementum,
73, 6875.
Mehlman, B. (1952). The reliability of psychiatric diagnoses. Journal of Abnormal and Social
Psychology, 47, 577578.
Millon, T. (1983). The DSM-III: An insiders perspective. American Psychologist, 38, 804814.
Olshavsky, R.W. (1979). Task complexity and contingent processing in decision making: A repli-
cation and extension. Organizational Behavior and Human Performance, 24, 300316.
Payne, J.W. (1976). Task complexity and contingent processing in decision making: An information
search and protocol analysis. Organizational Behavior and Human Performances, 16, 366387.
Reid, W.H., & Wise, M.G. (1989). DSM-III-R training guide: For use with the American Psychi-
atric Associations diagnostic and statistical manual of mental disorders (3rd ed. rev.). New
York: Brunner/Mazel.
Spitzer, R.L., & Fleiss, J.L. (1974). A re-analysis of the reliability of psychiatric diagnosis. British
Journal of Psychiatry, 125, 341347.
Spitzer, R.L., Forman, J.B.W., & Nee, J. (1979). DSM-III field trials: I. Initial interrater diagnostic
reliability. American Journal of Psychiatry, 136, 815817.
Spitzer, R.L., Gibbon, M., Skodol, A.E., Williams, J.B.W., & First, M.B. (1989). Diagnostic and
statistical manual of mental disorders casebook (rev. ed.). Washington, D.C.: American Psy-
chiatric Press.
Stout, C. (1991). A methodological approach to differential diagnostics. In K.N. Anchor (Ed.), The
handbook of medical psychotherapy: Cost effective strategies in mental health (p. 141). Lewiston,
NY: Hogrefe and Huber.
Timmermans, D., & Vlek, C. (1992). Multi-attribute decision support and complexity: An evalua-
tion and process analysis of aided versus unaided decision making. Acta Psychologica, 80,
49 65.
Vaglum, P., Friis, S., Vaglum, S., & Larsen, F. (1989). Comparison between personality disorder
diagnoses in DSM-III and DSM-III-R: Reliability, diagnostic overlap, predictive validity. Psy-
chopathology, 22, 309314.
Ward, C.H., Beck, A.T., Mendelson, M., Mock, J.E., & Erbaugh, J.K. (1962). The psychiatric nomen-
clature. Archives of General Psychiatry, 7, 60 67.
Webb, L.J., Gold, R.S., Jonstone, E.E., & Diclemente, C.C. (1981). Accuracy of DSM-III diagnoses
following a training program. American Journal of Psychiatry, 138, 376378.