Académique Documents
Professionnel Documents
Culture Documents
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at
http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless
you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you
may use content in the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at
http://www.jstor.org/action/showPublisher?publisherCode=bmj.
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
Digitization of the British Medical Journal and its forerunners (1840-1996) was completed by the U.S. National
Library of Medicine (NLM) in partnership with The Wellcome Trust and the Joint Information Systems
Committee (JISC) in the UK. This content is also freely available on PubMed Central.
BMJ Publishing Group is collaborating with JSTOR to digitize, preserve and extend access to BMJ: British
Medical Journal.
http://www.jstor.org
Why we need observational studies to evaluate the effectiveness of
healthcare
Nick Black
The view is widely held that experimental methods all of the latter. I will return to this distinction later,
(randomised controlled trials) are the "gold stan? but firstit is necessary to document the reasons why
dard" for evaluation and that observational methods observational methods are needed. There are four
(cohort and case control studies) have little or no main reasons: experimentation may be unnecessary,
value. This ignores the limitations of randomised inappropriate, impossible, or inadequate.
quest for scientific rigour in evaluation, regardless of fractured bones. In all these examples observational
the method used. studies were adequate to demonstrate effectiveness.
My main contention is that those who are opposed to still uncertainty as to whether or not laparoscopic
the use of observational methods have assumed that techniques are associated with an increased risk
they represent an alternative to experimentation rather of injuries, such as bile duct damage during chole
than a set of complementary approaches. This in turn cystectomy.3 Huge observational dataseis are the only
stems from a misguided notion that everything can be practical means of acquiring such vital information.
investigated using a randomised controlled trial. In A second limitation, also arising from study size,
response Iwant to outline the limitations of randomised is the difficulty of evaluating interventions designed
trials and show that observational methods are needed to prevent rare events. Examples include accident
both to evaluate the parts randomised trials cannot prevention schemes and placing infants supine or on
reach and to help design and interpret correctly the their side sleep toprevent to sudden infant death
results obtained from these trials. syndrome. A randomised trial would have needed a
Before doing so I must clarify what I mean by few hundred thousand babies.
"observational" in this context. I am referring exclu? A third limitation of trials iswhen the outcomes of
sively to quantitative, epidemiological methods and interest are far in the future. Three well known
not qualitative, sociological methods in which data examples are the long term
consequences of oral
are collected through observation. The principal contraceptives, which may not be manifest for decades;
observational epidemiological methods are non the use of hormone replacement therapy to prevent
randomised trials, cohort studies (prospective and femoral fractures; and the loosening of artificial hip
retrospective), and case-control methods, though joints, forwhich a 10 to 15 year follow up isneeded. The
Health Services Research
relatively little use has been made of the latter beyond practical difficulties in maintaining such prolonged
Unit, Department of Public measures. studies or obser?
evaluating preventive prospective (whether experimental
Health and Policy, London can seen as are as are their costs.
The limitations of randomised trials be vational) considerable, With
School of Hygiene and
deriving from either the inherent nature of the method luck, there will occasionally be times when a random?
Tropical Medicine, Keppel
Street, London WOE 7HT (a limitation in principle) or from the way trials are ised trial addressing the question of current interest
conducted (a limitation in procedure). The importance has already been established decades before and
Nick Black, professor of
health services research of this distinction is that while little can be done patients from it can then be followed up. Unfortunately
about the former, improvements in the conduct of such serendipity is all too rare. As a practical alternative
BMJ 1996;312:1215-8 randomised trials could, in theory, overcome some or to doing nothing, retrospective observational studies
prostatic hyperplasia.910
What works well in pharmacological research may not work in the messier world of clinical
care POLITICAL AND LEGAL OBSTACLES
can be used to obtain some information on long term Thirdly, theremay be political obstacles if those who
outcomes.4 fund and manage health services do not want their
quence, the lack of any subsequent difference in attempt to subject radial k?ratotomy (an operation to
outcome between groups may under? correct short sightedness) to a randomised trial in the
comparison
estimate the benefits of the intervention. For example, United States.13 The
researchers were blocked by
it is well recognised that clinical audit is successful in private sector ophthalmologists who faced a major loss
improving the quality of health care only if the of income if the procedure was declared "experimental"
clinicians participating have a sense of ownership of the because this would have meant that health insurance
process.5 Such a "bottom up" approach is in stark companies would no longer reimburse them. As a
contrast to experimentation, in which the investigator result of legal action, the academic ophthalmologists
seeks to impose asmuch control on the subjects in the were forced to declare
the operation safe and effective
study as possible?that is, a "top down" approach. As and abandon any attempt at evaluation.
a consequence, randomised trials of audit might find Fortunately, legal obstacles are rare, but a common
less benefit than observational studies. The same may problem is that some interventions simply cannot be
be true formany interventions for which clinicians, or allocated on a random basis. These tend to be questions
patients, or both, have a preference (despite agreeing of how best to organise and deliver an intervention. For
to random allocation), and where patients need to example, a current consensus is that clinicians and
practical to subject a limited number of items to the gold standard. None of the 25 instruments that
experimental evaluation.16 We therefore need to take have been developed to judge the methodological
advantage of other methods to try and fill in the huge quality of trials includes any consideration of this
gaps that are always likely to exist in the experimental aspect.24 The same is true for the guidance provided by
published findings. the Cochrane Collaboration.25
be Discussion
Experimentation may inadequate
or Randomised controlled trials occupy a special
The external validity, "generalisability," of the
results extent to place in the pantheon of methods for assessing the
of randomised trials is often low.17 The
effectiveness of health care interventions. When
which the results of a trial are generalisable depends on
extent to which appropriate, practical, and ethical, a randomised trial
the the outcome of the intervention is
design should be used. I have tried to show that, for all
determined by the particular person providing the
care. At one extreme the outcome of pharmaceutical their well known methodological strengths, trials
validity. The first is that the health care professionals ? Failure to assess rare outcomes (by mounting large
who participate may be unrepresentative. They may trialswith thousands of patients)
have a particular interest in the topic or be enthusiasts ? Failure to assess long term outcomes (by continuing
and innovators. The setting may also be atypical, a
to follow up patients for many years)
teaching hospital for example. In one of the few
? Elimination of clinicians' and
randomised trials
of surgery for glue ear undertaken in patients' preferences
the United all the outpatient and surgical (by introducing preference arms26)
Kingdom,
care was performed ? Refusal to participate more
by a highly experienced consultant by clinicians (by using
surgeon; in real life most such work is performed acceptable methods of randomisation27)
by
relatively inexperienced junior surgeons.18 ? Ethical objections to randomisation (by exploring
Secondly, the patients who participate may be alternative less methods of
demanding obtaining
atypical. All trials exclude certain categories of patients. informed consent28)
Often the exclusion criteria are so restrictive that the
? Political and legal obstacles (by persuasion)
patients who are eligible for inclusion represent only a
small of the patients treated in
? The daunting size of the task (by vastly expanding
proportion being
normal practice. Only 4% of patients currently
the available funds for experimental studies)
undergoing coronary revascularisation in the United ? Overrestrictive patient eligibility criteria (by
States would have been eligible for inclusion in the rather than
undertaking pragmatic explanatory trials23)
trials that were conducted in the 1970s.19 It has
While all the proposed solutions could work in
been suggested that the same problem will limit the
theory, few of them are realistic in practice, presenting
usefulness of the current randomised controlled trials as they do enormous problems for researchers and,
comparing coronary artery surgery and angioplasty.20
more importantly, for research funding bodies. For
Similar problems occur in trials of cancer treatment.21
example, it is feasible to randomise tens of thousands of
Another facet of this problem is the absence of
people in a drug trial in which death is the only
privately funded patients from almost all randomised outcome of interest, but it is unrealistic if more
trials in theUnited Kingdom.
complex and sophisticated outcomes are the relevant
The problem eligibility of
may be exacerbated by a In many ways the problems that randomised
endpoints.
poor recruitmentrate. Although most trials fail to
trials encounter arise from a largely uncritical transfer
report their recruitment rate,4 those that do suggest
of a well developed scientific method in
pharma?
rates are often very low. As little is yet known about the
cological research to the evaluation of other health
sort of people who are prepared to have their treatment
technologies and to health services.
allocated on a random basis, it seems wise to assume
Several of the other limitations cannot be polarised
that theymay differ in important ways from those who between and but are a complex
principle practice mix
decline to take part.
of the two. These include:
And the third and final problem in generalising the
? Contamination between treatment groups
results of randomised trials is that treatment may be
Patients who may receive better ? The of clinicians who volunteer
atypical. participate unrepresentativeness
care, regardless of which arm of the trial they are in.22 to participate