Doctors are being encouraged to improve their critical
appraisalskills to make better use of medical research. But when
usingthese skills, it is important to remember that interpretationof data is inevitably subjective and can itself result in bias.
Facts do not accumulate on the blank slates of researchers'minds
and data simply do not speak for themselves.1
Good science inevitably embodies a tension between the empiricismof
concrete data and the rationalism of deeply held convictions.
Unbiased interpretation of data is as important as performing
rigorous experiments. This evaluative process is never totally
objective or completely independent of scientists' convictionsor
theoretical apparatus. This article elaborates on an insightof
Vandenbroucke, who noted that "facts and theories remaininextricably
linked... At the cutting edge of scientific progress,where new ideas
develop, we will never escape subjectivity."2
Interpretation can produce sound judgments or systematic error.Only
hindsight will enable us to tell which has occurred. Nevertheless,
awareness of the systematic errors that can occur in evaluative
processes may facilitate the self regulating forces of scienceand
help produce reliable knowledge sooner rather than later.
Interpretative processes and biases
in medical science
Science demands a critical attitude, but it is difficult toknow
whether you have allowed for too much or too little scepticism.Also,
where is the demarcation between the background necessaryfor making
judgments (such as theoretical commitments and previousknowledge)
and the scientific goal of being objective and freeof
preconceptions? The interaction between data and judgmentis often
ignored because there is no objective measure for the subjective components of
interpretation. Taxonomies of biasusually emphasise technical
problems that can be fixed.3 The
biases discussed below, however, may be present in the mostrigorous
science and are obvious only in retrospect.
Quality assessment and confirmation
bias
The quality of any experimental findings must be appraised.Was
the experiment well performed and are the outcomes reliableenough
for acceptance? This scrutiny, however, may cause aconfirmation
bias: researchers may evaluate evidence that supportstheir prior
belief differently from that apparently challengingthese
convictions. Despite the best intentions, everyday experienceand
social science research indicates that higher standardsmay be
expected of evidence contradicting initial expectations.
Two examples might be helpful. Koehler asked 297 advanced university science
graduate students to evaluate two supposedly genuineexperiments
after being induced with different "doses" of positiveand negative
beliefs through false background papers.4
Questionnairesshowed that their beliefs were successfully
manipulated. The students gave significantly higher rating to reports that
agreedwith their manipulated beliefs, and the effect was greateramong those induced to hold stronger beliefs. In another experiment,398 researchers who had previously reviewed experiments fora
respected journal were unknowingly randomly assigned to assess
fictitious reports of treatment for obesity. The reports were
identical except for the description of the intervention being
tested. One intervention was an unproved but credible treatment
(hydroxycitrate); the other was an implausible treatment (homoeopathicsulphur). Quality assessments were significantly higher forthe
more plausible version.5 Such confirmation bias
may becommon.w1 w2
Definitions of interpretation biases
Confirmation biasevaluating
evidence that supports one's preconceptions differently
fromevidence that challenges these
convictions
Rescue biasdiscountingdata by
finding selective faults in the experiment
Auxiliaryhypothesis biasintroducing
ad hoc modifications to implythat an
unanticipated finding would have been otherwise hadthe experimental conditions been different
Mechanism biasbeingless sceptical
when underlying science furnishes credibility
for the data
"Time will tell" biasthe phenomenon that
different scientists need different amounts of
confirmatoryevidence
Orientation biasthe possibility that the
hypothesisitself introduces prejudices and
errors and becomes a determinateof
experimental outcomes
Expectation and rescue and auxiliary
hypothesis biases
Experimental findings are inevitably judged by expectations,and
it is reasonable to be suspicious of evidence that is inconsistent
with apparently well confirmed principles. Thus an unexpectedresult
is initially apt to be considered an indication thatthe experiment
was poorly designed or executed.6 w3 This processof interpretation, so necessary in science, can give rise to
rescue bias, which discounts data by selectively finding faultsin
the experiment. Although confirmation bias is usually unintended,
rescue bias is a deliberate attempt to evade evidence that
contradicts expectation.
Instances of rescue bias are almost as numerous as letters tothe
editors in journals. The avalanche of letters in responseto the
Veterans Administration Cooperative randomised controlledtrial
examining the efficacy of coronary artery bypass graftingpublished
in 1977 is a well documented example.7 The trial
found no significant difference in mortality between 310 patients
treated medically and 286 treated surgically. A subgroup of113
patients with obstruction of the left main coronary artery,however,
clearly benefited from surgery.8 Instead of
settlingthe clinical question, the trial spurred fierce debate in
which supporters and detractors of the surgery perceived flaws that,
they claimed, would skew the evidence away from their preconceived
position. Each stakeholder found selective faults to justify
preexisting positions that reflected their disciplinary affiliations
(cardiology v cardiac surgeon), traditions of research (clinicalv physiological), and personal experience.9
Auxiliary hypothesis bias is a form of rescue bias. Insteadof
discarding contradictory evidence by seeing fault in theexperiment,
the auxiliary hypothesis introduces ad hoc modificationsto imply
that an unexpected finding would have been otherwisehad the
experimental conditions been different. Because experimental
conditions can easily be altered in so many ways, adjustinga
hypothesis is a versatile tool for saving a cherished theory.w4
Evidence pointing to an unwelcome finding in a randomised controlled
trial, for example, can easily be dismissed by arguments against the therapeutic
dose, its timing, or how patients were selected.Lakatos termed such
reluctance to accept an experimental verdicta scientist's "thick
skin."10 Thus, when early randomised
controlled trials showed that hormone replacement therapy didnot
reduce the risk of coronary heart disease,11
advocatesof hormone replacement therapy argued that it was still
valuablefor primary prevention because the study group was women
withestablished coronary heart disease, making the disease toofar advanced to benefit from the treatment.
Plausibility and mechanism bias
Evidence is more easily accepted when supported by accepted
scientific mechanisms. This understandable tendency to be less
sceptical when underlying science furnishes credibility cangive rise
to mechanism bias. Often, such scientific plausibilityunderlies and
overlaps the other biases I've described. Manyexamples exist where
with hindsight it is clear that plausibilitycaused systematic
misinterpretation of evidence. For example,the early negative
evidence for hormone replacement therapywould have undoubtedly been
judged less cautiously if a biologicalrationale had not already
created a strong expectation thatoestrogens would benefit the
cardiovascular system.12 w5 Similarly,the rationale for antiarrhythmic drugs for myocardial infarction was so
imbedded that each of three antiarrhythmic drugs hadto be proved
harmful individually before each trial could be terminated.13 w6And the link between Helicobacter pylori andpeptic ulcer was rejected initially because the stomach was
considered to be too acidic to support bacterial growth.14
Waiting for more evidence and "time
will tell" bias
The position that more evidence is necessary before making a
judgment indicates a judicious attitude that is central toa
scientific scepticism. None the less, different scientistsseem to
need different amounts of confirmatory evidence tofeel satisfied.
This discrepancy in duration conceals a subjectiveprocess that
easily can become a "time will tell" bias. Theevangelist, at one
extreme, is quick to accept the data as good evidence (or even proof).
Evangelists often have a vested intellectual, professional, or personal
commitment and may have taken partin the experiment being assessed.
At the other extreme arethe snails, who invariably find the data
unconvincing, perhapsbecause of their personal and intellectual
investment in old"facts." At the two extremes, as well as at all
points in between,there is no objective way to tell whether good
judgment or systematic error is operating. Max Planck described the "timewill tell" bias cynically: "a new scientific truth does not
triumph by convincing its opponents and making them see thelight,
but rather because its opponents eventually die, anda new generation
grows up that is familiar with it."15
Hypothesis and orientation bias
The above categories of potential biases all occur after dataare
collected. Sometimes, however, conviction may affect thecollection
of data, creating orientation bias. Psychologistscall this the
"experimenter's hypothesis as an unintended determinantof
experimental results."16 Thus, psychology
graduate students,when informed that rats were specially bred for
maze brightness,found that these rats outperformed those bred for
maze dullness,despite both groups really being standard laboratory
rats assignedat random.17 Somehow,
experimental and recording errors tendto be larger and more in the
direction supporting the hypothesis.w7w8
Summary points
Evidence does not speak for itself and mustbe interpreted for quality and likelihood of error
Interpretationis never completely
independent of a scientist's beliefs, preconceptions,or theoretical commitments
On the cutting edge of science,scientific
interpretation can lead to sound judgment or
interpretativebiases; the distinction can
often be made only in retrospect
Commoninterpretative biases include
confirmation bias, rescue bias, auxiliary hypothesis
bias, mechanism bias, "time will tell"bias,
and orientation bias
The interpretative process is anecessary
aspect of science and represents an ignored subjectiveand human component of rigorous medical inquiry
Numerous studies have noted that randomised controlled trials
sponsored by the pharmaceutical industry consistently favournew
therapies.18 Research outcomes seem to be
affected bywhat the researcher is looking for. It is unclear to what
extentthese apparent successes are the result of publication biasor matters of study design. Nonetheless, such results are consistentwith an orientation bias and explain the fact that some early double blind
randomised controlled trials performed by enthusiastsshow
efficacylike hyperbaric oxygen for multiple sclerosis19w9 or endotoxin antibodies for Gram negative septic shock20whereas subsequent trials cannot replicate the
outcome.19
Comments
This article is written from the perspective of philosophy of
science. From a statistical point of view, the arguments presented
are obviously compatible with a subjectivist or bayesian framework
that formally incorporates previous beliefs in calculationsof
probability. But even if we accept that probabilities measure
objective frequencies of events, the arguments still apply.After
all, the overall experiment still has to be assessed.
I have argued that research data must necessarily undergo atacit
quality control system of scientific scepticism and judgmentthat is
prone to bias. Nonetheless, I do not mean to reducescience to a
naive relativism or argue that all claims to knowledgeare to be
judged equally valid because of potential subjectivityin science.
Recognition of an interpretative process does notcontradict the fact
that the pressure of additional unambiguous evidence acts as a self regulating
mechanism that eventuallycorrects systematic error. Ultimately,
brute data are coercive.However, a view that science is totally
objective is mythical,and ignores the human element of medical
inquiry. Awarenessof subjectivity will make assessment of evidence
more honest,rational, and reasonable.21
Further
references (denoted by theprefix "w") are available on
bmj.com
This article is a shortened version of a paper written for aseminar on bias led by Fredrick Mosteller at Harvard University
and reflects his helpful feedback. Peter Goldman criticisedearlier
versions of the article and helped make it understandable.The
comments of Iain Chalmers and Al Fishman have been helpful,as was
the dedicated research of Cleo Youtz. All errors and shortcomings of the paper
belong solely to the author.
Funding: In part from grants 1R01
AT00402-01 and 1R01
AT001414
[GenBank]
from the National Institutes of Health, Bethesda, MD.
Competing interests: None declared.
References
Elstein AS. Human factors in clinical judgment: discussion
of Scriven's clinical judgment. In: Engelhardt HT, Spicker SF, Towers B.
Clinical judgment: a critical appraisal. Dordrecht: Reidel, 1979.
Vandenbroucke JP. Medical journals and the shaping of
medical knowledge. Lancet 1998;352: 2001-6.[CrossRef][ISI][Medline]
Koehler JJ. The influence of prior beliefs on scientific
judgments of evidence quality. Organ Behav Hum Decision Processess
1993;56: 28-55.[CrossRef]
Resch KI, Ernst E, Garrow J. A randomized controlled study
of reviewer bias against an unconventional therapy. J R Soc Med
2000;93: 164-7.[Abstract]
Kuhn TS. The structure of scientific revolutions.
Chicago: University of Chicago Press, 1962.
Jones DS. Visions of cure: visualization, clinical trials,
and controversies in cardiac therapeutics, 1968-1998. Isis 2000;91;
504-41.[ISI][Medline]
Murphy ML, Hultgren HN, Detre K, Thomsen J, Takaro T,
Veterans Administration Cooperative Study. Treatment of chronic stable
angina: a preliminary report of survival data of the randomized Veterans
Administration cooperative study. N Engl J Med 1977;297: 621-7.[Abstract]
Special correspondence: a debate on coronary bypass. N
Engl J Med 1977;297: 1464-70.[ISI][Medline]
Lakatos I. The methodology of scientific research
programmes. Cambridge: Cambridge University Press, 1978.
Herrington DM, Rebuossin DM, Brosnihan B, Sharp PC,
Shumaker SA, Snyder TE, et al. Effects of estrogen replacement on the
progression of coronary-artery atherosclerosis. N Engl J Med
2000;343: 522-9.[Abstract/Free Full Text]
Nabel EG. Coronary heart disease in womenan ounce of
prevention. N Engl J Med 2000;343: 572-574.[Free Full Text]
Cardiac Arrhythmia Suppression Trial II Investigators.
Effect of the antiarrhythmic agent moricizine on survival after myocardial
infarction. N Engl J Med 1992;327: 227-33.[Abstract]
Thagard P. How scientists explain disease.
Princeton: Princeton University Press, 1999.
Planck M. Scientific autobiography and other papers.
London: Williams and Norgate, 1950.
Rosenthal R. On the social psychology of the psychological
experiment: the experimenter's hypothesis as an unintended determinant of
experimental results. Am Sci 1963;51: 268-83[ISI]
Rosenthal R, Fode KL. Three experiments in experimenter
bias. Psychol Rep Monog 1963:3: 12-8.
Djulbegovic B, Lacevic M, Cantor A, Fields KK, Bennett CL,
Adams J, et al. The uncertainty principle and industry-sponsored research.
Lancet 2000;356: 635-8.[CrossRef][ISI][Medline]
Robinson I. Clinical trials and the collective ethic: the
case of hyperbaric oxygen therapy and the treatment of multiple sclerosis.
In: Weisz G, ed. Social science perspectives on medical ethics.
Philadelphia: University of Pennsylvania Press, 1991.
Ziegler EJ, Fisher CJ Jr, Sprung CL, Straube RC, Sadoff JC,
Foulke GE, et al. Treatment of gram-negative bacteremia and septic shock
with HA-1A human monoclonal antibody against endotoxin. A randomized,
double-blind, placebo controlled trial. N Engl J Med 1991;324:
429-36.[Abstract]
Toulmin S. Return to reason. Cambridge: Harvard
University Press, 2001.
DISCLAIMER:
All information, data, and material contained, presented, or provided here
is for general information purposes only and is not to be construed as
reflecting the knowledge or opinions of the publisher, and is not to be
construed or intended as providing medical or legal advice. The decision
whether or not to vaccinate is an important and complex issue and should
be made by you, and you alone, in consultation with your health care
provider.
"A foolish faith in authority is the worst enemy of truth."
-- Albert Einstein, letter to a friend, 1901
"I know of no safe depository of the ultimate powers of the society but the people themselves, and if we think them not enlightened enough to exercise control with a wholesome discretion, the remedy is not to take it from them, but to inform their discretion by education."
-- Thomas Jefferson, letter to William C. Jarvis, September 28, 1820
"What's the point of vaccination if it doesn't protect you from the unvaccinated?"
-- Sandy Gottstein
"Who gets to decide what the greater good is and how many will be sacrificed to it?"