Jump to: Page Content, Site Navigation, Site Search,
You are seeing this message because your web browser does not support basic web standards. Find out more about why this message is appearing and what you can do to make your experience on this site better.
Jonathan A C Sterne Medical Research Council Health Services Research
Collaboration, Department of Social Medicine, University of Bristol,
Bristol BS8 2PR
Correspondence to: J A C Sterne
jonathan.sterne{at}bristol.ac.uk
Studies that show a significant effect of treatment are
more likely to be published, be published in English, be cited by other
authors, and produce multiple publications than other
studies.1-8 Such studies are therefore also more likely
to be identified and included in systematic reviews, which may
introduce bias.9 Low methodological quality of studies
included in a systematic review is another important source of
bias.10
All these biases are more likely to affect small studies than large
ones. The smaller a study the larger the treatment effect necessary for
the results to be significant. The greater investment of time and money
in larger studies means that they are more likely to be of high
methodological quality and published even if their results are
negative. Bias in a systematic review may therefore become evident
through an association between the size of the treatment effect and
study size
Funnel plots
such associations may be examined both graphically and
statistically.
Summary points
Asymmetrical funnel plots may indicate publication bias or be due
to exaggeration of treatment effects in small studies of low quality
Bias is not the only explanation for funnel plot asymmetry; funnel
plots should be seen as a means of examining "small study effects"
(the tendency for the smaller studies in a meta-analysis to show larger
treatment effects) rather than a tool for diagnosing specific types of
bias
Statistical methods may be used to examine the evidence for bias and to
examine the robustness of the conclusions of the meta-analysis in
sensitivity analyses
"Correction" of treatment effect estimates for bias should be
avoided as such corrections may depend heavily on the assumptions made
Multivariable models may be used, with caution, to examine the relative
importance of different types of bias
![]()
Graphical methods for detecting bias
Funnel plots were first used in educational research and
psychology.11 They are simple scatter plots of the
treatment effects estimated from individual studies (horizontal axis)
against some measure of study size (vertical axis). Because precision
in estimating the underlying treatment effect increases as a study's
sample size increases, effect estimates from small studies scatter
more widely at the bottom of the graph, with the spread narrowing among
larger studies. In the absence of bias the plot therefore resembles a
symmetrical inverted funnel (fig 1
(left)).

View larger version (14K):
[in a new window]
Fig 1.
Hypothetical funnel plots: left, symmetrical
plot in absence of bias (open circles are smaller studies showing no
beneficial effects); centre, asymmetrical plot in presence of
publication bias (smaller studies showing no beneficial effects are
missing); right, asymmetrical plot in presence of bias due to low
methodological quality of smaller studies (open circles are small
studies of inadequate quality whose results are biased towards larger
effects). Solid line is pooled odds ratio and dotted line is null
effect (1). Pooled odds ratios exaggerate treatment effects in presence
of bias
relative risk or odds ratio
are
plotted on a logarithmic scale, so that effects of the same magnitude
but in opposite directions
for example, 0.5 and 2
are equidistant
from 1.0.12 Treatment effects have generally been plotted
against sample size or log sample size. However, the statistical power
of a trial is determined by both the sample size and the number of
participants developing the event of interest, and so the use of
standard error as the measure of study size is generally a good choice.
Plotting against precision (1/standard error) emphasises differences
between larger studies, which may be useful in some situtations.
Guidelines on the choice of axis in funnel plots are presented
elsewhere.13
Reporting bias
for example, because smaller studies showing no
statistically significant beneficial effect of the treatment (open
circles in fig 1 (left)) remain unpublished
leads to an asymmetrical
appearance with a gap in the bottom right of the funnel plot (fig 1
(centre)). In this situation the combined effect from meta-analysis
overestimates the treatment's effect.
14 15
Smaller
studies are, on average, conducted and analysed with less methodological rigour than larger ones, so that asymmetry may also
result from the overestimation of treatment effects in smaller studies
of lower methodological quality (fig 1 (right)).
Alternative explanations of funnel plot asymmetry
It is important to realise that funnel plot asymmetry may
have causes other than bias.14 Heterogeneity between trials leads to asymmetry if the true treatment effect is larger in the
smaller trials. For example, if a combined outcome is considered then
substantial benefit may be seen only in patients at high risk for the
component of the combined outcome affected by the intervention.
16 17
Trials conducted in patients at high
risk also tend to be smaller because of the difficulty in recruiting such patients and because increased event rates mean that smaller sample sizes are required to detect a given effect. Some interventions may have been implemented less thoroughly in larger trials, which thus
show decreased treatment effects. For example, an asymmetrical funnel
plot was found in a meta-analysis of trials examining the effect of
comprehensive assessment on mortality. An experienced consultant
geriatrician was more likely to be actively involved in the smaller
trials, and this may explain the larger treatment effects observed in
these trials.
14 18
Examining biological plausibility
In some circumstances the possible presence of bias can be
examined through markers of adherence to treatment, such as drug
metabolites in patients' urine or markers of the biological effects of
treatment such as the achieved reduction in cholesterol concentration
in trials of cholesterol lowering drugs. If patients' adherence to an
effective treatment varies across trials this should result in
corresponding variation in treatment effects. Scatter plots of
treatment effect against adherence should be compatible with there
being no treatment effect at 0% adherence, and so a simple regression
line should intercept the vertical axis at zero treatment effect. If a
scatter plot indicates a treatment effect even when no patients adhere
to treatment then bias is a possible explanation. Such plots provide an
analysis that is independent of study size. For example, in a
meta-analysis of trials examining the effect of reducing dietary sodium
on blood pressure Midgley et al plotted the reduction in blood pressure against the reduction in urinary sodium concentration for each study
and performed a linear regression analysis (fig 2).20 The
results show a reduction in blood pressure even in the absence of a
reduction in urinary sodium concentration, which may indicate the
presence of bias.
|
| |
Statistical methods for detecting and correcting for bias |
|---|
Selection models
"Selection models" to detect publication bias model the
selection process that determines which results are published, based on
the assumption that the study's P value affects its probability of
publication.21-23 The methods can be extended to estimate
treatment effects, corrected for the estimated publication bias,24 but avoidance of strong assumptions about the
nature of the selection mechanism means that a large number of studies is required so that a sufficient range of P values is included. Published applications include a meta-analysis of trials of homoeopathy and correction of estimates of the association between passive smoking
and lung cancer.
25 26
The complexity of the methods and
the large number of studies needed probably explains why selection models have not been widely used in practice.
Trim and fill
Duval and Tweedie have proposed "trim and fill"; a
method based on adding studies to a funnel plot so that it becomes
symmetrical.30-32 Smaller studies are omitted until the
funnel plot is symmetrical (trimming). The trimmed funnel plot is used
to estimate the true "centre" of the funnel, and then the omitted
studies and their missing "counterparts" around the centre are
replaced (filling). This provides an estimate of the number of missing
studies and an adjusted treatment effect, including the "filled"
studies. A recent study that used the trim and fill method in 48 meta-analyses estimated that 56% of meta-analyses had at least one
study missing whereas the number of missing studies in 10 was
statistically significant.33 However, simulation studies have found that the trim and fill method detects "missing" studies in a substantial proportion of meta-analyses, even in the absence of
bias.34 Thus there is a danger that in many meta-analyses application of the method could mean adding and adjusting for non-existent studies in response to funnel plot asymmetry arising from
nothing more than random variation.
Statistical methods for detecting funnel plot asymmetry
An alternative approach, which does not attempt to define
the selection process leading to publication, is to examine
associations between study size and estimated treatment effects. Begg
and Mazumdar proposed a rank correlation method to examine the
association between the effect estimates and their variances (or,
equivalently, their standard errors),35 whereas Egger et
al introduced a linear regression approach, which is equivalent to a
weighted regression of treatment effect (for example, log odds ratio)
on its standard error, with weights inversely proportional to the
variance of the effect size.14 Because each of these
approaches looks for an association between the study's treatment
effect and its standard error, they can be seen as statistical analogues of funnel plots. The regression method is more sensitive than
the rank correlation approach, but the sensitivity of both methods is
generally low in meta-analyses based on less than 20 trials.36
the "ecological
fallacy."39
|
| |
Case study |
|---|
Is the effect of homoeopathy due to the placebo effect?
The placebo effect is a popular explanation for the apparent
efficacy of homoeopathic remedies.42-44 Linde et al
addressed this question in a systematic review and meta-analysis of 89 published and unpublished reports of randomised placebo controlled
trials of homoeopathy.25 They did an extensive literature search and quality assessment that covered dimensions of internal validity known to be associated with treatment effects.10
![]() |
| (Credit: MARK OLDROYD) |
|
|
Summary recommendations on investigating and dealing with
publication and other biases in a meta-analysis
Examining for bias
Dealing with bias
|
Conclusions
Prevention is better than cure. In conducting a systematic
review and meta-analysis, investigators should make strenuous efforts
to find all published studies and search for unpublished work. The
quality of component studies should also be carefully
assessed.10 The box shows summary recommendations on
examining for, and dealing with, bias in meta-analysis. Selection models for publication bias are likely to be of most use in sensitivity analyses in which the robustness of a meta-analysis to possible publication bias is assessed. Funnel plots should be used in most meta-analyses to provide a visual assessment of whether the estimates of treatment effect are associated with study size. Statistical methods
may be used to examine the evidence for funnel plot asymmetry and
competing explanations for heterogeneity between studies. The power of
these methods is, however, limited, particularly for meta-analyses
based on a small number of small studies. The results of such
meta-analyses should always be treated with caution.
| |
Acknowledgments |
|---|
We thank Klaus Linde and Julian Midgley for unpublished data.
| |
Footnotes |
|---|
Series editor: Matthias Egger
Competing interests:
Systematic Reviews
in Health Care: Meta-analysis in Context can be purchased
through the BMJ Bookshop (www.bmjbookshop.com); further information and
updates for the book are available (www.systematicreviews.com)
| |
References |
|---|
| 1. | Easterbrook PJ, Berlin JA, Gopalan R, Matthews DR. Publication bias in clinical research. Lancet 1991; 337: 867-872[CrossRef][Medline]. |
| 2. | Dickersin K, Min YI, Meinert CL. Factors influencing publication of research results: follow-up of applications submitted to two institutional review boards. JAMA 1992; 263: 374-378. |
| 3. |
Stern JM, Simes RJ.
Publication bias: evidence of delayed publication in a cohort study of clinical research projects.
BMJ
1997;
315:
640-645 |
| 4. | Egger M, Zellweger-Zähner T, Schneider M, Junker C, Lengeler C, Antes G. Language bias in randomised controlled trials published in English and German. Lancet 1997; 350: 326-329[CrossRef][Medline]. |
| 5. | Gøtzsche PC. Reference bias in reports of drug trials. BMJ 1987; 295: 654-656. |
| 6. | Ravnskov U. Cholesterol lowering trials in coronary heart disease: frequency of citation and outcome. BMJ 1992; 305: 15-19. |
| 7. |
Tramèr MR, Reynolds DJ, Moore RA, McQuay HJ.
Impact of covert duplicate publication on meta-analysis: a case study.
BMJ
1997;
315:
635-640 |
| 8. | Gøtzsche PC. Multiple publication of reports of drug trials. Eur J Clin Pharmacol 1989; 36: 429-432[CrossRef][Medline]. |
| 9. | Egger M, Dickersin K, Davey Smith G. Problems and limitations in conducting systematic reviews. In: Egger M, Davey Smith G, Altman DG, eds. Systematic reviews in health care: meta-analysis in context 2nd ed. London: BMJ Books, 2001. |
| 10. | Jüni P, Altman DG, Egger M. Assessing the quality of controlled clinical trials. BMJ 2001; 322: 000-000. |
| 11. | Light RJ, Pillemer DB. Summing up: the science of reviewing research. Cambridge: Harvard University Press, 1984. |
| 12. | Galbraith RF. A note on graphical presentation of estimated odds ratios from several clinical trials. Stat Med 1988; 7: 889-894[Medline]. |
| 13. | Sterne JAC, Egger M. Funnel plots for detecting bias in meta-analysis: guidelines on choice of axis. J Clin Epidemiol 2001 (in press). |
| 14. |
Egger M, Davey Smith G, Schneider M, Minder C.
Bias in meta-analysis detected by a simple, graphical test.
BMJ
1997;
315:
629-634 |
| 15. | Villar J, Piaggio G, Carroli G, Donner A. Factors affecting the comparability of meta-analyses and the largest trials results in perinatology. J Clin Epidemiol 1997; 50: 997-1002[CrossRef][Medline]. |
| 16. |
Davey Smith G, Egger M.
Who benefits from medical interventions? Treating low risk patients can be a high risk strategy.
BMJ
1994;
308:
72-74 |
| 17. |
Glasziou PP, Irwig LM.
An evidence based approach to individualising treatment.
BMJ
1995;
311:
1356-1359 |
| 18. | Stuck AE, Siu AL, Wieland GD, Adams J, Rubenstein LZ. Comprehensive geriatric assessment: a meta-analysis of controlled trials. Lancet 1993; 342: 1032-1036[CrossRef][Medline]. |
| 19. | Sterne JAC, Egger M, Davey Smith G. Investigating and dealing with publication and other biases. In: Egger M, Davey Smith G, Altman DG, eds. Systematic reviews in health care: meta-analysis in context 2nd ed. London: BMJ Books, 2001. |
| 20. | Midgley JP, Matthew AG, Greenwood CMT, Logan AG. Effect of reduced dietary sodium on blood pressure. A meta-analysis of randomized controlled trials. JAMA 1996; 275: 1590-1597[Abstract]. |
| 21. | Dear KBG, Begg CB. An approach to assessing publication bias prior to performing a meta-analysis. Stat Sci 1992; 7: 237-245. |
| 22. | Hedges LV. Modeling publication selection effects in meta-analysis. Stat Sci 1992; 7: 246-255[CrossRef]. |
| 23. | Iyengar S, Greenhouse JB. Selection problems and the file drawer problem. Stat Sci 1988;109-35. |
| 24. | Vevea JL, Hedges LV. A general linear model for estimating effect size in the presence of publication bias. Psychometrika 1995; 60: 419-435[CrossRef]. |
| 25. | Linde K, Clausius N, Ramirez G, Melchart D, Eitel F, Hedges LV, et al. Are the clinical effects of homoeopathy placebo effects? A meta-analysis of placebo-controlled trials. Lancet 1997; 350: 834-843[CrossRef][Medline]. |
| 26. | Givens GH, Smith DD, Tweedie RL. Publication bias in meta-analysis: a Bayesian data-augmentation approach to account for issues exemplified in the passive smoking debate. Stat Sci 1997; 12: 221-250. |
| 27. | Copas J. What works?: selectivity models and meta-analysis. J R Stat Soc A 1999; 162: 95-109[CrossRef]. |
| 28. |
Copas JB, Shi JQ.
Reanalysis of epidemiological evidence on lung cancer and passive smoking.
BMJ
2000;
320:
417-418 |
| 29. | Begg CB. Publication bias in meta-analysis. Stat Sci 1997; 12: 241-244. |
| 30. | Taylor SJ, Tweedie RL. Practical estimates of the effect of publication bias in meta-analysis. Aust Epidemiol 1998; 5: 14-17. |
| 31. | Duval SJ, Tweedie RL. Trim and fill: a simple funnel plot based method of testing and adjusting for publication bias in meta-analysis. Biometrics 2001 (in press). |
| 32. | Duval SJ, Tweedie RL. A non-parametric "trim and fill" method of assessing publication bias in meta-analysis. J Am Stat Assoc 2001 (in press). |
| 33. |
Sutton AJ, Duval SJ, Tweedie RL, Abrams KR, Jones DR.
Empirical assessment of effect of publication bias on meta-analyses.
BMJ
2000;
320:
1574-1577 |
| 34. | Sterne JAC, Egger M. High false positive rate for trim and fill method [rapid response to (www.bmj.com/cgi/eletters/320/7249/1574#EL1)] (accessed 30 March 2001). |
| 35. | Begg CB, Mazumdar M. Operating characteristics of a rank correlation test for publication bias. Biometrics 1994; 50: 1088-1101[CrossRef][Medline]. |
| 36. | Sterne JAC, Gavaghan D, Egger M. Publication and related bias in meta-analysis: power of statistical tests and prevalence in the literature. J Clin Epidemiol 2000; 53: 1119-1129[CrossRef][Medline]. |
| 37. | Thompson SG, Sharp SJ. Explaining heterogeneity in meta-analysis: a comparison of methods. Stat Med 1999; 18: 2693-2708[CrossRef][Medline]. |
| 38. | Thompson SG. Why and how sources of heterogeneity should be investigated. In: Egger M, Davey Smith G, Altman DG, eds. Systematic reviews in health care: meta-analysis in context 2nd ed. London: BMJ Books, 2001. |
| 39. |
Morgenstern H.
Uses of ecologic analysis in epidemiologic research.
Am J Public Health
1982;
72:
1336-1344 |
| 40. | Daniels MJ, Hughes MD. Meta-analysis for the evaluation of potential surrogate markers. Stat Med 1997; 16: 1965-1982[CrossRef][Medline]. |
| 41. | Hughes MD, Daniels MJ, Fischl MA, Kim S, Schooley RT. CD4 cell count as a surrogate endpoint in HIV clinical trials: a meta-analysis of studies of the AIDS Clinical Trials Group. AIDS 1998; 12: 1823-1832[Medline]. |
| 42. | Maddox J, Randi J, Stewart WW. "High-dilution" experiments a delusion. Nature 1988; 334: 287-291[CrossRef][Medline]. |
| 43. | Gøtzsche PC. Trials of homeopathy. Lancet 1993; 341: 1533. |
| 44. | Vandenbroucke JP. Homoeopathy trials: going nowhere. Lancet 1997; 350: 824[CrossRef][Medline]. |
| 45. | Moher D, Schulz KF, Altman DG, for the CONSORT Group. The CONSORT statement: revised recommendations for improving the quality of reports of parallel group randomized trials. Lancet 2001 (in press). |
| 46. | Lefebvre C, Clarke M. Identifying randomised trials. In: Egger M, Davey Smith G, Altman DG, eds. Systematic reviews in health care: meta-analysis in context 2nd ed. London: BMJ Books, 2001. |
What can you learn from this BMJ paper? Read Leanne Tite's Paper+