Improving the quality and clinical relevance of diagnostic studies

Frans H Rutten; Karel G M Moons; Arno W Hoes

doi:10.1136/bmj.332.7550.1129

Research

Improving the quality and clinical relevance of diagnostic studies

BMJ 2006; 332 doi: https://doi.org/10.1136/bmj.332.7550.1129 (Published 11 May 2006) Cite this as: BMJ 2006;332:1129

Frans H Rutten, general practitioners (F.H.Rutten{at}umcutrecht.nl),
Karel G M Moons, professor of clinical epidemiology,
Arno W Hoes, professor of clinical epidemiology and general practice

Julius Centre for Health Sciences and Primary Care, University Medical Centre, Utrecht, 3508 AB, Netherlands

Correspondence to: F H Rutten

Bachmann and colleagues show that few studies on diagnostic accuracy include calculations of sample size. Most such studies are too small to provide precise estimates of the overall sensitivity and specificity of a test, let alone for subgroups,1 and few studies have investigated this issue. We support the authors' recommendation that all diagnostic studies should calculate sample size at the planning phase, especially as straightforward methods are available for assessing simple proportions, such as sensitivity and specificity. However, they used the specificity and sensitivity of single tests to calculate sample size (understandable given the predominance of these tests in research) and did not consider the increasing number of clinically relevant studies that measure the accuracy of several tests in combination.2

If you were testing the accuracy of B-type natriuretic peptide (BNP) for excluding heart failure in primary care, for example, precise estimation of the sensitivity and specificity of the test might seem important. Such tests, however, have limited value in clinical practice. Firstly, in daily practice positive and negative values merely help doctors to estimate the probability of disease.3 Secondly, a diagnosis in practice is seldom based on one test. Doctors would probably use the BNP test only if it provided extra diagnostic information to other measures such as signs and symptoms, which have already been assessed. To improve clinical practice, it would be better to measure the diagnostic accuracy of combinations of readily available tests (applying multivariable regression analysis with receiver operating characteristic curves) and then assess whether the addition of BNP improves accuracy.4 The BNP test should not be used when the patient's history and physical examination would provide equivalent diagnostic information.

We know even less about determinations of sample size for multivariable diagnostic studies. The number of tests studied is usually limited to allow for adequate data analysis. An often used rule is that at least 10 patients with the disease should be tested for each diagnostic test evaluated.5 Such ways of determining sample size are not ideal. If the method suggested by Bachmann and colleagues is used to determine sample size in evaluations of multiple tests, many assumptions must be made to achieve acceptable proportions of false negative and false positive diagnoses when a cut-off value is introduced.

Methodological improvements are needed to guide considerations of sample size in diagnostic research. Lack of consensus on some of these issues is no excuse for “complete” lack of prior calculations of sample size in diagnostic studies. Bachmann and colleagues showed that a lack of such calculations is common. We hope that authors of studies on diagnostic tests will soon adopt more rigorous guidelines based on the standards for reporting of diagnostic accuracy (STARD initiative; www.consort-statement.org/Initiatives/newstard.htm).

Footnotes

Contributors FHR, KGMM, and AWH critically discussed the structure of this article. FHR wrote the first draft and KGMM and AWH critically revised the manuscript.
Competing interests None declared.

References

1.↵
1. Bachmann LM,
2. Puhan MA,
3. ter Riet G,
4. Bossuyt PM
.Sample sizes of studies on diagnostic accuracy: literature survey.BMJ2006; 332:1127–9.
OpenUrl Abstract/FREE Full Text
2.↵
1. Moons KG,
2. Biesheuvel CJ,
3. Grobbee DE
.Test research versus diagnostic research.Clin Chem2004; 50:473–6.
OpenUrl FREE Full Text
3.↵
1. Moons KG,
2. Harrell FE
.Sensitivity and specificity should be deemphasized in diagnostic accuracy studies.Acad Radiol2003; 10:670–2.
OpenUrl CrossRef PubMed Web of Science
4.↵
1. Rutten FH,
2. Moons KGM,
3. Cramer MJM,
4. Grobbee DE,
5. Zuithoff NPA,
6. Lammers JWJ,
7. et al
.Recognising heart failure in elderly patients with stable chronic obstructive pulmonary disease in primary care: a cross-sectional diagnostic study.BMJ2005; 331:1379–85.
OpenUrl Abstract/FREE Full Text
5.↵
1. Peduzzi P,
2. Concato J,
3. Kemper E,
4. Holford TR,
5. Feinstein AR
.A simulation study of the number of events per variable in logistic regression analysis.J Clin Epidemiol1996; 49:1373–9.
OpenUrl CrossRef PubMed Web of Science

[1] 1.↵
Bachmann LM,
Puhan MA,
ter Riet G,
Bossuyt PM
.Sample sizes of studies on diagnostic accuracy: literature survey.BMJ2006; 332:1127–9.
OpenUrl Abstract/FREE Full Text

[2] Bachmann LM,

[3] Puhan MA,

[4] ter Riet G,

[5] Bossuyt PM

[6] 2.↵
Moons KG,
Biesheuvel CJ,
Grobbee DE
.Test research versus diagnostic research.Clin Chem2004; 50:473–6.
OpenUrl FREE Full Text

[7] Moons KG,

[8] Biesheuvel CJ,

[9] Grobbee DE

[10] 3.↵
Moons KG,
Harrell FE
.Sensitivity and specificity should be deemphasized in diagnostic accuracy studies.Acad Radiol2003; 10:670–2.
OpenUrl CrossRef PubMed Web of Science

[11] Moons KG,

[12] Harrell FE

[13] 4.↵
Rutten FH,
Moons KGM,
Cramer MJM,
Grobbee DE,
Zuithoff NPA,
Lammers JWJ,
et al
.Recognising heart failure in elderly patients with stable chronic obstructive pulmonary disease in primary care: a cross-sectional diagnostic study.BMJ2005; 331:1379–85.
OpenUrl Abstract/FREE Full Text

[14] Rutten FH,

[15] Moons KGM,

[16] Cramer MJM,

[17] Grobbee DE,

[18] Zuithoff NPA,

[19] Lammers JWJ,

[20] et al

[21] 5.↵
Peduzzi P,
Concato J,
Kemper E,
Holford TR,
Feinstein AR
.A simulation study of the number of events per variable in logistic regression analysis.J Clin Epidemiol1996; 49:1373–9.
OpenUrl CrossRef PubMed Web of Science

[22] Peduzzi P,

[23] Concato J,

[24] Kemper E,

[25] Holford TR,

[26] Feinstein AR

Improving the quality and clinical relevance of diagnostic studies

Footnotes

References

Article alerts

Log in or register:

Download this article to citation manager

Help

Forward this page

Content links

About us

Resources

Explore BMJ

My account

Information

Search form

Improving the quality and clinical relevance of diagnostic studies

Footnotes

References

Article alerts

Log in or register:

Download this article to citation manager

Help

Forward this page

Content links

About us

Resources

Explore BMJ

My account

Information