BMJ 2000;320:1332-1334 ( 13 May )

Education and debate

Thyroid function tests---time for a reassessment

Denis StJ O'Reilly, consultant clinical biochemist

Department of Clinical Biochemistry and Clinic for Thyroid Diseases, Royal Infirmary, Glasgow G4 0SF

doreilly{at}clinmed.gla.ac.uk

In 1999, 890 000 measurements of thyroid stimulating hormone were performed by Scottish hospital laboratories---approximately one test for every six of Scotland's 5.1 million people.1 This number does not include tests performed in the non-NHS laboratories or as part of the screening programme for congenital hypothyroidism. Although laboratory statistics are not collected nationally in England and Wales, the market in the United Kingdom (population 59 million) for thyroid stimulating hormone diagnostic tests is currently estimated at 9-10 million each year.

A remarkable downgrading of the clinical aspects of hypothyroidism and hyperthyroidism has paralleled the inexorable increase in the number of thyroid function tests performed over the past 20 years. This has led to chaos in the diagnosis of hypothyroidism. It has been stated that a diagnosis of clinical hypothyroidism can be made on the basis of biochemical measurements alone and that signs and symptoms are unnecessary.2 Other authors protest, and maintain that biochemical tests can be misleading and that the diagnosis can be made on clinical grounds alone.3 In hyperthyroidism, a suppressed thyroid stimulating hormone concentration is currently the cornerstone of biochemical diagnosis. No numerical value has been assigned to the serum concentration of thyroid stimulating hormone below which suppression is considered to occur. This value varies from centre to centre depending on the sensitivity of the local assay. Thus, to many non-specialists the diagnosis of hyperthyroidism is also confusing.


Summary points


There are no data on the relative importance of biochemical thyroid function tests and clinical symptoms and signs in assessing thyroid dysfunction

Secretion of thyroid stimulating hormone is influenced by many factors other than the negative feedback inhibition by thyroxine or triiodothyronine

Changes in thyroid stimulating hormone, thyroxine, and triiodothyronine concentrations during systemic illness are poorly understood

Thyroid function tests cannot be interpreted in patients with systemic illness

Since thyroid stimulating hormone concentrations are distributed logarithmically in the population, minor changes are unlikely to be clinically important

The possibility of false positive and false negative results should be considered in interpreting thyroid stimulating hormone concentrations



    Methods

This review is based on my 20 years' postgraduate experience in providing biochemical thyroid function tests and treating patients with thyroid disorders. I have selected and highlighted some of the publications that have influenced my practice and call into question the increasing reliance on biochemical thyroid function tests in making a diagnosis.

    Historical setting

The treatments currently used for hyperthyroidism and hypothyroidism were established by the beginning of the 1970s. Though the symptoms and signs of these disorders had been analysed and clinical scoring indices had been developed and validated in the 1960s, clinical diagnosis remained problematic.4-8 The clinical diagnostic schemes for hypothyroidism were similar,4-6 but there were considerable differences between diagnostic schemes for hyperthyroidism. For example, atrial fibrillation was considered by Wayne and Crooks to be one of the most powerful discriminating signs, 6 7 but it was not included by Gurney et al.8 Age, on the other hand, was a major diagnostic factor according to Gurney et al,8 but was not mentioned by Wayne or Crooks. 6 7 From knowledge of the pathophysiology of the hypothalamic-pituitary-thyroid axis available at that time, it was believed that measuring the concentration of serum thyroid stimulating hormone would simplify the diagnosis.

    Hypothyroidism

The publication of a reliable and practical assay for thyroid stimulating hormone was a landmark.9 A normal range of <0.5-4.2 mU/l was established, based on measurements from 29 control subjects. One of the first applications of the assay was in patients who had undergone subtotal thyroidectomy for Graves' disease.10 In 28 "unequivocally euthyroid" patients followed for three to 21 years, the mean concentration was 8.2 mU/l (range 1.3-34.0 mU/l). In four patients followed up for four to 12 years and in whom a therapeutic trial of thyroxine had shown no benefit, the thyroid stimulating hormone concentration range was 10.5-21.5 mU/l. These patients were considered to be unequivocally euthyroid by a group who had validated clinical indices for the diagnosis of hypoparathyroidism and hyperthyroidism. 5 7 They were used to show the superiority of thyroid stimulating hormone measurements in detecting hypothyroidism, and no suggestion was made that the normal range could be widened.

In 1973, the data on which the concept of subclinical hypothyrodism was based were published.11 The reference range for thyroid stimulating hormone, established from measurement in 29 subjects,10 was used to classify 22 euthyroid subjects as having subclinical hypothyroidism. In six of the 22 subjects given a therapeutic trial of thyroxine, treatment showed no benefit, and 10 had originally been recruited as normal controls.

Whickham survey
The Whickham survey was a further landmark.12 All Whickham residents with a serum thyroid hormone concentration >6 mU/l were diagnosed as being hypothyroid, irrespective of their clinical status. This reinforced the view that the serum thyroid stimulating hormone concentration defined hypothyroidism.

The 20 year follow up study of the Whickham survey has yielded invaluable data on the natural history of thyroid disorders.13 A main conclusion of the study, disseminated to most non-specialists in a review published in the BMJ, was that "thyroid stimulating hormone concentrations above 2 mU/l are associated with an increased risk of hypothyroidism."2 Half of the population (male and female) fall into this category.12 This conclusion was based on the change in the slope of the line obtained when the log of the serum thyroid stimulating hormone concentration was related to the logit probability of developing hypothyroidism over a 20 year period in women (see box).13 The probability of a 40 year old woman with a thyroid stimulating hormone of 2.1 mU/l developing hypothyroidism is low---at 1 in 50 over 20 years. In men, the probability is so low that an equivalent equation could not be derived.13


Relation between concentration and risk

The equation to describe the relation between the probability of developing hypothyroidism and the serum thyroid stimulating hormone concentration is13: ln {P/(1-P)}=b0+b1 ln thyroid stimulating hormone+0.027 age (+1.79 if antibody positive).

b0=-5.02, b1=0.30 if thyroid stimulating hormone <2 mU/l

b0=-6.38, b1=1.97 if thyroid stimulating hormone >= 2 mU/l

Clinical features ignored
The review also highlighted the fact that in making a diagnosis of clinical or overt hypothyroidism "symptoms are not considered a criterion by some authorities."2 The review claimed great authority. It was pointed out that some of the data on which it was based had been collected for the consensus statement for good practice and audit measures in the management of hypothyroidism and hyperthyroidism published on behalf of the Royal College of Physicians of London and the Society for Endocrinology.14 This publication makes no reference to the clinical manifestations or clinical diagnosis of hypothyroidism. Thus, the clinical features of hypothyroidism seem to have been relegated to the status of historical curiosities.

    Hyperthyroidism

Assays capable of defining the lower end of the statistically derived reference range became available in the early 1980s. One evaluation of such an assay reported that all of 110 hyperthyroid patients studied had a thyroid stimulating hormone concentration <0.07 mU/l, and all 62 euthyroid control subjects had concentrations >0.07 mU/l.15 However, some clinically euthyroid subjects with abnormally low thyroid stimulating hormone concentrations were classified as having subclinical hyperthyroidism.15 Assays can now detect thyroid stimulating hormone in serum at concentrations of 0.005 mU/l.16 At this low concentration, hyperthyroid patients were not distinguished from some euthyroid, though ill, patients.16 The range of thyroid stimulating hormone concentrations in patients whose condition stabilised on thyroxine replacement treatment was <0.005 to >10.00 mU/l.16 It is therefore clear that measurement of the thyroid stimulating hormone concentration has failed to deliver what was expected of it.

Clinical aspects
During this period the clinical aspects of hyperthyroidism have also been downgraded. Most current undergraduate textbooks treat the clinical diagnosis of thyroid dysfunction by referring the student to lists. In the current edition of the Oxford Textbook of Medicine, this matter is dismissed in less than a line, and the reader is referred to unweighted lists of the symptoms and signs.17 In the popular postgraduate textbook of Clinical Endocrinology, the biochemical diagnosis and assessment of hyperthyroidism are given before the clinical features.18 Medical journals are now effectively devoid of references to the clinical features of hyperthyroidism. Though a symptom rating scale for the diagnosis of hyperthyroidism was described in 1988,19 the clinical scoring systems for assessing hypothyroidism and hyperthyroidism are now rarely cited (table).


                              
View this table:
[in this window]
[in a new window]
 

Citation frequency (in BIDS) of published papers on the clinical assessment of hypothyroidism and hyperthyroidism in relation to UK groups and worldwide,1987-97



    Non-thyroidal illness syndrome

We have recently become aware of the complexity of the effects of non-thyroidal illness on the hypothalamic-pituitary-thyroid axis and thyroid hormone metabolism. Figures like the one shown (taken from a recent review20) are frequently used to illustrate the nature of the changes that occur in serum thyroid hormone concentrations in the non-thyroidal illness syndrome. These figures have never been published with a numerical scale or error bars. The problem of interpreting free thyroxine was summarised by the author: "It is common to find that a sample obtained from a patient with non-thyroidal illness syndrome may have a raised free thyroxine by one method but a normal or low free thyroxine by another."20 The equilibrium dialysis reference method used to profile free thyroxine in the figure is technically demanding and currently not established in the United Kingdom. As the original legend to the figure explains:

The profile for free thyroxine is that obtained using equilibrium dialysis and low sample dilution. The level of free thyroxine found using commercial methods will be heavily method dependent. A profile of free triiodothyronine is not included as some ultrafiltration methods suggest that normal or raised free triiodothyronine may be found in illness whilst equilibrium dialysis methods usually show diminished or normal concentrations.20

What free thyroxine and free triiodothyronine assays actually measure is controversial.21 However, what is clear is that we cannot interpret thyroid function tests in systemically ill patients.



View larger version (27K):
[in this window]
[in a new window]
 
The effects of illness on the concentrations of thyroid hormones: the shaded area represents the reference range for each method (reproduced with permission from Beckett and Wilkinson20)


    Current status of thyroid function tests

Our understanding of the complexity of the cerebral-hypothalamic-pituitary-thyroid axis and the mechanism of thyroid hormone action has grown enormously. Current knowledge indicates that the cardiac effects of thyroid hormones, which are clinically very important, are mediated via the alpha 1 thyroid hormone receptor independent of the beta  receptors, which are the dominant regulators of thyroid stimulating hormone secretion.22

False positive and negative results
Overlap between the statistically derived normal and abnormal ranges is accepted in diagnostic tests, giving rise to false positive and false negative results. These concepts have not been applied to measurements of thyroid stimulating hormone. Rather than accepting that the test can be fallible, we transfer the problem to the patient. In patients with systemic disease, the non-thyroidal illness syndrome is invoked to explain the anomalous results, and healthy subjects are diagnosed as having subclinical hypothyroidism or hyperthyroidism. 11 15 The distribution of the serum thyroid stimulating hormone concentration in the population is logarithmic.13 Thus, minor deviations from the statistically derived reference range are unlikely to be clinically meaningful.11

Confusion
Studies in 1580 inpatients23 and in 630 patients admitted as medical emergencies24 found that thyroid function tests performed as screening tests yielded abnormal results in 33% and 20% of patients respectively. In both studies, the biochemical tests suggested thyroid disease incorrectly (that is, they gave false positive results) in nine cases out of 10. Thus, indiscriminate use of thyroid function tests is more likely to confuse than to help.

We do not know how important the thyroid function tests are for making a diagnosis of thyroid dysfunction. It is a matter of personal judgment. Experience has shown that thyroid function tests, like all the signs and symptoms associated with hypothyroidism and hyperthyroidism, are not totally reliable. As it becomes clear that biochemical assessments cannot deliver the diagnostic accuracy expected of them, the fact that the clinical aspects of assessing thyroid dysfunction are being sidelined is a cause for concern. Doing more biochemical tests will lead to further confusion, not the hoped for clarity. The information obtained from thyroid function tests, despite its quantitative numerical appearances, is "soft." How soft has yet to be established.

    Acknowledgments

I thank Dr David Lyon for mathematical help, Dr Ann Wales for obtaining the citation data given in the table, and Drs G H Beastall and H G Gray for constructive comments and discussion.

    Footnotes

Competing interests: None declared.

    References

1. The NHS in Scotland. Laboratory statistics 1999. Edinburgh: Information and Statistics Division, The NHS in Scotland, 1999.
2. Weetman AP. Hypothyroidism: screening and subclinical disease. BMJ 1997; 314: 1175-1178[Free Full Text].
3. Skinner GRB, Thomas R, Taylor M, Sellarajah M, Bolt S, Krett S, et al. Thyroxine should be tried in clinically hypothyroid but biochemically euthyroid patients. BMJ 1997; 314: 1764[Free Full Text].
4. Murray IPC. The clinical diagnosis of thyroid disease. Med J Aust 1964; 1: 827-831.
5. Billewicz WZ, Chapman RS, Crooks J, Day ME, Gossage J, Wayne E, et al. Statistical methods applied to the diagnosis of hypothyroidism. Q J Med 1969; 38: 255-266[Free Full Text].
6. Wayne EJ. Clinical and metabolic studies in thyroid disease. BMJ 1960; i: 78-90.
7. Crooks J, Murray IPC, Wayne EJ. Statistical methods applied to the clinical diagnosis of thyrotoxicosis. Q J Med 1959; 28: 211-234[Free Full Text].
8. Gurney C, Owen, SG, Hall R, Roth M, Harper M, Smart GA. Newcastle thyrotoxicosis index. Lancet 1970; ii: 1275-1278.
9. Hall R, Amos J, Ormston BJ. Radioimmunoassay of human serum thyrotrophin. BMJ 1971; i: 582-585.
10. Hedley AJ, Hall R, Amos J, Michie W, Crooks J. Serum thyrotropin levels after subtotal thyroidectomy for Graves disease. Lancet 1971; i: 455-458[CrossRef].
11. Evered DC, Ormston BJ, Smith PA, Hall R, Bird T. Grades of hypothyroidism. BMJ 1973; i: 657-662.
12. Tunbridge WMG, Evered DC, Hall R, Appleton D, Brewis M, Clark F, et al. The spectrum of thyroid disease in a community: the Whickham survey. Clin Endocrinol 1977; 7: 481-493[Medline].
13. Vanderpump MPJ, Tunbridge WMG, French JM, Appleton D, Bates M, Clark F, et al. The incidence of thyroid disorders in the community: a twenty-year follow-up of the Whickham survey. Clin Endocrinol 1995; 43: 55-68[Medline].
14. Vanderpump MPJ, Ahlquist JAO, Franklyn JA, Clayton RN. Consensus statement for good practice and audit measures in the management of hypothyroidism and hyperthyroidism. BMJ 1996; 313: 539-544[Free Full Text].
15. Seth J, Kellett HA, Caldwell G, Sweeting VM, Beckett GJ, Gow SM, et al. A sensitive immunoradiometric assay for serum thyroid stimulating hormone: a replacement for the thyrotropin releasing hormone test? BMJ 1984; 289: 1334-1336.
16. Wilkinson E, Rae PWH, Thomson KJT, Toft AD, Spencer CA, Beckett GJ. Chemiluminescent third generation assay (Amerlite TSH30) of thyroid stimulating hormone in serum or plasma assessed. Clin Chem 1993; 39: 2166-2173.
17. McGregor AM. The thyroid gland and disorders of thyroid function. In: Weatherall DJ, Ledingham JGG, Warrell DA, eds. Oxford textbook of medicine. 3rd ed. Oxford: Oxford University Press, 1996:1603-1621.
18. Hall R. Hyperthyroidism and Graves disease. In: Besser GM, Thorner MO, eds. Clinical endocrinology. 2nd ed. London: Mosby Wolfe, 1994:17: 1-24.
19. Klein I, Trzepacz PT, Roberts M, Levey GS. Symptom rating scale for assessing hyperthyroidism. Arch Intern Med 1988; 148: 387-390[Abstract].
20. Beckett GJ, Wilkinson E. Thyroid hormone metabolism and thyroid function tests in non-thyroidal illness. CPD Bull Clin Biochem 1998; 1: 9-14.
21. Ekins R. Measurement of free hormones in blood. Endocrin Rev 1990; 11: 5-46[Abstract].
22. Weiss RE, Murata Y, Cua K, Hayashi Y, Seo H, Refetoff S. Thyroid hormone action on liver, heart, and energy expenditure in thyroid hormone receptor beta -deficient mice. Endocrinology 1998; 139: 4945-4952[Abstract/Free Full Text].
23. Spencer C, Eigen A, Shen D, Duda M, Qualls S, Weiss S, et al. Specificity of sensitive assays of thyrotropin (TSH) used to screen for thyroid disease in hospitalised patients. Clin Chem 1987; 33: 1391-1396[Abstract/Free Full Text].
24. Small M, Buchanan L, Evans R. Value of screening thyroid functions in acute medical admissions to hospital. Clin Endocrinol 1990; 32: 185-191[Medline].

(Accepted 3 November 1999)


© BMJ 2000

Related Article

Thyroid function tests
P Kendall-Taylor, Alun Price, A P Weetman, A D Toft, G J Beckett, Sudha Bulusu, Martin Eales, and Denis O'Reilly
BMJ 2000 321: 1080. [Extract] [Full Text]

This article has been cited by other articles:

  • Helfand, M. (2004). Screening for Subclinical Thyroid Dysfunction in Nonpregnant Adults: A Summary of the Evidence for the U.S. Preventive Services Task Force. ANN INTERN MED 140: 128-141 [Abstract] [Full text]  
  • PINTO, A., GLICK, M. (2002). Management of patients with thyroid disease: Oral health considerations. Journal of the American Dental Association 133: 849-858 [Abstract] [Full text]  
  • Kendall-Taylor, P, Price, A., Weetman, A P, Toft, A D, Beckett, G J, Bulusu, S., Eales, M., O'Reilly, D. (2000). Thyroid function tests. BMJ 321: 1080-1080 [Full text]  

Rapid Responses:

Read all Rapid Responses

Thyroid function tests--time for a reassessment
A Waise
bmj.com, 19 May 2000 [Full text]
Thyroid function testing means different things to different people
Martin Eales
bmj.com, 23 May 2000 [Full text]
Accurate diagnosis depends both on clinical judgement
A D Toft, et al.
bmj.com, 26 May 2000 [Full text]
A Patients View
Gina Wright
bmj.com, 28 May 2000 [Full text]
Consequences of the TSH
David M Derry
bmj.com, 29 May 2000 [Full text]
Thyroid function tests provide an objective measure of thyroid status
Edmund Lamb
bmj.com, 29 May 2000 [Full text]
Meaning of FT3 and FT4 ranges
Anna Dominko
bmj.com, 3 Jun 2000 [Full text]
More is not always better
Sudha Bulusu
bmj.com, 3 Jun 2000 [Full text]
Seasonal variation in thyroid gland hormone levels
Anna Dominko
bmj.com, 7 Jun 2000 [Full text]
Thyroid chronobiology
Joseph Watine
bmj.com, 13 Jun 2000 [Full text]
Abnormal thyroid function: clinical assessment is important.
Sudesna Chatterjee, et al.
bmj.com, 21 Jun 2000 [Full text]
Reassessing Thyroid tests
Pat Kendall-Taylor
bmj.com, 28 Jun 2000 [Full text]
More Research Needed
Liz Hammond-Kaarremaa
bmj.com, 5 Jul 2000 [Full text]
Thyroid function tests
Alun Price
bmj.com, 11 Jul 2000 [Full text]
Thyroid function tests - time for a reassessment
Denis St J O'Reilly
bmj.com, 26 Jul 2000 [Full text]
Thyroxine IS NOT THE ANSWER
Pat Davis
bmj.com, 22 Aug 2000 [Full text]
Quality of life as a thyroid patient
Helen Butler
bmj.com, 26 Sep 2000 [Full text]
Suffering of Patients Needlessly
Kristine Holmes
bmj.com, 28 Oct 2000 [Full text]
Hypothyroid patient replies
June Hyslop
bmj.com, 9 Jan 2001 [Full text]
A patient's perspective on risk/benefit
Alison Everitt
bmj.com, 1 Mar 2001 [Full text]
The Fundamental Necessity to Scapegoat the Patient
James Finch
bmj.com, 20 Apr 2001 [Full text]
Are we that primative?
Deborah Criscione
bmj.com, 12 Jun 2001 [Full text]
Clinical trials to establish the validity of biochemical tests and of treatment for hypothroidism
Anna Perry
bmj.com, 16 Jul 2001 [Full text]
Point for clarification
Dr. Donald F. Feeney
bmj.com, 18 Oct 2002 [Full text]



Student BMJ

Risk of surgery for inflammatory bowel disease: record linkage studies

What can you learn from this BMJ paper? Read Leanne Tite's Paper+

www.student.bmj.com

Listen to the latest BMJ Interview