The hospital anxiety and depression rating scale: A cross-sectional study of psychometrics and case finding abilities in general practice
© Olssøn et al; licensee BioMed Central Ltd. 2005
Received: 21 June 2005
Accepted: 14 December 2005
Published: 14 December 2005
General practitioners' (GPs) diagnostic skills lead to underidentification of generalized anxiety disorders (GAD) and major depressive episodes (MDE). Supplement of brief questionnaires could improve the diagnostic accuracy of GPs for these common mental disorders.
The aims of this study were to examine the usefulness of The Hospital Anxiety and Depression Rating Scale (HADS) for GPs by: 1) Examining its psychometrics in the GPs' setting; 2) Testing its case-finding properties compared to patient-rated GAD and MDE (DSM-IV); and 3) Comparing its case finding abilities to that of the GPs using Clinical Global Impression-Severity (CGI-S) rating.
In a cross-sectional survey study 1,781 patients in three consecutive days in September 2001 attended 141 GPs geographically spread in Norway. Sensitivity, specificity, optimal cut off score, and Area under the curve (AUC) for the HADS and the CGI-S were calculated with Generalized Anxiety Questionnaire (GAS-Q) as reference standard for GAD, and Depression Screening Questionnaire (DSQ) for MDE.
The HADS-A had optimal cut off ≥8 (sensitivity 0.89, specificity 0.75), AUC 0.88 and 76% of patients were correctly classified in relation to GAD. The HADS-D had by optimal cut off ≥8 (sensitivity 0.80 and specificity 0.88) AUC 0.93 and 87% of the patients were correctly classified in relation to MDE. Proportions of the total correctly classified at the CGI-S optimal cut-off ≥3 were 83% of patients for GAD and 81% for MDE.
The results indicate that addition of the patients' HADS scores to GPs' information could improve their diagnostic accuracy of GAD and MDE.
An important task for general practitioners (GPs) is to diagnose and treat depressions and anxiety disorders, which are among the most common and amenable mental disorders in their practice . The literature shows that the GPs' diagnostic skills concerning these common disorders are moderately good [1–8], and somewhat better for major depressive episodes (MDE) than for generalized anxiety disorder (GAD) . A prospective cohort study of depression in primary care, found that the WHO-5 well being index (WHO-5, 5 items) had significantly higher sensitivity than the GPs' clinical diagnosis when the Composite International Diagnostic Interview (CIDI) was used as gold standard . The depression module of the brief patient health questionnaire (B-PHQ, 9 items) had significantly higher specificity than GPs' clinical diagnoses, and GPs' diagnoses had significantly higher specificity than the WHO-5. The sensitivity and specificity of screening instruments for GAD in general practice has hardly been investigated [10, 11].
Reviews [12, 13] show that the Hospital Anxiety and Depression Rating Scale (HADS)  is widely used as a brief self-rating instrument for both dimensional and categorical aspects of anxiety and depression in both epidemiology and specialist care. In these settings the psychometric properties of the HADS are excellent [15, 16]. Until now the factor structure, the internal consistency, and the inter-correlation and homogeneity of the HADS sub-scales have not been described in the context of general practice. The case-finding abilities of the HADS in relation to DSM-III/DSM-IV and ICD-10 defined anxiety disorders and depressions by the use of a score ≥ 8 as cut-off are considered good with few false negatives, but a definite proportion of false positives. In clinical practice a positive screening typically results in further evaluation. Considering the brevity and feasibility of the HADS, it should be useful for screening of patients in general practice, but studies of the HADS from that part of the health services are few and inconsistent as to cut-off scores for caseness [17–20]. These points indicate the need for more data on the case-finding abilities of the HADS subscales in general practice.
Aims of the study
This study from Norwegian general practice has the following aims: 1) To examine the psychometric features of the HADS rated by patients in the primary care setting; 2) To test the case-finding properties of the HADS in relation to the diagnoses of GAD and MDE based on patient-rating of their diagnostic criteria according to DSM-IV as reference standards; and 3) To compare the case finding abilities of the HADS rated by patients to that of GPs using the Clinical Global Impression-Severity (CGI-S).
Sampling of GPs
The GPs in various parts of Norway were recruited as a convenience sample among those registered in the database of Wyeth Norway Ltd. The procedural information to the GPs was given in writing, and no special training of them for the study was undertaken. Among 141 participating GPs, 136 were eligible and 133 gave demographic data. Ninety GPs (68%) were men and 43 (32%) women. They had been working in primary care for a mean of 15 (SD 7) and 11 (SD 7) years, respectively, and 118 (89%) of them worked in group practice. The GPs consulted with a mean of 21.1 (SD 5.1) patients on an average day. There were no significant differences between genders of GPs with regard to number of consultations.
Sampling of patients
DSQ/HADS sample N = 1,385
GAS-Q/HADS sample N = 1,238
Age, mean (SD):
Married /paired relationship
On sick leave:
DSQ / GAS-Q positive
HADS-D / HADS-A (≥ 8)
CGI-S (dep / gad, ≥ 3)
Diagnostic criteria and instruments
Psychiatric classification systems like DSM-IV and ICD-10 are based on the presence or absence of various operationalized diagnostic criteria. When structured interviews are used, the patients are asked for the presence of the diagnostic criteria by an interviewer. In contrast, in this study the patients rate themselves the diagnostic criteria for GAD (DSM-IV) on the Generalized Anxiety Questionnaire (GAS-Q) and for MDE (DSM-IV) on the Depression Screening Questionnaire (DSQ), and these patient ratings are used as diagnostic reference standard in this study.
The GAS-Q is a modification of the Anxiety Screening Questionnaire , and is a self-rating questionnaire developed to diagnose GAD according to DSM-IV and ICD-10. The GAS-Q consists of 20 items covering the diagnostic criteria for GAD in the DSM-IV. Test-retest reliability of the GAS-Q over a two-day retest period showed a kappa value of 0.74 for the diagnosis of GAD. Congruent validity comparing GAS-Q diagnosis with the DSM IV algorithm for GAD of the Composite International Diagnostic Interview showed a kappa of 0.72 .
The DSQ was made for patient-rating of MDE according to DSM-IV and ICD-10  and was chosen as our reference standard. The DSQ is an 11 item questionnaire in which diagnostic criteria are rated on a three point scale, supplemented by three questions to assess the age at first and current episode, and the number of episodes according to the criterion A of MDE in DSM-IV. Consistent with the DSM-IV criteria, a diagnosis of MDE was assigned when at least five of the items were rated as positive by the patient. In the German part of the European study, the internal consistency of the DSQ showed a Cronbach's coefficient alpha of 0.83 . Test-retest reliability over a two-day period found a kappa value of 0.82 for MDE . Tests of the DSQ diagnosis versus diagnosis of MDE based on structured interview showed a kappa 0.89 .
The HADS consists of seven items for anxiety (HADS-A) and seven for depression (HADS-D). The items are scored on a four-point scale from zero (not present) to three (considerable). The item scores are added, giving sub-scale scores on the HADS-A and the HADS-D from zero to 21. In this study valid HADS subscale scores were defined as having answered at least five of seven items on both the HADS-A and the HADS-D. In order to be valid in patients with somatic problems, the HADS items were based on the psychological aspects of anxiety and depression. The anxiety items were concentrated on general anxiety, and five of the items were close to the diagnostic criteria of GAD. The depression items were based on anhedonia, which is considered to be one of the essential criteria of depression . The concurrent validity of the HADS compared to other questionnaires for anxiety and depression is described between 0.60 and 0.80 for both sub-scales .
The CGI-S is a standardized assessment tool that is widely used as an outcome measure in research . The CGI-S had the following wording: "In your clinical judgement how severely does this patient suffer from MDE/GAD?" The ratings of CGI-S were: 1 = not ill at all, 2 = a borderline case, 3 = only mildly ill, 4 = moderately ill, 5 = seriously ill and 6 = extremely seriously ill. The CGI-S scale was dichotomised into 1–2 = not ill, 3–6 = ill, but we also explored the frequency of cases by a CGI-S score of ≥ 2 (= borderline case).
The statistical analyses were carried out with the SPSS for Windows, version 11.0. Principal Component Analysis (PCA) with oblique rotation was performed to explore the factor structure of the HADS. Internal consistency of the HADS-A and the HADS-D was tested using Cronbach's coefficient alpha. Pearson's correlation coefficient was used for estimation of the overlap between the subscales. Sensitivity and specificity were calculated for different cut-off values for the HADS-A, the HADS-D, and the CGI-S in relation to the prevalence rate of GAD identified with GAS-Q and the rate of MDE identified with DSQ. Sensitivities and specificities by optimal cut-off were used to calculate the rates of true and false positive and negative cases. The Receiver Operating Characteristics (ROC-curve) were depicted graphically, and the Area Under the Curve (AUC) were calculated for the HADS-A, the HADS-D and the CGI-S against the GAS-Q and the DSQ as reference standards. The associations of age and gender to caseness on the instruments were examined by logistic regression analyses. All significance tests were two-tailed, and p-values < .05 were reported as significant.
The Committee for Medical Ethics of Health Region East of Norway approved this study. The participants delivered informed consent after written information about the study. Wyeth Norway Ltd paid the GPs a fixed sum of EUR 15 per patient in addition to their normal salary. No employees of Wyeth Ltd. were present in any of the general practices during the day of inclusion. The national study leader coordinated the study, and Wyeth Norway Ltd functioned as sponsor of the study. This implied that employees of Wyeth Norway Ltd brought the material for the study to the GPs and later on collected the forms, but otherwise had no active part in the study. The company made no use of the collected data or analyses in their marketing. The study leader and his co-authors had no restrictions as to the content of the publications from Wyeth Norway Ltd, and the company did not want to review any manuscripts before submission.
According to the DSQ, 9.0% (CI 7.6 – 10.7%) of patients had MDE, and based on the GAS-Q 5.9% (CI 4.7 – 7.4%) had GAD. Prevalence rates for HAD-D (≥8) and HADS-A (≥8) were 18.5% (CI 16.5 – 20.6%) and 28.8% (CI 26.3 – 31.5%), respectively. According to GPs' clinical judgement by CGI-S (≥ 3) the prevalence rates were 24.3% (CI 22.1 – 26.7%) for MDE and 17.5% (CI 15.5 – 20.0%) for GAD. The associations between female gender and CGI-S caseness of depression (OR 1.5, p = 0.004, CI 1.3–1.9) and HADS-A caseness of anxiety disorder (OR 1.4, p = 0.013, CI 1.1 – 1.8) were both significant. Age significantly reduced the prevalence of caseness with 1–3 % on DSQ, GAS-Q and HADS-A. Based on a GPs' CGI-S score cut-off ≥ 2, the prevalence rates were 38% for MDE and 25% for GAD.
Psychometrics of the HADS
The internal consistency of the HADS-A and the HADS-D showed coefficient alpha of 0.89 and 0.86, respectively. PCA with varimax rotation of all 14 HADS items, extracted two factors both with Eigen-value of 4.13, and that factor solution comprised 59% of the explained variance. Anxiety and depression items loaded on separate factors. The anxiety and depression sub-scales shared 54% of the explained variance.
Case-finding abilities of the HADS
Sensitivity and specificity for HADS-A/D and CGI-S.
Generalized Anxiety Disorder
(N = 1,238)
Major Depressive Episode
(N = 1,385)
HADS – A/D
Comparison of GP-rated and patient-rated case identification
Using the GPs' CGI-S score of ≥ 3 as cut-off for a positive diagnosis, GAD was detected with a sensitivity of 0.52 and a specificity of 0.85 (Table 2). MDE was detected with a sensitivity of 0.79 and specificity of 0.81 by the same CGI-S cut-off level. Identification of GAD and MDE with the CGI-S showed AUCs of 0.77 (Figure 2) and 0.87 (Figure 3), respectively.
Classification of patients with eventual GAD and MDE.
Generalized Anxiety Disorder (GAS-Q Positive 73/1,238 = 5.9%)
Major Depressive Episode (DSQ Positive 125/1,385 = 9.0%)
Patients' HADS-A ≥ 8
GPs' CGI-S ≥ 3
Patients' HADS-D ≥ 8
GPs' CGI-S ≥ 3
Sensitivity / Specificity
0.89 / 0.75
0.52 / 0.85
0.80 / 0.88
0.79 / 0.81
% (95% CI)
% (95% CI)
% (95% CI)
% (95% CI)
True positive disorder
False positive disorder
True positive healthy
False positive healthy
Total rightly classified
Total wrongly classified
For MDE no significant difference was observed between rates of true positive disorder (Table 3). For true non-depression rate, the HADS-D (80%) showed a significantly better hit rate than the CGI-S (74%). The proportion of totally right classified depressed patients was significantly better for the patient-rated HADS-D (87%) than for the GP-rated CGI-S (81%).
Strengths and limitations
Compared to former studies from general practice the high number of patients and GPs in our study is a strength due to increased variance and reduced biases. The big sample sizes and a responder-rate above 70% among patients give adequate statistical power to the performed analyses. Our sample consists of geographically spread GPs who's working experience and gender distribution is representative for GPs in Norway . Patients' age and gender is representative for patients attending GPs in Scandinavia . We also consider as a strength that the GPs were blind to the HADS scores of the patients when they made their diagnostic evaluations.
It is a weakness of our study that we did not employ structured interviews for the establishment of reference standard diagnoses of GAD and MDE. However, the reference standards used by us comprise the same diagnostic criteria, are well described, and have shown good validity in relation to structured interviews [23, 25]. When both the HADS and the reference standards are self-rating instruments, the HADS might be systematically biased with falsely high sensitivity and/or specificity in relation to the reference standard. On the other hand, an interview could introduce observer bias in the interpretation of symptoms, which is eliminated using self-ratings. The reference standard questionnaires used in our study gave prevalence rates for GAD and MDE in general practice that were in accordance with the prevalence rates reported by Üstün & Sartorius , and this added some validity to our approach. Our design did not take into account the GPs' knowledge about the patients' somatic symptoms or psychosocial situation, which could be relevant information for the GPs in their diagnostic considerations. However, studies have shown that in non-clinical samples chronic somatic problems  and demographic variations  have only modest influence on the HADS scores.
The use of the CGI-S as a diagnostic instrument could be discussed since the instrument only evaluates the severity of the case. Severity is not a clear concept, and it is implicit in such ratings that the GPs are familiar with both mild and severe cases of GAD and MDE, although that hardly is the case. Further, the GPs could be biased in direction of false positive diagnoses since they took part in a sponsored study concerning these mental disorders.
Comparison with existing literature
The internal consistency of the HADS was found in accordance with other studies [16, 13]. The replication of the original two-factor structure of the HADS among primary care attenders has been discussed. A Dutch validation study  found evidence for the original two-factor structure among a sample (n = 112) of consecutive general practice patients. Data from a large non-clinical population give support to a s two-factor-structure of the HADS  in sub-samples with higher mental symptom levels than in the general population. In our sample from general practice, the HADS showed good separation of items, moderate inter-correlation, and a distinct two-factor structure. These results support the robustness of the HADS as a psychometrically adequate self-rating instrument for patients attending general practice.
An optimal balance between sensitivity and specificity is requested of a good questionnaire. From a clinical perspective high sensitivity might be seen the most important concern for a screening instrument, giving minimal number of false negative cases at the sacrifice of some false positive cases.
In general we found that the patient-rated HADS-A/D had better diagnostic ability than CGI-S rated by GPs (Figure 2 and 3) in relation to GAD and MDE. However, taking into regard the prevalence rate of 5.9% of GAD in general practice, the GPs' ability to recognise people not suffering from GAD (Table 3) is significant superior to that of the HADS-A and important in the clinical setting. With a prevalence rate of 9% for MDE in general practice the total proportion of patients correctly identified by HADS-D was significantly higher than that of GPs using the CGI-S.
Implications for future research or clinical practice
HADS showed satisfying psychometric properties in the general practice setting, which is of importance for future research. We found that GPs mainly recognized GAD by exclusion and MDE by inclusion, but still they had a considerable proportion of misclassifications. GPs' diagnostic precision in clinical practice is improved by supplementing HADS scores. The advantage of HADS is its feasibility of completion and well-established cut-off scores for clinically relevant caseness.
The psychometrics of the HADS was found to be excellent in this sample from general practice. The recommended cut-off score for caseness on the HADS-A and the HADS-D of ≥8 seemed appropriate for detecting GAD and MDE among patients attending primary care. In regard to prevalence rates, the GPs should positively trust their sensitivity in diagnosing MDE, and their specificity in diagnosing GAD by exclusion of patients without anxiety. Patient-rated HADS could represent a useful supplement to GPs' own clinical judgment.
Hospital Innlandet Trust, Division Psychiatry supported the study. We thank patients and general practitioners contributing to the study and Wyeth Norway Ltd for getting the material to our disposal.
- Üstün TB, Sartorius N: Mental illness in general health care. 1995, Chichester: WileyGoogle Scholar
- Goldberg D, Steele JJ, Johnsen A, Smith C: Ability of primary care physicians to make accurate ratings of psychiatric symptoms. Arch Gen Psychiatry. 1982, 39: 829-33.View ArticlePubMedGoogle Scholar
- Jencks SF: Recognition of mental distress and diagnosis of mental disorders in primary care. JAMA. 1985, 253: 1903-7. 10.1001/jama.253.13.1903.View ArticlePubMedGoogle Scholar
- Feinstein RE, Brewer AA, Editors: Primary care psychiatry and behavioral medicine. 1999, New York: Springer
- Munk-Jørgensen P, Fink P, Brevik JI, Dalgard OS, Engberg M, Hansson L, Holm M, Joukamaa M, Karlsson H, Lehtinen V, Nettbladt P, Stefansson C, Sørensem L, Jensen J, Borgquist L, Sandanger I, Nordström G: Psychiatric morbidity inn primary public health care: a multicentre investigation. Part II. Hidden morbidity and choice of treatment. Acta Psychiatr Scand. 1997, 95: 6-12.View ArticlePubMedGoogle Scholar
- Kessler RC: The epidemiology of pure and comorbid generalized anxiety disorder: a review and evaluation of recent research. Acta Psychiatr Scand Suppl. 2000, 7-13. 10.1111/j.0065-1591.2000.acp29[dash]02.x. 406
- Olfson M, Marcus SC, Druss B, Elinson L, Tanielian T, Pincus HA: National trends in the outpatient treatment of depression. JAMA. 2002, 287: 203-9. 10.1001/jama.287.2.203.View ArticlePubMedGoogle Scholar
- Wittchen HU, Kessler RC, Beesdo K, Krause P, Höfler M, Hoyer J: Generalized anxiety and depression in primary care: prevalence, recognition, and management. J Clin Psychiatry. 2002, 63 (Suppl 8): 24-34.PubMedGoogle Scholar
- Henkel V, Mergl R, Kohnen R, Maier W, Möller HJ, Hegerl U: Identifying depression in primary care: a comparison of different methods in a prospective cohort study. BMJ. 2003, 326: 200-201. 10.1136/bmj.326.7382.200.View ArticlePubMedPubMed CentralGoogle Scholar
- Hoyer von J, Krause P, Höfler M, Beesdo H, Wittchen HU: When and how well does the family physician recognize generalized anxiety disorders and depressions?. Fortschr Medizin. 2001, 119 (Suppl 1): 26-35. (in German)Google Scholar
- Beesdo K, Krause P, Höfler M, Witchen HU: Do primary care physicians know generalized anxiety disorders? Estimations of prevalence, attitudes and interventions. Fortschr Medizin. 2001, 119 (Suppl 1): 13-16. (in German)Google Scholar
- Herrmann C: International experiences with the hospital anxiety and depression scale – a review of validation data and clinical results. J Psychosom Res. 1997, 42: 17-41. 10.1016/S0022-3999(96)00216-4.View ArticlePubMedGoogle Scholar
- Bjelland I, Dahl AA, Haug TT, Neckelmann D: The validity of the hospital anxiety and depression scale. An updated literature review. J Psychosom Res. 2002, 52: 69-77. 10.1016/S0022-3999(01)00296-3.View ArticlePubMedGoogle Scholar
- Zigmond AS, Snaith RP: The hospital anxiety and depression scale. Acta Psychiatr Scand. 1983, 67: 361-70.View ArticlePubMedGoogle Scholar
- Moorey S, Greer S, Watson M, Gorman C, Rowden L, Tunmore R, Robertson B, Bliss J: The factor structure and factor stability of the hospital anxiety and depression scale in patients with cancer. Br J Psychiatry. 1991, 158: 255-259.View ArticlePubMedGoogle Scholar
- Mykletun A, Stordal E, Dahl AA: Hospital anxiety and depression (HAD) scale: factor structure, item analyses and internal consistency in a large population. Br J Psychiatry. 2001, 179: 540-544. 10.1192/bjp.179.6.540.View ArticlePubMedGoogle Scholar
- EL Rufaie OE, Absood GH: Retesting the validity of the arabic version of the hospital anxiety and depression (HAD) scale in primary heath care. Soc Psychiatry Psychiatr Epidemiol. 1995, 30: 26-31. 10.1007/BF00784431.View ArticlePubMedGoogle Scholar
- Lam CL, Pan PC, Chan AW, Chan CY, Munro C: Can the hospital anxiety and depression (HAD) scale be used on Chinese elderly in general practice?. Family Pract. 1995, 12: 149-154.View ArticleGoogle Scholar
- Wilkinson MJ, Barczak P: Psychiatric screening in general practice: comparison of the general health questionnaire and the hospital anxiety depression scale. J Royal Coll Gen Practit. 1988, 38: 311-313.Google Scholar
- Löwe B, Spitzer RL, Gräfe K, Kroenke K, Quenter A, Zipfel S, Buchholz C, Witte S, Herzog W: Comparative validity of three screening questionnaires for DSM-IV depressive disorders and physicians diagnoses. J Affect Dis. 2004, 78: 131-140. 10.1016/S0165-0327(02)00237-9.View ArticlePubMedGoogle Scholar
- Allgulander C, Nilsson B: A nationwide study in primary health care. One out of four patients suffers from anxiety and depression. Lakartidningen. 2003, 100: 832-838. (in Swedish)PubMedGoogle Scholar
- Wittchen HU, Boyer P: Sensitivity and specificity of the anxiety screening questionnaire (ASQ-15). Br J Psychiatry. 1998, 173 (Suppl 34): 10-17.Google Scholar
- Krause P, Wittchen HU, Höfler M, Winter S, Spiegel B, Pfister H: Generalisierte angst und depression in der allgemeinartztpraxis (GAD-P). Fortschr Medizin. 2001, 119 (suppl 1): 5-12.Google Scholar
- Winter S, Wittchen HU, Höfler M, Spiegel B, Ormel H, Müller N, Pfister H: Design und metoden der studie "Depression 2000". Fortschr Medizin. 2000, 118 (suppl 1): 11-21.Google Scholar
- Höfler M., Wittchen HU: Why do primary care doctors diagnose depression when diagnostic criteria are not met?. Int J Methods Psychiat Res. 2000, 9: 110-120.View ArticleGoogle Scholar
- Wittchen HU, Höfler M, Meister W: Prevalence and recognition of depressive syndromes in German primary care settings: poorly recognized and treated?. Int Clin Psychopharmacol. 2001, 16: 121-135. 10.1097/00004850-200105000-00001.View ArticlePubMedGoogle Scholar
- Watson D, Clark LA, Weber K, Assenheimer JC, Strauss ME, McCormick RA: Testing a tripartite model: II. Exploring the symptom structure of anxiety and depression in student, adult, and patient samples. J Abnorm Psychol. 1995, 104: 15-25. 10.1037/0021-843X.104.1.15.View ArticlePubMedGoogle Scholar
- Guy W: Clinical global impressions scale. ECDEU assessment manual for psychopharmacology. US Dept Health, Education, and Welfare publication (AMD) 76–338. 1976, Rockville, Md: National Institute of Mental Health, 221-227.Google Scholar
- Statistics and research on physicians in Norway. [http://www.legeforeningen.no/]
- Engstrom S, Foldevi M, Borgquist L: Is general practice effective? A systematic literature review. Scand J Prim Health Care. 2001, 19: 131-144. 10.1080/028134301750235394.View ArticlePubMedGoogle Scholar
- Crawford JR, Henry JD, Crombie C, Taylor EP: Normative data for the HADS from a large non-clinical sample. Br J Clin Psychol. 2001, 40: 429-434. 10.1348/014466501163904.View ArticlePubMedGoogle Scholar
- Spinhoven PH, Ormel J, Sloekers PPA, Kempen GI, Speckens AE, Van Hemert AM: A validation study of the hospital anxiety and depression scale (HADS) in different groups of Dutch subjects. Psychol Med. 1997, 27: 363-370. 10.1017/S0033291796004382.View ArticlePubMedGoogle Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-244X/5/46/prepub