1. Equivalency of the diagnostic accuracy of the PHQ-8 and PHQ-9: a systematic review and individual participant data meta-analysis.
- Author
-
Wu Y, Levis B, Riehm KE, Saadat N, Levis AW, Azar M, Rice DB, Boruff J, Cuijpers P, Gilbody S, Ioannidis JPA, Kloda LA, McMillan D, Patten SB, Shrier I, Ziegelstein RC, Akena DH, Arroll B, Ayalon L, Baradaran HR, Baron M, Bombardier CH, Butterworth P, Carter G, Chagas MH, Chan JCN, Cholera R, Conwell Y, de Man-van Ginkel JM, Fann JR, Fischer FH, Fung D, Gelaye B, Goodyear-Smith F, Greeno CG, Hall BJ, Harrison PA, Härter M, Hegerl U, Hides L, Hobfoll SE, Hudson M, Hyphantis T, Inagaki M, Jetté N, Khamseh ME, Kiely KM, Kwan Y, Lamers F, Liu SI, Lotrakul M, Loureiro SR, Löwe B, McGuire A, Mohd-Sidik S, Munhoz TN, Muramatsu K, Osório FL, Patel V, Pence BW, Persoons P, Picardi A, Reuter K, Rooney AG, Santos IS, Shaaban J, Sidebottom A, Simning A, Stafford L, Sung S, Tan PLL, Turner A, van Weert HC, White J, Whooley MA, Winkley K, Yamada M, Benedetti A, and Thombs BD
- Subjects
- Depressive Disorder, Major classification, Female, Humans, Interviews as Topic, Male, Middle Aged, Sensitivity and Specificity, Depressive Disorder, Major diagnosis, Mass Screening methods, Patient Health Questionnaire
- Abstract
Background: Item 9 of the Patient Health Questionnaire-9 (PHQ-9) queries about thoughts of death and self-harm, but not suicidality. Although it is sometimes used to assess suicide risk, most positive responses are not associated with suicidality. The PHQ-8, which omits Item 9, is thus increasingly used in research. We assessed equivalency of total score correlations and the diagnostic accuracy to detect major depression of the PHQ-8 and PHQ-9., Methods: We conducted an individual patient data meta-analysis. We fit bivariate random-effects models to assess diagnostic accuracy., Results: 16 742 participants (2097 major depression cases) from 54 studies were included. The correlation between PHQ-8 and PHQ-9 scores was 0.996 (95% confidence interval 0.996 to 0.996). The standard cutoff score of 10 for the PHQ-9 maximized sensitivity + specificity for the PHQ-8 among studies that used a semi-structured diagnostic interview reference standard (N = 27). At cutoff 10, the PHQ-8 was less sensitive by 0.02 (-0.06 to 0.00) and more specific by 0.01 (0.00 to 0.01) among those studies (N = 27), with similar results for studies that used other types of interviews (N = 27). For all 54 primary studies combined, across all cutoffs, the PHQ-8 was less sensitive than the PHQ-9 by 0.00 to 0.05 (0.03 at cutoff 10), and specificity was within 0.01 for all cutoffs (0.00 to 0.01)., Conclusions: PHQ-8 and PHQ-9 total scores were similar. Sensitivity may be minimally reduced with the PHQ-8, but specificity is similar.
- Published
- 2020
- Full Text
- View/download PDF