Validity of the Patient Health Questionnaire-9 (PHQ-9) for depression screening in adult primary care users in Bucaramanga, Colombia

Cassiani-Miranda, Carlos Arturo; Cuadros-Cruz, Angy Karina; Torres-Pinzón, Harold; Scoppetta, Orlando; Pinzón-Tarrazona, Jhon Henrry; López-Fuentes, Wendy Yulieth; Paez, Andrea; Cabanzo-Arenas, Diego Fernando; Ribero-Marulanda, Sergio; Llanes-Amaya, Elkin René

doi:10.1016/j.rcpeng.2019.09.002

Article information

Abstract

Full Text

Bibliography

Download PDF

Statistics

Figures (2)

Tables (3)

Table 1. Description of the sociodemographic characteristics of patients with or without minor depressive symptoms receiving healthcare at Primary Care centres.

Table 2. Description of the internal consistency of the PHQ-9 Colombian version: Cronbach's α and McDonald's ω by item.

Table 3. Description of the various cut-off points for the PHQ-9 Colombian version and validity coefficients.

Show moreShow less

Abstract

The patient health questionnaire-9 (PHQ-9) is one of the most widely used self-report instruments in primary care. There is no criterion validity of the PHQ-9 in Colombia. The objective was to validate the PHQ-9 as a screening tool in primary care. A cross-sectional, scale criterion validity study was performed using as reference criterion the mini neuropsychiatric interview (MINI) in male and female adult users of primary care centres. We calculated the internal consistency and convergent and criterion validity of the PHQ-9 by analysing the receiver operating characteristics (ROC) and the area under the curve (AUC). We analysed 243 participants; 184 (75.7%) were female. The average age was 34.05 (median of 31 and SD=12.47). Cronbach's α was 0.80 and McDonald's ω was 0.81. Spearman's Rho was 0.64 for HADS-D (P<0.010) and 0.70 for PHQ-2 (P<0.010). The AUC was 0.92 (95% CI 0.880–0.963). The optimal cut-off point of PHQ-9 was ≥7: sensitivity of 90.38 (95% CI: 81.41–99.36); specificity of 81.68 (95% CI: 75.93–87.42); PPV 57.32 (95% CI: 46.00–68.63); NPV 96.89 (95% CI: 93.90–99.88); Youden index 0.72 (95% CI: 0.62–0.82); LR+ 4.93 (95% CI: 3.61–6.74); LR− 0.12 (95% CI: 0.005–0.270). In sum, the Colombian version of PHQ-9 is a valid and reliable instrument for depression screening in primary care in Bucaramanga, with a cut-off point ≥7.

Keywords:

PHQ-9

Reproducibility of results

Screening

Depression

Primary healthcare

Colombia

Resumen

El Cuestionario de salud del paciente-9 (PHQ-9) es uno de los instrumentos de autoinforme más utilizado en Atención Primaria (AP). No existe validez de criterio del PHQ-9 en Colombia. El objetivo fue realizar la validez de criterio del PHQ-9 como instrumento de cribado en AP. Se realizó un estudio trasversal de validez de criterio de una escala usando como criterio de referencia la minientrevista neuropsiquiátrica (MINI) en usuarios adultos de centros de AP de ambos sexos. Se calcularon la consistencia interna y la validez convergente y de criterio del PHQ-9 mediante el análisis de las características operativas del receptor (COR) y el área bajo la curva (ABC). Participaron 243 pacientes, 184 (75,7%) fueron de sexo femenino. El promedio de edad fue 34,05 (mediana 31 y DE=12,47). El α de Cronbach fue 0,80 y ω de McDonald, 0,81. La rho de Spearman fue 0,64 para HADS-D (p<0,010) y 0,70 para PHQ-2 (p<0,010). El ABC fue 0,92 (IC del 95%, 0,880-0,963). El punto de corte óptimo del PHQ-9 fue ≥ 7: sensibilidad de 90,38 (IC del 95%: 81,41-99,36); especificidad de 81,68 (IC del 95%: 75,93-87,42); el VPP 57,32 (IC del 95%: 46,00-68,63); el VPN 96,89 (IC del 95%: 93,90-99,88); índice de Youden 0,72 (IC del 95%: 0,62-0,82; LR+ 4,93 (IC del 95%: 3,61-6,74); LR– 0,12 (IC del 95%: 0,005-0,270). En conclusión, la versión colombiana del PHQ-9 es un instrumento válido y confiable para el cribado de depresión en AP de Bucaramanga, con un punto de corte ≥ 7.

Palabras clave:

PHQ-9

Reproducibilidad de los resultados

Cribado

Depresión

Atención primaria de salud

Colombia

Full Text

Introduction

Depression is a major public health problem worldwide1 and has a significant impact on quality of life,2 high morbidity levels,3 reduced life expectancy4 and excess mortality.5 The lifetime prevalence of major depressive disorder (MDE) is 11.2%.6 Prevalences tend to be higher in low and middle income countries such as Pakistan, where depression prevalences of 45.9% have been reported.7 In primary care (PC), the prevalence of MDE varies significantly in a range from 4.5% to 47.8%.8

In the 2015 Colombian National Mental Health Survey, the prevalence of major depression in the general population was 5.4 (95% CI: 4.6–6.4), 2.3 (95% CI: 1.8–2.9) and 0.8 (95% CI: 0.5–1.3) for lifetime, the last year and the last month, respectively.9 In Bucaramanga, the prevalence of clinically significant depressive symptoms (CSDS) was 22.3% (95% CI: 20.0–24.6) and 11.2% for major depressive disorder (MDD) (95% CI: 9.7–12.9%).10 A later population study in adults living in Bucaramanga (N=266), using the Structured Clinical Interview for DSM-IV Axis I Disorders (SCID-I), reported a prevalence of 16.5% (95% CI: 12.3–21.6),11 which confirms the high prevalence of depression in this region.

In spite of its high burden, chronicity and recurrent nature, depression is underdiagnosed in PC, as approximately 50% of patients who present depression will not be detected.12 This diagnostic gulf could be explained by the fact that more than 75% of patients with depression initially consult a family or PC physician with little training in the identification of depressive disorders,13 time constraints in busy PC environments14 and the lack of validated screening tools in low and middle income countries.15

Because of the above, programmes have been developed to recognise depression,16,17 which recommend standardised tools. A number of tools exist to identify cases of depression; however, their benefits have not been fully determined and the literature reveals contradictory results.18 A recent systematic review suggests that of the screening tools, only the Patient Health Questionnaire-9 (PHQ-9) attains the optimum accuracy level for depression.19 The PHQ-9 is an adjectival scale derived from the Primary Care Evaluation of Mental Disorders (PRIME-MD) to assess depressive symptoms using the DSM-IV criteria.20 The PHQ-9 is shorter than most of the depression screening scales21,22 and is considered the best screening tool for depression in PC due to its accuracy, brevity, being in the public domain and multipurpose, and ease of administration, scoring and interpretation.19,23 The PHQ-9 has been translated into more than 20 languages and used in many countries and contexts.24 In PC, the sensitivity of the PHQ-9 was between 0.71 and 0.84 (mean 0.77) and its specificity was between 0.90 and 0.97 (mean 0.94),25 confirming its adequate psychometric performance in PC, albeit with some variations in the cut-off point (COP) and psychometric parameters that can be explained by the influence of cultural aspects in the response pattern.23 Its broad use is also supported by the findings of Williams et al., who concluded, in an analysis of more than 38 studies with more than 32,000 PC patients, that the PHQ-9 was equal or superior to other measures of depression.22 In addition, the DSM-5 MDD working group and the NICE guidelines consider the PHQ-9 the preferred measure to assess the presence of the depression and quantify its severity.21,22,26

The PHQ-9 has been evaluated in Colombia in university students27: however, it was not compared to a gold standard. The PHQ-9 criteria therefore need to be validated in PC in Colombia against a gold standard, in particular due to the opportunity PC services represent in the early detection of depression.28 As a result, this study's objective was to assess the validity of the PHQ-9 criteria, comparing it with the Mini-International Neuropsychiatric Interview (MINI) for screening for depressive symptoms in adult PC users in the Bucaramanga metropolitan area.

Materials and methodsDesign

This study was designed and analysed based on the recommendations of the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) declaration.29 An analytical observational study of the validity of a scale's criteria was conducted using reference criteria.

Participants

Local PC users of both genders, aged 18–65 years were included. The PC centres belong to the Instituto de Salud de Bucaramanga [Bucaramanga Health institute] (ISABU), a State social enterprise that coordinates primary health care services in the Bucaramanga metropolitan area.

Subjects with psychoactive symptoms, cognitive decline, delirium or an intellectual disability that might prevent them from responding to the tools, those under the effects of psychoactive substances, with functional changes in vision or hearing that might prevent them from understanding the content of the survey, and those who did not understand Spanish were excluded. The sample size was calculated to evaluate the hypothesis regarding the characteristics of a diagnostic test30:

where π1 is the sensitivity of the standard (0.96) and π2 is the anticipated sensitivity of the PHQ-9 (0.88); Z1−α/2 was set at 1.96 and Z1-B at 1.28; δ was set at 0.08 (π1−π2). The result was 214. Participants were selected consecutively as they attended the health centres until a maximum number of subjects above 214 had been surveyed.

Procedures

The study was approved the ethics committee of ISABU and the Universidad de Santander [University of Santander], taking into account the current international31 and national norms32 for research with human subjects.

The PHQ-9 was translated following the recommendations for adaptation of self-reporting tests.33 A direct translation from the original scale was performed by two independent certified bilingual translators; discrepancies between the two translations were discussed, then a backtranslation was done into English, which was reviewed by the research team to assess its closeness to the original scale. The translated scale was then reviewed by 10 psychiatrists with research expertise or clinical experience to verify whether the items were consistent with the construct of depression, who also commented on the comprehension and wording of the items. Ten people from the general population with a history of depression also gave their opinions on the comprehension of the questions. The research group analysed and incorporated the patients’ and experts’ observations to obtain the new Colombian version (Fig. 1). With the new version of the scale, a pilot test was conducted with 21 subjects with characteristics similar to the study subjects but in other centres. They answered the questions without difficulties and no adjustments to the grammatical structure were needed.

Fig. 1.

Patient Health Questionnaire (PHQ-9), version for Primary Care, Bucaramanga, Colombia.

The research team was trained in the structured psychiatric interview (MINI) and hetero-administration of the PHQ-9. The people responsible for administering the scales and structure interviews were professionals with clinical experience (four psychologists, two general practice residents and one psychiatrist) who received eight hours of training by the lead author, with theoretical and practical sessions, role playing, and observation of pilot interviews with feedback. The study participants were contacted in the waiting room as they arrived for outpatient appointments with a general practitioner for any reason. One member of the research group explained the nature of the study and gave them the informed consent form. The screening scales were read by trained members of the research team. After completing the PHQ-9, each participant was assessed on the same day in another consulting room by a different trained member of the team (psychologist or psychiatrist) who did not know their PHQ-9 result, to administer the MINI depression module. The questionnaires were reviewed by two independent reviewers and saved in a form generated in Excel.

ToolsPHQ-9

The PHQ-9 is a screening scale that measures the presence and severity of depressive symptoms.34 The PHQ-935 is made up of the nine symptoms from DSM-IV MDE criterion A.20 These nine items are arranged in the form of an adjectival scale that assesses the presence of the symptom in the last two weeks (“not at all”, “several days”, “more than half the days” and “nearly every day”), scored from 0 to 3 to give a score between 0 and 27.36

It can be self- or hetero-administered and is used both algorithmically to make a probable diagnosis of MDE or as a continuous measure of scores from 0 to 27 with cut-off points (COP) at 5, 10, 15 and 20 representing levels of depressive symptoms, i.e. mild, moderate, moderately severe and severe.34 The scores can also be used dichotomously based on a COP to classify subjects with or without CSDS.37 According to Kroenke et al., the psychometric characteristics of the PHQ-9 have a sensitivity of 88% and a specificity of 88%, adequate internal consistency (Cronbach's α of 0.86–0.89), a test–retest score of 0.84, a concordance between self-administered and evaluator-administered tests of 84% and an area under the curve (AUC) of 0.95.34 In this study a COP of 8 or more was used to identify cases of CSDS, based on the meta-analysis by Manea et al.23 and the study by Rancans et al. in PC.38

Mini-International Neuropsychiatric Interview

The MINI is a brief structured diagnostic interview that explores the diagnostic categories of the DSM-IV and the ICD-10.39 Its original version was developed by Sheehan et al.39 and Lecrubier et al.40 in the United States and France. It contains 130 questions organised into modules that assess 16 disorders from axis i of the DSM-IV and one personality disorder. The original version in English has a sensitivity range between 0.46 and 0.94 and a specificity between 0.72 and 0.97,39,40 excellent inter-rater (kappa 0.70) and test–retest reliability, and moderate validity of criteria compared to the Composite International Diagnostic Interview (CIDI) and the SCID-P.39,40 The MINI quickly gained international acceptance,41–43 has translated versions in 43 languages39 and its reliability and validity have been explored in its Italian,44 Japanese,45 Norwegian,46 Moroccan47 and Portuguese48 versions. The average administration time is 18.7±11.6min, with a mean of 15min.39 Together with the CIDI and the SCID-I, the MINI is considered a globally accepted gold standard for the diagnosis of mental disorders in clinical and research settings.49

Hospital Anxiety and Depression Scale

The Hospital Anxiety and Depression Scale (HADS) was designed by Zigmond and Snaith in 198350 to detect mood disorders, especially those associated with somatic symptoms. It consists of 14 items, with an anxiety subscale (odd items) and a depression subscale (even items). Each item is graded on a four-point frequency scale from 0 to 3. The HADS has been translated into most European languages, Arabic, Hebrew, Urdu, Japanese and Chinese51 and its reliability and validity has been demonstrated in numerous studies.52 In Colombia, it was validated in cancer patients, finding adequate internal consistency (Cronbach's α of 0.85), a COP of 8 for the anxiety subscale and 9 for the depression subscale.53 These psychometric properties were confirmed in a populational sample (n=1500) in several cities in Colombia.54 In this work, the version adapted by Rico et al. was used.53

Patient Health Questionnaire-2

The Patient Health Questionnaire-2 (PHQ-2) consists of the first two items of the PHQ-9, which are necessary to suspect the presence of depression according to the DSM-IV criteria.55 The scoring system is the same as for the PHQ-9 and scores range from 0 to 6. A COP of 3 is optimal for screening, but a recent meta-analysis suggests that a COP of 2 may increase its sensitivity.56 Patients who score positive for CSDS should be evaluated with the PHQ-9 to determine whether they meet the MDE criteria.57 Its clinical utility stems from the fact that it reduces the time taken in normal PC consultations, which are usually busy.58 The PHQ-2 has been found to have psychometric performance comparable to the PHQ-9, with good reliability, validity and sensitivity to change.56 In this work, a COP of 2 or above was used to identify patients with CSDS.59

Statistical analysis

The data were analysed in SPSS version 20.0,60 carefully verified and reviewed twice. A descriptive analysis of the qualitative and quantitative variables was performed. Cronbach's α and McDonald's ω coefficients were calculated to assess internal consistency; for concurrent validity, Spearman or Pearson correlations were estimated depending on the distribution of the variables. To assess the accuracy of the PHQ-9 as a screening tool compared to the MINI, the receiver operating characteristics (ROC) and AUC were analysed. The optimum COP for the PHQ-9 was determined taking into account validity indices: sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), positive and negative likelihood ratios (LR), Youden's index and ROC curve/AUC analysis.

ResultsParticipant characteristics

Three hundred and eighty-four users were contacted, of whom 95 did not agree to participate. Of the surveys conducted, 46 were discarded due to missing and inconsistent data; the analysis therefore included 243 participants, of whom 184 (75.7%) were female. The average age was 34.05 years with a standard deviation (SD) of 12.47 years. For male participants the average age was 33.59 years with a SD of 12.89 years, while for female participants the average was 34.20 years with a SD of 12.37 years. The sociodemographic characteristics of the sample can be seen in Table 1.

Table 1.

Description of the sociodemographic characteristics of patients with or without minor depressive symptoms receiving healthcare at Primary Care centres.

Variables	No.	%
Gender
Male	59	24.30%
Female	184	75.72

Area of origin
Urban	210	86.42
Rural	33	13.58

Marital status
Single	98	40.33
Married	53	21.88
Cohabiting	83	34.16
Divorced	5	2.06
Widowed	4	1.65

Education
Primary, not completed	14	5.75
Primary, completed	60	24.69
Secondary, not completed	36	14.81
Secondary, completed	88	36.21
Vocational, not completed	18	7.41
Vocational, completed	1	0.41
Technological, completed	3	1.23
University, not completed	3	1.23
University, completed	20	8.23

Socio-economic stratum
Stratum 1	103	42.39
Stratum 2	84	34.57
Stratum 3	47	19.34
Stratum 4	6	2.47
Stratum 5	3	1.23

The prevalence of CSDS was 27.2% according to the results of the PHQ-9 and 21.8% according to the MINI structured interview.

Internal consistency

A Cronbach's α coefficient of 0.80 and an ω coefficient of 0.81 were obtained. The overall internal consistency of the scale if each item is eliminated is shown in Table 2.

Table 2.

Description of the internal consistency of the PHQ-9 Colombian version: Cronbach's α and McDonald's ω by item.

PHQ-9 questions	Cronbach's α	McDonald's ω
Q1	0.774	0.780
Q2	0.767	0.772
Q3	0.795	0.800
Q4	0.776	0.784
Q5	0.792	0.798
Q6	0.778	0.782
Q7	0.788	0.794
Q8	0.798	0.805
Q9	0.799	0.802

Convergent validity

The Kolmogorov–Smirnov test was used to establish the normality of the variables, for the purpose of deciding the type of test to use for the concurrent validity analysis of the PHQ-9 against the PHQ-2 and the HADS depression subscale (HADS-D). These variables were not found to have a normal distribution, to Spearman's Rho was used. The Spearman's Rho was 0.646 for the HADS-D (P<0.010) and 0.701 for the PHQ-2 (P<0.010).

Criterion validity

The ROC curve (Fig. 2) and accuracy indices for the PHQ-9 produced the results shown in Table 3. The AUC was 0.92 (95% CI: 0.88–0.963).

Fig. 2.

Receptor operating characteristics (ROC) of the PHQ-9 Colombian version compared with the MINI as a reference standard for depression (N=146).

Table 3.

Description of the various cut-off points for the PHQ-9 Colombian version and validity coefficients.

Cut-off point	Sensitivity	Specificity	Youden's index	% correctly classified	PPV	NPV	LR+	LR−
≥3	0.98	0.43	0.42	55.14	0.25	0.99	1.73	0.04
≥4	0.96	0.58	0.54	65.84	0.30	0.99	2.27	0.07
≥5	0.96	0.71	0.67	76.54	0.39	0.99	3.34	0.05
≥6	0.94	0.77	0.71	80.66	0.44	0.99	4.09	0.07
≥7a	0.90a	0.82a	0.72a	83.54a	0.48a	0.98a	4.93a	0.12a
≥8	0.83	0.88	0.71	86.83	0.57	0.96	6.87	0.20
≥9	0.75	0.91	0.66	87.24	0.60	0.95	7.96	0.28
≥10	0.67	0.93	0.60	87.24	0.64	0.94	9.18	0.35
≥11	0.60	0.94	0.54	86.83	0.66	0.92	10.35	0.43
≥12	0.56	0.97	0.53	88.07	0.77	0.92	17.75	0.46
≥13	0.46	0.97	0.43	86.01	0.74	0.90	14.69	0.56
≥14	0.31	0.99	0.30	84.36	0.85	0.88	29.38	0.70
≥15	0.27	0.99	0.26	83.95	0.91	0.88	51.42	0.73

a

Psychometric indices associated with the optimum cut-off point of ≥7.

The optimum COP was a PHQ-9 score ≥7 (sensitivity 90.38 [95% CI: 81.41–99.36]; specificity 81.68 [95% CI: 75.93–87.42]; PPV 57.32 [95% CI: 46.00–68.63]; NPV 96.89 [95% CI: 93.90–99.88]; Youden's index 0.72 [95% CI: 0.62–0.82]; LR+ 4.93 [95% CI: 3.61–6.74]; LR− 0.12 [95% CI: 0.005–0.270]).

Discussion

To the best of our knowledge, this is the first study on PHQ-9 criterion validity in PC in Colombia. The prevalence of MDE in this study was 21.8%. The Colombian version of the PHQ-9 demonstrated excellent diagnostic performance as a depression screening tool, as can be seen from the ROC curve and AUC. The PHQ-9 also demonstrated an adequate balance of sensitivity and specificity at the COP of ≥7 when compared with the MINI as a reference standard, establishing the PHQ-9's adequate criterion validity. The comparison of PHQ-9 scores against those from the HADS-D and the α and ω coefficients demonstrated good convergent validity and adequate internal consistency.

The percentage of subjects classified as having CSDS based on the PHQ-9 with the pre-established COP was 27.2% (95% CI: 26.3–28.9), higher than the prevalence of 22.3% (95% CI: 20.0–24.6) found in Bucaramanga using the Zung Self-rating Depression Scale.10 This difference can be explained by the poor diagnostic performance of the Zung scale in the Colombian population.61 With regard to the prevalence of MDE based on the MINI, in this sample it was 21.8% (95% CI: 20.8–23.5), which is within the expected range based on a meta-analysis of 41 studies in PC with an adjusted global prevalence of 19.5% (95% CI: 15.7–23.7).62 However, the prevalence of MDE in this study is a little higher than the 16.5% (95% CI: 12.3–21.6) reported in previous studies in the general population in Bucaramanga,11 which can be explained by the fact that this study was conducted in people attending PC centres, where the prevalence of depression is higher than in the general population63 and by the large proportion of women.64

Cronbach's α coefficient was 0.80 and McDonald's ω coefficient was 0.81, indicating good internal consistency.65,66 For a self-reporting tool to be reliable, Cronbach's α and McDonald's ω need to be at least 0.70.67 The internal consistency found in this study is in keeping with a previous study in Colombia37 and with others conducted in different languages, whose coefficients ranged from 0.79 to 0.89.68,70,71

Previous studies have demonstrated that the PHQ-9 has adequate concurrent validity with various measured, including the Hamilton Depression Rating Scale (HAM-D), short health assessment forms and even the PHQ-2.72 In our study, total PHQ-9 scores showed a statistically significant positive correlation with HADS-D and PHQ-2 scores (Spearman's Rho of 0.64 [P<0.01] for HAM-D and 0.70 [P<0.01] for PHQ-2), in keeping with previous studies in which the Pearson's coefficients for the PHQ-9 with the HAM-D and the Beck Depression Inventory (BDI) were 0.52 and 0.76, respectively.68,73–75 Meanwhile, a study of patients with Parkinson's disease showed that the PHQ-9 correlated positively with the Self-rating Depression Scale and the 15-item Geriatric Depression Scale, with a Spearman's coefficient of 0.63 in both cases.76 The correlation coefficients found in this study confirm the convergent validity of the PHQ-9, with Spearman's correlation coefficients between 0.60 and 0.80 indicating a good or considerable positive correlation.77

With regard to the COP, various studies have recommended a COP of 10 in the PHQ-9 for the identification of MDE.34 For example, in a study of PC users in China, an optimum COP of 10 produced a sensitivity of 0.87 and a specificity of 0.8.69 However, a recent meta-analysis of 18 studies demonstrated that the optimum COP of the PHQ-9 could range from 8 to 11, depending on the population studied; nevertheless, the balance of sensitivity and specificity is maintained for a COP of 7 (5 of the 18 studies included).23 In our study, the COP of 7 appears to have given the optimum balance between sensitivity and specificity, which was confirmed with an additional measure of accuracy: Youden's index,78 defined as the maximum vertical distance between the ROC curve and the 45 degree line, as an indicator of how far the curve is from an uninformative test.79 Youden's index is a function of sensitivity (Se) and specificity (Sp); it is calculated as (Se+Sp-1)80 and should be considered alongside the ROC curve as they are usually related.81 The range is 0–100 when converted into a percentage. Values >50% are generally considered acceptable for diagnostic accuracy.82

The values associated with the COP in our study are consistent with a study in older adults in PC, in whom PHQ-9 criterion validity was assessed by administering the MINI, in which an optimum COP≥7 (sensitivity 0.92; specificity 0.78) demonstrated the best psychometric characteristics.83 Nevertheless, this COP of 7 is lower than that found in most studies with the PHQ-9 in other populations. The cultural and demographic characteristics of the samples may be the reason for this difference.84 Stigma is an important aspect that can also influence people's response pattern to depression screening scales in our population, causing shame in people with mental illnesses, which limits the identification of psychopathological phenomena.85,86 It is worth noting that PHQ-9 COPs tend to be lower in middle and low income countries87–89 compared with high income countries.75,90,91 However, there are no studies looking at this phenomenon. This difference in optimum COP highlights the importance of validating screening tools in different social and cultural contexts.92

For a COP of ≥7, the sensitivity and specificity of the PHQ-9 in this sample were 90% and 83%, respectively. These findings are consistent with the study by Wang et al., in which the COP of ≥7 allowed an adequate balance between sensitivity and specificity (sensitivity 85%; specificity 86%).93 The accuracy indices in our study are therefore considered appropriate, as the screening tool is considered good when its sensitivity is between 79% and 97% and its specificity is between 63% and 86%.94 Wittkampf et al. systematically reviewed the psychometric properties of the PHQ-9 and found a sensitivity of 77% (71–84%) and specificity of 94% (90–97%), including studies in subgroups with a high prevalence of depression, such as PC users.25

The LR+ and LR− of the PHQ-9 in our sample, for a COP of ≥7, were 4.93 and 0.12, respectively. This means that, in a similar clinical context, a positive result in the PHQ-9 (COP≥7) if five times more common in a patient with depression than in one without depression, while a subject with a negative result would have a likelihood of having depression of less than 2%.95 These results are comparable with those obtained in the Chinese version of the PHQ-9, which with a COP of ≥7, had an LR+ and LR- of 5.99 and 0.17, respectively.93

The AUC of this Colombian version of the PHQ-9 for PC was 0.92, which indicates a high degree of accuracy96 and is consistent with previous studies in PC and other populations.69,71,93

This study's main strengths include the use of a clinical reference criterion to assess the PHQ-9 validity criterion, the adequate participant response rate (75.3%), adequate training of interviewers, adherence to the QUADAS-2 guidelines29 and the execution of a rigorous analysis plan. In addition, the PHQ-9 was translated in accordance with the standardised guidelines for transcultural adaptation of scales. The linguistic adaptation was supported by a group of experts, guaranteeing appropriate content validity.

This study has several limitations. Firstly, our study was conducted in a PC context, therefore the results cannot be generalised to the general population, whose characteristics would produce a different response pattern.84 Secondly, the study was limited to adults. There is growing evidence that adolescents are particularly affected by depressive disorders,97 so future studies in Colombia would need to assess the psychometric performance of the PHQ-9 in this population. Thirdly, this was a cross-sectional study, and as a consequence there will be a need, in the future, to design longitudinal studies to establish the sensitivity to change of the PHQ-9 in the Colombian population, as works exist that have used in to assess response to treatment of depression.98 Fourthly, the fact that the sample was predominantly female (75%) could affect the estimates of accuracy indices, as the prevalence of depression is higher in women than in men, giving a higher number of positive cases of depression.99 And fifthly, one relative weakness in the sample size, which was calculated following the recommendations of Sanchez et al. for comparing the sensitivity of a screening test with a reference standard.30 However, other authors, such as Buderer100 and Obuchowski101 demand larger samples. On the other hand, following Bean's criteria for comparing the sensitivity or specificity of two diagnostic tests, sample sizes similar to ours are obtained.102

In keeping with global results, the Colombian version of the PHQ-9 for PC has excellent psychometric performance as a screening test, which guarantees that it can be used in contexts with few resources and with weaknesses in the healthcare system, where the availability of psychiatrists is limited.103 Among the strategies to limit the burden of mental health disorders in low and middle income countries is integration of mental health in PC.104 One of the major barriers to achieving this goal is the lack of easy-to-administer and validated screening tools to detect depression. The validation of instruments such as the PHQ-9 in these contexts can help to solve this problem.105 It is known that just screening for depression is insufficient to mitigate the growing care needs for mental health disorders in low and middle income countries; nevertheless, given that depression contributes significantly to the burden of disease, having validated screening tools is the first step towards solving this problem.106 In some low income countries, there are cost-effective depression intervention programmes, in which screening tools can be used to identify appropriate participants.107 One of the main components of effective mental health interventions in PC is monitoring depressive symptoms using simple, brief and easy-to-administer questionnaire such as the PHQ-9.108

With the validation of this version of the PHQ-9, researchers in Colombia now have valid and reliable psychometric information about depression screening in PC, which will enable the PHQ-9 to be used in studies where it is necessary to identify depressive symptoms with an appropriate COP.

In conclusion, the results of this study indicate that the Colombian version of the PHQ-9 is a valid and reliable tool for screening for depression in a PC context in Bucaramanga, with a COP of 7 or above. The psychometric properties of this version of the PHQ-9 will need to be evaluated in different populations and other regions of the country. Future studies in Colombia should assess the PHQ-9's sensitivity to change.

Funding

This work was funded by the faculty of medicine of the Universidad de Santander (UDES) and the Instituto de Salud de Bucaramanga (ISABU). Project code: PIFE0118020041816EJ.

Authors’ contribution

Carlos Arturo Cassiani-Miranda: design, scale adjustment, training of survey-takers and interviewer, collection of information, statistical analysis, digitisation, drafting and revision of the article.

Angy Karina Cuadros-Cruz: design, scale adjustment, collection of information and drafting of the article.

Harold Torres Pinzón: design, scale adjustment, statistical analysis, digitisation, drafting and revision of the article.

Orlando Scoppetta: scale adjustment, statistical analysis, drafting and revision of the article.

Jhon Henrry Pinzón-Tarrazona: collection of information, digitisation and drafting of the article.

Wendy Yulieth López-Fuentes: collection of information, digitisation and drafting of the article.

Andrea Paez: collection of information and drafting of the article.

Diego Fernando Cabanzo-Arenas: digitisation, drafting and revision of the article.

Sergio Ribero-Marulanda: collection of information, drafting and revision of the article.

Elkin René Llanes-Amaya: collection of information and drafting of the article.

Conflicts of interest

The authors have no conflicts of interest to declare.

Acknowledgements

To the expert panel for their contributions to the validation of appearance and content: Astrid I. Arrieta, Jaider A. Barros, Adalberto Campo-Arias, Mauricio Castaño, Jenny García, Luis A. Montenegro, Jorge A. Niño, Heidi C. Oviedo, Andrés M. Rangel, Jorge J. Téllez-Vargas.

References

[1]

N. Steel, J.A. Ford, J.N. Newton, A.C.J. Davis, T. Vos, M. Naghavi, et al.

Changes in health in the countries of the UK and 150 English Local Authority areas 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016.

Lancet, 392 (2018), pp. 1647-1661

http://dx.doi.org/10.1016/S0140-6736(18)32207-4 | Medline

[2]

D. Yang, J.W. Hur, Y.B. Kwak, S.W. Choi.

A systematic review and meta-analysis of applicability of web-based interventions for individuals with depression and quality of life impairment.

Psychiatry Investig, 15 (2018), pp. 759-766

http://dx.doi.org/10.30773/pi.2018.03.15 | Medline

[3]

J.F. Van Eck van der Sluijs, H. Castelijns, V. Eijsbroek, C.A.T. Rijnders, H.W.J. van Marwijk, C.M. van der Feltz-Cornelis.

Illness burden and physical outcomes associated with collaborative care in patients with comorbid depressive disorder in chronic medical conditions: a systematic review and meta-analysis.

Gen. Hosp. Psychiatry, 50 (2018), pp. 1-14

http://dx.doi.org/10.1016/j.genhosppsych.2017.08.003 | Medline

[4]

T.M. Laursen, K.L. Musliner, M.E. Benros, M. Vestergaard, T. Munk-Olsen.

Mortality and life expectancy in persons with severe unipolar depression.

J. Affect. Disord., 193 (2016), pp. 203-207

http://dx.doi.org/10.1016/j.jad.2015.12.067 | Medline

[5]

D. Brandão, L.F. Fontenelle, S.A. da Silva, P.R.P-V.M. Menezes.

Depression and excess mortality in the elderly living in low- and middle-income countries: systematic review and meta-analysis.

Int. J. Geriatr. Psychiatry, 34 (2019), pp. 22-30

http://dx.doi.org/10.1002/gps.5008 | Medline

[6]

R.C. Kessler, N.A. Sampson, P. Berglund, M.J. Gruber, A. Al-Hamzawi, L. Andrade, et al.

Anxious and non-anxious major depressive disorder in the World Health Organization World Mental Health Surveys.

Epidemiol Psychiatr Sci, 24 (2015), pp. 210-226

http://dx.doi.org/10.1017/S2045796015000189 | Medline

[7]

A.A. Muhammad Gadit, G. Mugford.

Prevalence of depression among households in three capital cities of Pakistan: need to revise the mental health policy.

PLoS ONE, 2 (2007), pp. 1-5

[8]

J. Wang, X. Wu, W. Lai, E. Long, X. Zhang, W. Li, et al.

Prevalence of depression and depressive symptoms among outpatients: a systematic review and meta-analysis.

BMJ Open, 7 (2017), pp. 1-14

[9]

C. Gómez-Restrepo, A. Bohórquez, N. Tamayo Martínez, M. Rondón, N. Bautista, H. Rengifo, et al.

Trastornos depresivos y de ansiedad y factores asociados en la población de adolescentes colombianos, Encuesta Nacional de Salud Mental 2015.

Rev Colomb Psiquiatr., 45 (2016), pp. 50-57

http://dx.doi.org/10.1016/j.rcp.2016.09.009 | Medline

[10]

M. Rueda-Sánchez, L.A. Díaz-Martínez, G.E. Rueda-Jaimes.

Prevalencia del trastorno depresivo mayor y factores asociados: un estudio poblacional en Bucaramanga (Colombia).

Rev Colomb Psiquiatr, 37 (2008), pp. 159-168

[11]

L. Cadena, P. del, L. Díaz, G. Rueda, N. Hernández, A. Campo.

Prevalencia actual del trastorno depresivo mayor en la población de Bucaramanga, Colombia.

Rev Fac Nac Salud Pública, 28 (2010), pp. 36-41

[12]

A.J. Mitchell, S. Rao, A. Vaze.

Can general practitioners identify people with distress and mild depression? A meta-analysis of clinical accuracy.

J. Affect. Disord., 130 (2011), pp. 26-36

http://dx.doi.org/10.1016/j.jad.2010.07.028 | Medline

[13]

M. Carey, S.L. Yoong, A. Grady, J. Bryant, A. Jayakody, R. Sanson-Fisher, et al.

Unassisted detection of depression by GPs: who is most likely to be misclassified?.

Fam. Pract., 32 (2015), pp. 282-287

http://dx.doi.org/10.1093/fampra/cmu087 | Medline

[14]

A.J. Mitchell, S. Rao, A. Vaze.

International comparison of clinicians’ ability to identify depression in primary care: meta-analysis and meta-regression of predictors.

Br. J. Gen. Pract., 61 (2011), pp. 72-80

http://dx.doi.org/10.3399/bjgp11X549135 | Medline

[15]

G.C. Ali, G. Ryan, M.J. de Silva.

Validated screening tools for common mental disorders in low and middle income countries: a systematic review.

PLOS ONE, 11 (2016), pp. 1-14

[16]

E.S. Paykel, A. Tylee, A. Wright, R.G. Priest, S. Rix, D. Hart.

The defeat depression campaign: psychiatry in the public arena.

Am. J. Psychiatry, 154 (1997), pp. 59-66

http://dx.doi.org/10.1176/ajp.154.6.59 | Medline

[17]

C. Mitchell, R. Dwyer, T. Hagan, N. Mathers.

Impact of the QOF and the NICE guideline in the diagnosis and management of depression: a qualitative study.

Br. J. Gen. Pract., 61 (2011), pp. 279-289

[18]

S. Gilbody, T. Sheldon, A. House.

Screening and case-finding instruments for depression: a meta-analysis.

CMAJ, 178 (2008), pp. 997-1003

http://dx.doi.org/10.1503/cmaj.070281 | Medline

[19]

A. Pettersson, K.B. Boström, P. Gustavsson, L. Ekselius.

Which instruments to support diagnosis of depression have sufficient accuracy? A systematic review.

Nord. J. Psychiatry, 69 (2015), pp. 497-508

http://dx.doi.org/10.3109/08039488.2015.1008568 | Medline

[20]

R.D. Kocalevent, A. Hinz, E. Brähler.

Standardization of the depression screener Patient Health Questionnaire (PHQ-9) in the general population.

Gen. Hosp. Psychiatry, 35 (2013), pp. 551-555

http://dx.doi.org/10.1016/j.genhosppsych.2013.04.006 | Medline

[21]

S.C. Sung, C.C.H. Low, D.S.S. Fung, Y.H. Chan.

Screening for major and minor depression in a multiethnic sample of Asian primary care patients: a comparison of the nine-item Patient Health Questionnaire (PHQ-9) and the 16-item Quick Inventory of Depressive Symptomatology – Self-Report (QIDS-SR16).

Asia-Pacific Psychiatry, 5 (2013), pp. 249-258

http://dx.doi.org/10.1111/appy.12101 | Medline

[22]

J.W. Williams, M. Pignone, G. Ramirez, C. Perez Stellato.

Identifying depression in primary care: a literature synthesis of case-finding instruments.

Gen. Hosp. Psychiatry, 24 (2002), pp. 225-237

http://dx.doi.org/10.1016/s0163-8343(02)00195-0 | Medline

[23]

L. Manea, S. Gilbody, D. McMillan.

Optimal cut-off score for diagnosing depression with the Patient Health Questionnaire (PHQ-9): a meta-analysis.

CMAJ, 184 (2012), pp. 191-196

[24]

K. Kroenke.

Enhancing the clinical utility of depression screening.

CMAJ, 184 (2012), pp. 281-282

http://dx.doi.org/10.1503/cmaj.112004 | Medline

[25]

K.A. Wittkampf, L. Naeije, A.H. Schene, J. Huyser, H.C. van Weert.

Diagnostic accuracy of the mood module of the Patient Health Questionnaire: a systematic review.

Gen. Hosp. Psychiatry, 29 (2007), pp. 388-395

http://dx.doi.org/10.1016/j.genhosppsych.2007.06.004 | Medline

[26]

D.E. Deneke, H. Schultz, T.E. Fluent.

Screening for depression in the primary care population.

Prim. Care, 41 (2014), pp. 399-420

http://dx.doi.org/10.1016/j.pop.2014.02.011 | Medline

[27]

C.A. Cassiani-Miranda, O. Scoppetta.

Factorial structure of the Patient Health Questionnaire-9 as a depression screening instrument for university students in Cartagena, Colombia.

Psychiatry Res., 269 (2018), pp. 425-429

http://dx.doi.org/10.1016/j.psychres.2018.08.071 | Medline

[28]

S. Smithson, M.P. Pignone.

Screening adults for depression in primary care.

Med. Clin. North Am., 101 (2017), pp. 807-821

http://dx.doi.org/10.1016/j.mcna.2017.03.010 | Medline

[29]

P.F. Whiting, A.W. Rutjes, M.E. Westwood, S. Mallett, J.J. Deeks, J.J. Deeks, et al.

QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies.

Ann. Intern. Med., 155 (2011), pp. 529-536

http://dx.doi.org/10.7326/0003-4819-155-8-201110180-00009 | Medline

[30]

R. Sánchez-Pedraza, J. Echeverry-Raad.

Aspectos sobre diseño y tamaño de muestra en estudios de pruebas diagnósticas.

Rev Fac Med, 49 (2001), pp. 175-180

[31]

Asociación Médica Mundial. Declaración de Helsinki de la AMM.

Principios éticos para las investigaciones médicas en seres humanos.

(2015),

[32]

Ministerio de Salud.

Resolucion 8430 de 1993.

República de Colombia: Ministerio de Salud, (1993), pp. 1-12

[33]

D.E. Beaton, C. Bombardier, F. Guillemin, M.B. Ferraz.

Guidelines for the process of cross-cultural adaptation of self-report measures.

Spine (Phila Pa 1976), 25 (2000), pp. 23186-23191

[34]

K. Kroenke, R.L. Spitzer, J.B.W. Williams.

The PHQ-9: validity of a brief depression severity measure.

J. Gen. Intern. Med., 16 (2001), pp. 606-613

http://dx.doi.org/10.1046/j.1525-1497.2001.016009606.x | Medline

[35]

R.L. Spitzer, K. Kroenke, J.B. Williams.

Validation and utility of a self-report version of PRIME-MD.

JAMA, 282 (1999), pp. 1737-1744

http://dx.doi.org/10.1001/jama.282.18.1737 | Medline

[36]

R.L. Spitzer, J.B.W. Williams, K. Kroenke, R. Hornyak, J. McMurray.

Validity and utility of the PRIME-MD Patient Health Questionnaire in assessment of 3000 obstetric-gynecologic patients: the PRIME-MD Patient Health Questionnaire Obstetrics-Gynecology Study.

Am. J. Obstet. Gynecol., 183 (2000), pp. 759-769

http://dx.doi.org/10.1067/mob.2000.106580 | Medline

[37]

C. Cassiani-Miranda, M. Vargas-Hernández, E. Pérez-Anibal, M. Herazo-Bustos, M. Hernández-Carrillo.

Confiabilidad y dimensionalidad del PHQ-9 para el cribado de sintomatología depresiva en estudiantes de ciencias de la salud de Cartagena, 2014.

Biomédica, 37 (2017), pp. 112-120

http://dx.doi.org/10.7705/biomedica.v37i0.3221 | Medline

[38]

E. Rancans, M. Trapencieris, R. Ivanovs, J. Vrublevska.

Validity of the PHQ-9 and PHQ-2 to screen for depression in nationwide primary care population in Latvia.

Ann Gen Psychiatry, 17 (2018), pp. 1-9

[39]

D.V. Sheehan, Y. Lecrubier, K.H. Sheehan, P. Amorim, J. Janavs, E. Weiller, et al.

The Mini-International Neuropsychiatric Interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10.

J. Clin. Psychiatry, 59 (1998), pp. 22-33

Medline

[40]

Y. Lecrubier, D.V. Sheehan, E. Weiller, P. Amorim, I. Bonora, K.H. Sheehan, et al.

The Mini International Neuropsychiatric Interview (MINI). A short diagnostic structured interview: reliability and validity according to the CIDI.

Eur Psychiatry, 12 (1997), pp. 224-231

[41]

J. Balázs, Y. Lecrubier, N. Csiszér, J. Koszták, I. Bitter.

Prevalence and comorbidity of affective disorders in persons making suicide attempts in Hungary: importance of the first depressive episodes and of bipolar II diagnoses.

J. Affect. Disord., 76 (2003), pp. 113-119

http://dx.doi.org/10.1016/s0165-0327(02)00084-8 | Medline

[42]

K.D. Juang, S.J. Wang, J.L. Fuh, S.R. Lu, T.P. Su.

Comorbidity of depressive and anxiety disorders in chronic daily headache and its subtypes.

Headache, 40 (2000), pp. 818-823

http://dx.doi.org/10.1046/j.1526-4610.2000.00148.x | Medline

[43]

N.R. Pinninti, H. Madison, E. Musser, D. Rissmiller.

MINI International Neuropsychiatric Schedule: clinical utility and patient acceptance.

Eur Psychiatry, 18 (2003), pp. 361364

[44]

A. Rossi, R. Alberio, A. Porta, M. Sandri, M. Tansella, F. Amaddeo.

The reliability of the Mini-International Neuropsychiatric Interview — Italian version.

J. Clin. Psychopharmacol., 24 (2004), pp. 561-563

http://dx.doi.org/10.1097/01.jcp.0000139758.03834.ad | Medline

[45]

T. Otsubo, K. Tanaka, R. Koda, J. Shinoda, N. Sano, S. Tanaka, H. Aoyama, et al.

Reliability and validity of Japanese version of the Mini-International Neuropsychiatric Interview.

Psychiatry Clin. Neurosci., 59 (2005), pp. 517-526

http://dx.doi.org/10.1111/j.1440-1819.2005.01408.x | Medline

[46]

J. Mordal, Gundersen, J.G. Bramness.

Norwegian version of the Mini-International Neuropsychiatric Interview: feasibility, acceptability and test-retest reliability in an acute psychiatric ward.

Eur Psychiatry, 25 (2010), pp. 172-177

http://dx.doi.org/10.1016/j.eurpsy.2009.02.004 | Medline

[47]

N. Kadri, M. Agoub, S. El Gnaoui, K. Mchichi Alami, T. Hergueta, D. Moussaoui.

Moroccan colloquial Arabic version of the Mini International Neuropsychiatric Interview (MINI): qualitative and quantitative validation.

Eur Psychiatry, 20 (2005), pp. 193-195

http://dx.doi.org/10.1016/j.eurpsy.2004.11.007 | Medline

[48]

J.M. De Azevedo Marques, A.W. Zuardi.

Validity and applicability of the Mini International Neuropsychiatric Interview administered by family medicine residents in primary health care in Brazil.

Gen. Hosp. Psychiatry, 30 (2008), pp. 303-310

http://dx.doi.org/10.1016/j.genhosppsych.2008.02.001 | Medline

[49]

P. Tejada, L.E. Jaramillo, R. Sánchez-Pedraza.

Revisión crítica sobre los instrumentos para la evaluación psiquiátrica en atención primaria.

Rev Fac Med, 62 (2014), pp. 101-110

[50]

A.S. Zigmond, R.P. Snaith.

The Hospital Anxiety and Depression Scale.

Acta Psychiatr. Scand., 67 (1983), pp. 361-370

http://dx.doi.org/10.1111/j.1600-0447.1983.tb09716.x | Medline

[51]

R.P. Snaith.

Availability of the hospital anxiety and depression (HAD) scale.

Br. J. Psychiatry, 161 (1992), pp. 422

http://dx.doi.org/10.1192/bjp.161.3.422b | Medline

[52]

S. Moorey, S. Greer, M. Watson, C. Gorman, L. Rowden, R. Tunmore, et al.

The factor structure and factor stability of the hospital anxiety and depression scale in patients with cancer.

Br. J. Psychiatry, 158 (1991), pp. 255-259

http://dx.doi.org/10.1192/bjp.158.2.255 | Medline

[53]

J.L. Rico, M. Restrepo, M. Molina.

Adaptación y validación de la escala hospitalaria de ansiedad y depresión (HAD) en una muestra de pacientes con cáncer del instituto nacional de cancerología de Colombia.

Avances en Medición, 3 (2005), pp. 73-86

[54]

A. Hinz, C. Finck, Y. Gómez, I. Daig, H. Glaesmer, S. Singer.

Anxiety and depression in the general population in Colombia: reference values of the Hospital Anxiety and Depression Scale (HADS).

Soc. Psychiatry Psychiatr. Epidemiol., 49 (2014), pp. 41-49

http://dx.doi.org/10.1007/s00127-013-0714-y | Medline

[55]

K. Kroenke, R.L. Spitzer, J.B.W. Williams.

The Patient Health Questionnaire-2: validity of a two-item depression screener.

Med. Care, 41 (2003), pp. 1284-1292

http://dx.doi.org/10.1097/01.MLR.0000093487.78664.3C | Medline

[56]

B. Löwe, K. Kroenke, K. Gräfe.

Detecting and monitoring depression with a two-item questionnaire (PHQ-2).

J. Psychosom. Res., 58 (2005), pp. 163-171

http://dx.doi.org/10.1016/j.jpsychores.2004.09.006 | Medline

[57]

L. Manea, S. Gilbody, C. Hewitt, A. North, F. Plummer, R. Richardson, et al.

Identifying depression with the PHQ-2: a diagnostic meta-analysis.

J. Affect. Disord., 203 (2016), pp. 382-395

http://dx.doi.org/10.1016/j.jad.2016.06.003 | Medline

[58]

M. Inagaki, T. Ohtsuki, N. Yonemoto, Y. Kawashima, A. Saitoh, Y. Oikawa, et al.

Validity of the Patient Health Questionnaire (PHQ)-9 and PHQ-2 in general internal medicine primary care at a Japanese rural hospital: a cross-sectional study.

Gen. Hosp. Psychiatry, 35 (2013), pp. 592-597

http://dx.doi.org/10.1016/j.genhosppsych.2013.08.001 | Medline

[59]

A.J. Mitchell, M. Yadegarfar, J. Gill, B. Stubbs.

Case finding and screening clinical utility of the Patient Health Questionnaire (PHQ-9 and PHQ-2) for depression in primary care: a diagnostic meta-analysis of 40 studies.

Br J Psychiatry Open, 2 (2016), pp. 127-138

[60]

SPSS Inc..

PASW Statistics for Windows.

SPSS Inc., (2009),

[61]

S. Lezama-Meneses.

Propiedades psicométricas de la escala de Zung para síntomas depresivos en población adolescente escolarizada colombiana.

Psychol Av Discip, 6 (2012), pp. 91-101

[62]

A.J. Mitchell, A. Vaze, S. Rao.

Clinical diagnosis of depression in primary care: a meta-analysis.

Lancet, 374 (2009), pp. 609-619

http://dx.doi.org/10.1016/S0140-6736(09)60879-5 | Medline

[63]

B. Löwe, R.L. Spitzer, J.B. Williams, M. Mussell, D. Schellberg, K. Kroenke.

Depression, anxiety and somatization in primary care: syndrome overlap and functional impairment.

Gen. Hosp. Psychiatry, 30 (2008), pp. 191-199

http://dx.doi.org/10.1016/j.genhosppsych.2008.01.001 | Medline

[64]

R. Stromberg, E. Wernering, A. Aberg-Wistedt, A.K. Furhoff, S.E. Johansson, L.G. Backlund.

Screening and diagnosing depression in women visiting GPs’ drop in clinic in Primary Health Care.

BMC Fam Pract, 9 (2008), pp. 1-11

http://dx.doi.org/10.1186/1471-2296-9-1 | Medline

[65]

A. Campo-Arias, H.C. Oviedo.

Propiedades psicométricas de una escala: la consistencia interna.

Rev Salud Pública, 10 (2008), pp. 831-839

http://dx.doi.org/10.1590/s0124-00642008000500015 | Medline

[66]

J.F. Lucke.

The α and the ω of congeneric test theory: an extension of reliability and internal consistency to heterogeneous tests.

Appl Psychol Meas, 29 (2005), pp. 65-81

[67]

J.L. Ventura-León.

¿Es el final del alfa de Cronbach?.

Adicciones, 31 (2019), pp. 80-81

http://dx.doi.org/10.20882/adicciones.1037 | Medline

[68]

M. Baader, J. Molina, S. Venezian, C. Rojas, R. Farías, C. Fierro-Freixeneta, et al.

Validación y utilidad de la encuesta PHQ-9 (Patient Health Questionnaire) en el diagnóstico de depresión en pacientes usuarios de atención primaria en Chile.

Rev Chil Neuropsiquiatr, 50 (2012), pp. 10-22

[69]

S. Chen, Y. Fang, H. Chiu, H. Fan, T. Jin, Y. Conwell.

Validation of the nine-item Patient Health Questionnaire to screen for major depression in a Chinese primary care population.

Asia-Pacific Psychiatry, 5 (2013), pp. 61-68

http://dx.doi.org/10.1111/appy.12063 | Medline

[70]

B.A. Kohrt, N.P. Luitel, P. Acharya, M.J. Jordans.

Detection of depression in low resource settings: validation of the Patient Health Questionnaire (PHQ-9) and cultural concepts of distress in Nepal.

BMC Psychiatry, 16 (2016), pp. 1-14

http://dx.doi.org/10.1186/s12888-015-0706-4 | Medline

[71]

B. Gelaye, M.A. Williams, S. Lemma, N. Deyessa, Y. Bahretibeb, T. Shibre, et al.

Validity of the patient health questionnaire-9 for depression screening and diagnosis in East Africa.

Psychiatry Res., 210 (2013), pp. 653-661

http://dx.doi.org/10.1016/j.psychres.2013.07.015 | Medline

[72]

J. Arrieta, M. Aguerrebere, G. Raviola, H. Flores, P. Elliott, A. Espinosa, et al.

Validity and utility of the Patient Health Questionnaire (PHQ)-2 and PHQ-9 for screening and diagnosis of depression in rural Chiapas, Mexico: a cross-sectional study.

J. Clin. Psychol., 73 (2017), pp. 1076-1090

http://dx.doi.org/10.1002/jclp.22390 | Medline

[73]

M. Lotrakul, S. Sumrithe, R. Saipanish.

Reliability and validity of the Thai version of the PHQ-9.

BMC Psychiatry, 8 (2008), pp. 1-7

http://dx.doi.org/10.1186/1471-244X-8-1 | Medline

[74]

S.I. Liu, Z.T. Yeh, H.C. Huang, F.J. Sun, J.J. Tjung, L.C. Hwang, et al.

Validation of Patient Health Questionnaire for depression screening among primary care patients in Taiwan.

Compr. Psychiatry, 52 (2011), pp. 96-101

http://dx.doi.org/10.1016/j.comppsych.2010.04.013 | Medline

[75]

K. Wittkampf, H. van Ravesteijn, K. Baas, H. van de Hoogen, A. Schene, P. Bindels, et al.

The accuracy of Patient Health Questionnaire-9 in detecting depression and measuring depression severity in high-risk groups in primary care.

Gen. Hosp. Psychiatry, 31 (2009), pp. 451-459

http://dx.doi.org/10.1016/j.genhosppsych.2009.06.001 | Medline

[76]

M.H. Chagas, V. Tumas, G.R. Rodrigues, J.P. Machado-de-Sousa, A.S. Filho, J.E. Hallak, et al.

Validation and internal consistency of patient health questionnaire-9 for major depression in parkinson's disease.

Age Ageing, 42 (2013), pp. 645-649

http://dx.doi.org/10.1093/ageing/aft065 | Medline

[77]

O.L.O. Astivia, B.D.1. Zumbo.

Population models and simulation methods: the case of the Spearman rank correlation.

Br. J. Math. Stat. Psychol., 70 (2017), pp. 347-367

http://dx.doi.org/10.1111/bmsp.12085 | Medline

[78]

W.J. Youden.

Index for rating diagnostic tests.

Cancer, 3 (1950), pp. 32-35

http://dx.doi.org/10.1186/1471-2407-3-32 | Medline

[79]

T. Xu, J. Wang, Y. Fang.

A model-free estimation for the covariate-adjusted Youden index and its associated cut-point.

Stat. Med., 33 (2014), pp. 4963-4974

http://dx.doi.org/10.1002/sim.6290 | Medline

[80]

V. Inácio de Carvalho, M. de Carvalho, A.J. Branscum.

Nonparametric Bayesian covariate-adjusted estimation of the Youden index.

Biometrics, 73 (2017), pp. 1279-1288

http://dx.doi.org/10.1111/biom.12686 | Medline

[81]

J. Yin, L. Tian.

Joint confidence region estimation for area under ROC curve and Youden index.

Stat. Med., 33 (2014), pp. 985-1000

http://dx.doi.org/10.1002/sim.5992 | Medline

[82]

C. Li, J. Chen, G. Qin.

Partial Youden index and its inferences.

J. Biopharm. Stat., 29 (2019), pp. 385-399

http://dx.doi.org/10.1080/10543406.2018.1535502 | Medline

[83]

F. Lamers, C.C. Jonkers, H. Bosma, B.W. Penninx, J.A. Knottnerus, J.T. van Eijk.

Summed score of the Patient Health Questionnaire-9 was a reliable and valid method for depression screening in chronically ill elderly patients.

J. Clin. Epidemiol., 61 (2008), pp. 679-687

http://dx.doi.org/10.1016/j.jclinepi.2007.07.018 | Medline

[84]

A. Kleinman.

Culture and depression.

N. Engl. J. Med., 351 (2004), pp. 951-953

http://dx.doi.org/10.1056/NEJMp048078 | Medline

[85]

A. Campo-Arias, H.C. Oviedo, E. Herazo.

Estigma: barrera de acceso a servicios en salud mental.

Rev Colomb Psiquiatr, 43 (2014), pp. 162-167

http://dx.doi.org/10.1016/j.rcp.2014.07.001 | Medline

[86]

M.R. Phillips, V. Pearson, F. Li, M. Xu, L. Yang.

Stigma and expressed emotion: a study of people with schizophrenia and their family members in China.

Br. J. Psychiatry, 181 (2002), pp. 488-493

http://dx.doi.org/10.1192/bjp.181.6.488 | Medline

[87]

J.E.M. Nakku, S.D. Rathod, D. Kizza, E. Breuer, K. Mutyaba, E.C. Baron, et al.

Validity and diagnostic accuracy of the Luganda version of the 9-item and 2-item Patient Health Questionnaire for detecting major depressive disorder in rural Uganda.

Glob Ment Heal, 3 (2016), pp. e20

[88]

A. Bhana, S.D. Rathod, O. Selohilwe, T. Kathree, I. Petersen.

The validity of the Patient Health Questionnaire for screening depression in chronic care patients in primary health care in South Africa.

BMC Psychiatry, 15 (2015), pp. 1-9

http://dx.doi.org/10.1186/s12888-014-0378-5 | Medline

[89]

C. Hanlon, G. Medhin, M. Selamu, E. Breuer, B. Worku, M. Hailemariam, et al.

Validity of brief screening questionnaires to detect depression in primary care in Ethiopia.

J. Affect. Disord., 186 (2015), pp. 32-39

http://dx.doi.org/10.1016/j.jad.2015.07.015 | Medline

[90]

K. Muramatsu, H. Miyaoka, K. Kamijima, Y. Muramatsu, Y. Tanaka, M. Hosaka, et al.

Performance of the Japanese version of the Patient Health Questionnaire-9 (J-PHQ-9) for depression in primary care.

Gen. Hosp. Psychiatry, 52 (2018), pp. 64-69

http://dx.doi.org/10.1016/j.genhosppsych.2018.03.007 | Medline

[91]

J. Dros, A. Wewerinke, P.J. Bindels, H.C. van Weert, B. Arroll, F. Goodyear-Smith, et al.

Validation of PHQ-2 and PHQ-9 to screen for major depression in the primary care population.

Ann Fam Med, 7 (2010), pp. 348-353

[92]

L.K. Kerr, L.D. Kerr Jr..

Screening tools for depression in primary care: the effects of culture, gender, and somatic symptoms on the detection of depression.

West. J. Med., 175 (2001), pp. 349-352

http://dx.doi.org/10.1136/ewjm.175.5.349 | Medline

[93]

W. Wang, Q. Bian, Y. Zhao, X. Li, W. Wang, J. Du, et al.

Reliability and validity of the Chinese version of the Patient Health Questionnaire (PHQ-9) in the general population.

Gen. Hosp. Psychiatry, 36 (2014), pp. 539-544

http://dx.doi.org/10.1016/j.genhosppsych.2014.05.021 | Medline

[94]

S. El-Den, T.F. Chen, Y.L. Gan, E. Wong, C.L. O’Reilly.

The psychometric properties of depression screening tools in primary healthcare settings: a systematic review.

J. Affect. Disord., 225 (2018), pp. 503-522

http://dx.doi.org/10.1016/j.jad.2017.08.060 | Medline

[95]

D.A. Grimes, K.F. Schulz.

Refining clinical diagnosis with likelihood ratios.

Lancet, 365 (2005), pp. 1500-1505

http://dx.doi.org/10.1016/S0140-6736(05)66422-7 | Medline

[96]

X. Huang, G. Qin, Y. Fang.

Optimal combinations of diagnostic tests based on AUC.

Biometrics, 67 (2011), pp. 568-576

http://dx.doi.org/10.1111/j.1541-0420.2010.01450.x | Medline

[97]

L.P. Richardson, E. McCauley, D.C. Grossman, C.A. McCarty, J. Richards, J.E. Russo, et al.

Evaluation of the Patient Health Questionnaire-9 item for detecting major depression among adolescents.

Pediatrics, 126 (2010), pp. 1117-1123

http://dx.doi.org/10.1542/peds.2010-0852 | Medline

[98]

G. Salaminios, L. Duffy, A. Ades, R. Araya, K.S. Button, R. Churchill, et al.

A randomised controlled trial assessing the severity and duration of depressive symptoms associated with a clinically significant response to sertraline versus placebo, in people presenting to primary care with depression (PANDA trial): Study protocol for randomised controlled trial.

Trials, 18 (2017), pp. 496

http://dx.doi.org/10.1186/s13063-017-2253-4 | Medline

[99]

B. Oneib, M. Sabir, N. Abda, A. Ouanass.

Epidemiological study of the prevalence of depressive disorders in primary health care in Morocco.

J Neurosci Rural Pract, 6 (2015), pp. 477

http://dx.doi.org/10.4103/0976-3147.169768 | Medline

[100]

N.M. Buderer.

Statistical methodology. I. Incorporating the prevalence of disease into the sample size calculation for sensitivity and specificity.

Acad. Emerg. Med., 3 (1996), pp. 895-900

http://dx.doi.org/10.1111/j.1553-2712.1996.tb03538.x | Medline

[101]

N.A. Obuchowski.

Sample size calculation in studies of test accuracy.

Stat. Methods Med. Res., 7 (1998), pp. 371-392

http://dx.doi.org/10.1177/096228029800700405 | Medline

[102]

A. Beam.

Strategies for improving power in diagnostic radiology research.

Am. J. Roentgenol., 159 (1992), pp. 631-637

[103]

C. Hanlon, M. Semrau, A. Alem, S. Abayneh, J. Abdulmalik, S. Docrat, et al.

Evaluating capacity-building for mental health system strengthening in low- and middle-income countries for service users and caregivers, service planners and researchers.

Epidemiol Psychiatr Sci, 27 (2018), pp. 3-10

http://dx.doi.org/10.1017/S2045796017000440 | Medline

[104]

G. Thornicroft, S. Ahuja, S. Barber, D. Chisholm, P.Y. Collins, S. Docrat, et al.

Integrated care for people with long-term mental and physical health conditions in low-income and middle-income countries.

Lancet Psychiatry, 6 (2018), pp. 174-186

http://dx.doi.org/10.1016/S2215-0366(18)30298-0 | Medline

[105]

L. Manea, S. Gilbody, D. McMillan.

A diagnostic meta-analysis of the Patient Health Questionnaire-9 (PHQ-9) algorithm scoring method as a screen for depression.

Gen. Hosp. Psychiatry, 37 (2015), pp. 67-75

http://dx.doi.org/10.1016/j.genhosppsych.2014.09.009 | Medline

[106]

P. Sharan, R. Sagar, S. Kumar.

Mental health policies in South-East Asia and the public health role of screening instruments for depression.

WHO South-East Asia J Public Heal [Internet], 6 (2017), pp. 5

[107]

C. Hanlon, A. Fekadu, M. Jordans, F. Kigozi, I. Petersen, R. Shidhaye, et al.

District mental healthcare plans for five lowand middle-income countries: commonalities, variations and evidence gaps.

Br. J. Psychiatry, 208 (2016), pp. 47-54

[108]

K. Kroenke, R.L. Spitzer, J.B. Williams, B. Löwe.

The Patient Health Questionnaire Somatic, Anxiety, and Depressive Symptom Scales: a systematic review.

Gen. Hosp. Psychiatry, 32 (2010), pp. 345-359

http://dx.doi.org/10.1016/j.genhosppsych.2010.03.006 | Medline

⋆

Please cite this article as: Cassiani-Miranda CA, Cuadros-Cruz AK, Torres-Pinzón H, Scoppetta O, Pinzón-Tarrazona JH, López-Fuentes WY, et al. Validez del Cuestionario de salud del paciente-9 (PHQ-9) para cribado de depresión en adultos usuarios de Atención Primaria en Bucaramanga, Colombia. Rev Colomb Psiquiat. 2021;50:11–21.

Indexed in:

Follow us:

Indexed in:

Follow us:

Subscribe to our newsletter