Validation of the 10-item Centre for Epidemiological Studies Depression Scale (CES-D-10) in Zulu, Xhosa and Afrikaans populations in South Africa

Background The 10-item Centre for Epidemiological Studies Depression Scale (CES-D-10) is a depression screening tool that has been used in the South African National Income Dynamics Study (NIDS), a national household panel study. This screening tool has not yet been validated in South Africa. This study aimed to establish the reliability and validity of the CES-D-10 in Zulu, Xhosa and Afrikaans. The CES-D-10’s psychometric properties were also compared to the Patient Health Questionnaire (PHQ-9), a depression screening tool already validated in South Africa. Methods Stratified random samples of Xhosa, Afrikaans and Zulu-speaking participants aged 15 years or older (N = 944) were recruited from Cape Town Metro and Ethekwini districts. Face-to-face interviews included socio-demographic questions, the CES-D-10, Patient Health Questionnaire (PHQ-9), and WHO Disability Assessment Schedule 2.0 (WHODAS). Major depression was determined using the Mini International Neuropsychiatric Interview. All instruments were translated and back-translated to English. Construct validity was examined using exploratory factor analysis with varimax rotation. Receiver Operating Characteristics (ROC) curves were used to investigate the CES-D-10 and PHQ-9’s criterion validity, and compared using the DeLong method. Results Overall, 6.6, 18.0 and 6.9% of the Zulu, Afrikaans and Xhosa samples were diagnosed with depression, respectively. The CES-D-10 had acceptable internal consistency across samples (α = 0.69–0.89), and adequate concurrent validity, when compared to the PHQ-9 and WHODAS. The CES-D-10 area under the Receiver Operator Characteristic curve was good to excellent: 0.81 (95% CI 0.71–0.90) for Zulu, 0.93 (95% CI 0.90–0.96) for Afrikaans, and 0.94 (95% CI 0.89–0.99) for Xhosa. A cut-off of 12, 11 and 13 for Zulu, Afrikaans and Xhosa, respectively, generated the most balanced sensitivity, specificity and positive predictive value (Zulu: 71.4, 72.6% and 16.1%; Afrikaans: 84.6%, 84.0%, 53.7%; Xhosa: 81.0%, 95.0%, 54.8%). These were slightly higher than those generated for the PHQ-9. The CES-D-10 and PHQ-9 otherwise performed similarly across samples. Conclusions The CES-D-10 is a valid, reliable screening tool for depression in Zulu, Xhosa and coloured Afrikaans populations.


Background
Major depression is one of the leading causes of disease burden worldwide [1] and has clear economic implications [2,3]. The South African Stress and Health (SASH) study, conducted between 2002 and 2004, investigated the national prevalence of mental disorders, and reported that nearly 10% of the population suffered from major depressive disorder at least once in their lifetime [4]. Yet, only one in four individuals with depression or anxiety receive treatment in South Africa [5].
Where mental health services are available, the use of indicated screening tools has been advocated as a way to detect individuals at risk of depression [6]. The timely identification of individuals displaying depressive symptoms is important, as it allows such individuals to be referred for mental health treatment services to prevent depressive symptoms from worsening into full clinical depression. The Centre for Epidemiological Studies Depression Scale (CES-D) is a 20-item screening tool, initially developed to detect depression in general populations [7]. It has been validated in a variety of settings, such as Zambia [8] and South Africa [9]. Subsequently, several shorter versions have been developed, including Andresen's 10-item version (CES-D-10), generated through item-total correlations with the original 20-item CES-D [10]. Originally validated in the older population [11,12], the CES-D-10 has good psychometric properties in both healthy and psychiatric populations [13,14], and in adolescents [15]. In Andresen's original study, a cut-off score of 8 or 10 on the CES-D-10 was identified as optimal to identify individuals at risk of depression. A few studies have since focused on the diagnostic validity of the CES-D-10, yet all were conducted in the United States or in China, and cutoffs varied considerably, from 8 to 16 [11,13,14,16]. The reliability and validity of the CES-D-10 has, however, never been investigated in South Africa.
The CES-D-10 has been used in the National Income Dynamics Study (NIDS), a South African national household panel study of 7300 households [17]. The study's first wave was conducted in 2008 and another three waves were conducted to date, one every two years. At each survey, a range of economic, health and wellbeing data were collected from all household members of 15 years of age or more. While a few studies from the first waves of NIDS used the CES-D-10 as a longitudinal measure of depression severity [18][19][20], most have used a cut-off of 10 to classify participants at high risk of depression, as suggested by Andresen [21][22][23][24].
The aim of this study was to determine the reliability and validity of the CES-D-10 in three major South African languages: Zulu, Xhosa and Afrikaans. The psychometric properties of the CES-D-10 were also compared to those of the Patient Health Questionnaire (PHQ-9) [25]; another common screening tool for depression, already validated in primary health care patients in North West Province and in Gauteng in South Africa [26,27].

Design
This validation study investigated the internal consistency, concurrent, construct and criterion validity of the CES-D-10 among Zulu, Xhosa and Afrikaans-speaking populations. These languages are the most commonly spoken in South Africa, according to the 2011 census [28]: 22.7% of the South African population speaks Zulu, 16.0% speaks Xhosa and 13.5% speaks Afrikaans. The study consisted of face-toface interviews which included (1) basic demographic and economic questions; (2) depression and functioning screening instruments; and (3) the Mini International Neuropsychiatric Interview (MINI) 6.0 (Major Depressive Episode) [29].

Demographic and socio-economic information
Basic demographic and socio-economic information covered age, gender, population group, marital status, education, employment status, personal income and assets owned. Household economic measures included type of dwelling, number of household members, as well as access to electricity, water and sanitation.

Centre for Epidemiological Studies Depression Scale (CES-D-10)
The CES-D-10 is a 10-item Likert scale questionnaire assessing depressive symptoms in the past week [10]. It includes three items on depressed affect, five items on somatic symptoms, and two on positive affect. Options for each item range from "rarely or none of the time" (score of 0) to "all of the time" (score of 3). Scoring is reversed for items 5 and 8, which are positive affect statements. Total scores can range from 0 to 30. Higher scores suggest greater severity of symptoms.
Patient Health Questionnaire -9 item (PHQ-9) The PHQ-9 is a 9-item screening measure for depression, where participants are asked to rate how often they were bothered by specific symptoms over the last two weeks [25]. Each item is scored from 0 ("Not at all") to 3 ("Nearly every day"). Higher scores indicate greater symptoms of depression. The PHQ-9 has been validated in a range of settings and populations in low and middle-income countries [30], including South Africa [26].
Mini International Neuropsychiatric Interview (MINI) 6.0, Major Depressive Episode module The presence of major depression was determined using the MINI 6.0 [29], which uses the DSM-IV criteria for major depressive episodes. It has been used as a gold standard in many cross-cultural studies, including in HIV-positive patients in South Africa [31,32].
WHO Disability Assessment Schedule (WHODAS 2.0) (12-item) Functional impairment was assessed using the WHO-DAS 2.0 [33]. It comprises 12 items with response options ranging from 'No difficulty' to 'Extreme difficulty or unable to do'. The item-response-theory (IRT) based scoring was used, as set out in the WHODAS 2.0 Manual [34]: scores are percentages, with higher percentages suggesting greater impairment. The WHODAS 2.0 has undergone extensive validation, and has good reliability and validity across cultures and population groups [34].
All sections of the questionnaire, including the MINI assessment, were translated into Xhosa, Afrikaans and Zulu, and back translated to English, by six independent translators. The research team worked with the translators to assess the accuracy of each item, and to resolve discrepancies where these arose.

Sample size
Three samples were recruited, one for each language. Given that the prevalence of individuals screening ≥10 or ≥15 on the CES-D-10 in the first wave of NIDS was 28 and 8% respectively in the Western Cape, and 32 and 5% in KwaZulu Natal, 1 it was determined that a total of 300 participants per sample would be sufficient to analyse higher CES-D-10 scores, and have enough power to assess criterion validity. The sample size of validation studies included in a recent meta-analysis of the PHQ-9 [30] usually ranged from 150 to 600. The proposed sample therefore falls within the range of validation studies considered methodologically strong.

Household sampling
Participants were recruited from two districts in South Africa: the City of Cape Town metro district and Ethekwini district in KwaZulu Natal, which encompasses both rural and urban areas. The 'small area level' (SAL) was used as the primary sampling unit from which to select households in the two districts. The SAL is the lowest level of geographic unit for which Census data is publically available, and is a manageable size in terms of population and land area. Population sizes vary across SALs, but usually range between 400 and 1000 individuals.
Only SALs classified as residential were included in the sampling base. SALs were selected for inclusion using systematic sampling, based on data from StatsSA. SALs were stratified by the most common home language, main population group (White, Black, Coloured, Indian), type of area (rural/urban) and most common income bracket. In South Africa, the term 'coloured' is not considered critical, and is used to describe an ethnic group composed primarily of persons of mixed race.
A total of six participants were recruited per SAL, with a maximum of two participants per household. The first household in each SAL was selected using a random starting point (created using a sampling algorithm on the Geographic Information System). Every third household was then selected from this starting point. Nondwelling structures, such as shops, churches and museums, were skipped. Households were still included in the three count method when members were not at home or refused to participate.
This process was repeated until six participants per SAL were reached. If this could not be reached in a particular SAL, then the nearest predetermined oversampled SAL with the same settlement pattern was attempted, in order to reach the full complement of six participants. A total of 75 SALs were selected per sample, including an additional 50% of oversampled SALs.

Participants
To be eligible, participants had to be aged 15 years or more, and be able to provide consent. Their home language had to be Xhosa, Afrikaans or Zulu, depending on the district, and be considered household members. This was defined as relatives or non-relatives who lived under the same roof or within the same compound, shared resources, and slept in the house for at least four nights a week. Live-in domestic workers and lodgers were regarded as separate households.

Training
All fieldworkers conducting the interviews received one week of training by a registered counsellor (TD), on mental illness, administration of the tools, and methodological procedures. The first part of the training consisted of general psychoeducation on symptoms of depression and available treatments, and open discussions on the fieldworkers' knowledge or experience of depression. The second part of the training included a back-translation of the translated MINI, as a cognitive testing exercise, to ensure the fieldworkers had a full understanding of the concepts of symptoms assessed in the MINI, and to ensure these corroborated with the translators' translation. Fieldworkers were then trained to administer the MINI and the other screening tools, facilitated by role plays during which inter-rater reliability was also informally assessed. The depth of training on depression, in addition to the tool itself, was essential to ensure the accuracy of the fieldworkers' diagnostic assessments, and ensure that the data collected were robust and the interpretation of results reliable. Finally, TD spent three days with each fieldwork team at the start of data collection, shadowing all interviews conducted to monitor the quality of the MINI assessment and of the accuracy of diagnoses made.

Procedure
Xhosa and Afrikaans speaking participants were recruited from the City of Cape Town metro district and Zulu speaking participants from the Ethekwini district. Each sample of 300 participants was recruited by one team, comprising of two experienced, trained fieldworkers. Aerial maps of the SALs were printed and provided to the fieldworkers to navigate the SALs. The starting point for the SAL and non-dwelling structures were indicated on the maps. Fieldworkers first approached the households to determine that the language criterion was met. If a household member was present, eligible and agreed, he or she was asked to provide a list of all eligible members in the household, even if they were not present at the time of the visit. Two participants were then randomly selected, using the Dice method: a number was assigned to each eligible household member and an 18-faced dice was thrown to select the assigned number for individuals to be recruited. Appointments were made if selected individuals in the household were not present. A missed appointment was considered as a refusal.
Data were collected electronically, with the use of mobile devices. The interview was administered by the same fieldworkers involved in the recruitment process. The CES-D-10, PHQ9 and WHODAS 2.0 were administered separately from the socio-economic section and MINI 6.0 depression module, and by a different fieldworker, to avoid response bias. Each section of the interview was conducted in a private area of the participant's home, away from other household members and the second fieldworker. Minors completed the interview in the presence of the consenting caregiver. The full interview lasted approximately 45 min.

Statistical analysis
The data collected were transferred to Stata version 13, where analyses were conducted separately for each sample. Descriptive statistics were used to describe the socio-demographic characteristics of the participants, their screening scores and depression diagnosis. A review of kurtosis and skewness suggested that none of the scores on the CES-D-10, PHQ-9 or WHODAS 2.0 were normally distributed, so non-parametric tests and medians (interquartile range; IQR) were reported throughout the analysis. Probability weights were calculated to estimate the population-level prevalence of depression, taking into account the selection of eligible SALs among the districts, and the probability of a household being selected within an SAL and of an individual being selected within a household. Nonparametric independent tests were used to compare CES-D-10 scores between depressed and non-depressed participants. The internal reliability of the CES-D-10 and PHQ-9 were assessed using Cronbach's Alpha. The CES-D-10's convergent validity was determined by assessing its correlation with the WHODAS 2.0 and the PHQ-9. An exploratory factor analysis with varimax rotation was applied to investigate the construct validity of the CES-D-10, using the Kaiser Test and scree plot to identify latent dimensions of the scale. Finally, Receiver Operating Characteristics (ROC) curves were used to examine the CES-D-10 and PHQ-9's criterion validity against the MINI 6.0. Optimal cut-off scores were identified as the best balance between sensitivity and specificity values, giving equal weight to both measures. The area under the ROC curve for the CES-D-10 was compared to that of the PHQ-9 using the DeLong method [35].

Ethical considerations
This study was approved by the University of Cape Town's Health Sciences Faculty Human Research Ethics committee (REF: 209/2016). Consent and assent forms were translated in all three languages and completed by all participants who agreed to participate. A R20 supermarket voucher was given to each participant, at the end of the interview. Participants who were diagnosed with depression were given a brochure on depression, and a list of local non-governmental organisations and toll-free numbers they could contact for counselling. Participants who reported suicidal behaviour were referred to the mental health nurse at a primary health care clinic of their choice. Suicide behaviours were considered present if participants answered 'yes' to the MINI 6.0 item ("Did you repeatedly consider hurting yourself, feel suicidal or wish that you were dead? Did you attempt suicide or plan a suicide?"), or answered 'several days' or more to PHQ9 item ("Thoughts that that you would be better off dead or of hurting yourself in some way").

Results
A total of 944 participants were recruited: 307 in the Zulu sample, 334 in the Afrikaans sample and 303 in the Xhosa sample. One participant in the Zulu sample and two in the Afrikaans sample did not complete the questionnaire (Fig. 1). The original intention was to recruit two types of population group in the Afrikaans sample: one third 'white' and two-thirds 'coloured' , to be representative of the Afrikaans population in the district. However, the rate of refusals was high among the white population, and only 39 white participants were recruited. Given that the socio-economic characteristics of the white and coloured populations differed significantly in the study, the 39 white participants and 6 participants reporting to be other than coloured, were excluded from the analysis. To increase the Afrikaans coloured sample, additional participants were recruited from the remaining oversampled Afrikaans SALs. The final sample included in the analysis was therefore 306 for the Zulu sample, 289 in the coloured Afrikaans sample and 303 in the Xhosa sample.

Sample demographic and socio-economic characteristics
The majority of participants across samples were recruited from urban (formal and informal) settlements ( Table 1). Half of the Zulu sample (53.8%), and a third of the Afrikaans sample lived in formal houses. The majority of participants in the Afrikaans sample, however, lived in government housing. A third of the Xhosa sample also reported living in government houses, and nearly half in informal dwellings. The majority of households across samples reported having access to electricity, piped water and private flush toilet facilities inside or outside the dwelling.
The majority of participants sampled were women Nearly a third of the Zulu sample reported having reached the end of high school, which was the case only for 13.6% of the participants in the Afrikaans sample and 22.4% in the Xhosa sample. A minority reported having tertiary education across samples (4.7-11.1%). A fifth of the Zulu sample and over a third of the Xhosa sample reported being employed. Another 40% in the Zulu sample and 34% in the Xhosa reported being unemployed and looking for work. The remaining participants were mainly school or university students. On the other hand, participants in the Afrikaans sample consisted mostly of 'stay at home' individuals (looking after children or home; 24.9%), unemployed (23.6%) and retired (21.8%) individuals. A third of each sample reported not receiving a personal income. Remaining participants usually indicated earning less than R5,000 (US$ 320) per month.

Prevalence of major depression
The prevalence of depression in the three samples and across demographic groups are reported in Table 2. A similar proportion of participants in the Zulu and Xhosa samples were diagnosed with depression (6.9%), but a much higher prevalence was found in the Afrikaans sample (18.0%). Only one adolescent, in the Afrikaans sample, was diagnosed with depression. None of the socio-demographic measures were associated with a diagnosis of depression in the Xhosa sample. In the Zulu sample, gender, age, marital status and employment status were associated with depression. A significantly higher proportion of participants with depression were women (90%), were aged 60 years or more (28%), were retired (39%), and were either divorced or widowed, compared to nondepressed participants (67, 8, 11 and 8%, respectively). A greater proportion of non-depressed participants reported being employed (21%) or studying (16%) compared to non-depressed participants (14 and 9% respectively).
In the Afrikaans sample, age and marital status were associated with depression, as well as dwelling type: a greater proportion of depressed participants were  Taking into account the sampling strategy, the weighted population prevalence of depression was 5.9% (95% CI 3.0-11.4) in the Zulu sample, 18.9% (95% CI 13.5%-25.7%) in the Afrikaans sample, and 6.9% (95% CI 4.0%-11.5%) in the Xhosa sample.

Concurrent validity of the CES-D-10
The correlation between the CES-D-10 and the other screening tools were all above .5 and statistically significant at the 0.001 level, besides the correlation between the CES-D-10 and the WHODAS 2.0, which was lower in the Xhosa sample (Rho = 0.37).

Construct validity of the CES-D-10
The exploratory factor analysis suggests a two-factor solution in the Zulu and Xhosa samples, explaining 42.7 and 46.7% of the variance on the CES-D-10, respectively. A one-factor model in the Afrikaans sample was identified, explaining 51.6% of the variance. Item-factor correlations are shown in Table 3. All items pertaining to negative affect and somatic symptoms loaded highly on Factor 1 (0.55-0.71), and items 5 and 8, which refer to positive affect, either loaded highly on Factor 2 in the Zulu and Xhosa samples, or less well on Factor 1 in the Afrikaans sample.

CES-D-10
The sensitivity and specificity of each cut-off point on the CES-D-10 and PHQ-9 are presented in Table 4. In the Zulu sample, a cut-off of 12 on the CES-D-10 presented the most balanced sensitivity (71.4%) and specificity (72.6%), correctly classifying 72.6% of participants. However, the positive predictive value (PPV) was very low, suggesting that only 16.1% of those who scored 12 or above on the CES-D-10 were depressed. This is due to the low prevalence depression in this sample (6.6%).
In the Xhosa sample, where the prevalence of depression was also low, a cut-off of 10 on the CES-D-10 presented the most balanced sensitivity (85.7%) and specificity (87.2%), correctly classifying 84.1% of participants. Again, the PPV was very low (33.3%). However, a higher cut-off of 13 still had adequate sensitivity (81.0%) and specificity (95.0%), though less balanced, and an acceptable PPV (54.8%), altogether correctly classifying 94.1% of the sample. Finally, in the Afrikaans sample, a cut-off of 11 on the CES-D-10 presented the most balanced sensitivity (84.6%) and specificity (84.0%), correctly classifying 84.1% of participants. The PPV was higher in this sample, with 53.7% of participants with a CES-D-10 score of 11 or above having a diagnosis of depression.

PHQ-9
Sensitivity and specificity values on the PHQ-9 were generally lower than on the CES-D-10 in all three samples. In the Zulu and Xhosa sample, a cut-off of 8 on the PHQ-9 provided the best balance of sensitivity and specificity (Zulu: 66.7 and 73.1%; Xhosa: 81.0 and 87.2%). While 72.6 and 86.8% of participants were correctly classified in the Zulu and Xhosa samples, respectively, the PPV was also low (15.4 and 32.1%, respectively). Selecting another cut-off score on the PHQ-9 did not improve the PPV without being detrimental to sensitivity or specificity values. On the other hand, a cut-off of 7 on the PHQ-9 in the Afrikaans sample provided the best balance of sensitivity (82.7%) and specificity (79.1%), correctly classifying 79.7% of participants; 46.7% of participants screening positive using this cut-off were actually depressed.

Discussion
The present study sought to investigate the reliability and validity of the CES-D-10 among three language groups in South Africa. The sampling strategy allowed us to recruit a representative sample of the Xhosa, Zulu and coloured Afrikaans population in the Cape Town Metro and Ethekwini districts, so a population weighted prevalence of depression could be calculated. The estimated population prevalence of 5.9% in the Zulu and 6.9% in the Xhosa populations are similar to the national 12-month prevalence for major depression (4.9%) reported in the SASH study [36]. The estimated population prevalence of 18% in the coloured Afrikaans population was relatively high. Interestingly, the odds of suffering from a mood disorder in Williams et al [36]'s study were also higher in the Coloured community, compared to the White or African communities, but this finding was not statistically significant. Unfortunately, too few adolescents were recruited, and almost none reported having depression, so it was not possible to estimate the population prevalence of depression among adolescents. For ethical reasons, the adolescents' caregivers were present during the interview. It is therefore possible that the very low prevalence among adolescents in this study may also have been due to a lack of confidentiality when providing their responses. Scores on the CES-D-10 differed from those reported in the first wave of NIDS. In the present study, the proportion of participants screening ≥10 and ≥15 was consistently higher in the Zulu and Afrikaans samples. The proportion of participants in the Xhosa sample scoring ≥10 (17.8%) was lower than the 28% figure reported by NIDS in the Western Cape; however the proportion scoring ≥15 was very similar (approximately 8%). The differences in CES-D-10 scores reported here may be due to the relatively small and perhaps less representative sample in this study, in comparison with the NIDS sample.
ROC curves suggested that the CES-D-10 is an adequate screening tool to identify individuals at risk of depression. AUROC values were all above the minimum value of .75, which is considered clinically significant [37]. In the Zulu sample, a cut-off of 12 on the CES-D-10 seem to be the most appropriate to indicate high risk of depression, whereas a cut-off of 11 in the Afrikaans sample and 13 in the Xhosa sample were most suitable. Alternatively, a cut-off of 12 may be appropriate in the South African context, as it provides relatively acceptable sensitivity and specificity across all three language groups.
The present study suggests that a cut-off of 10 to indicate high risk of depression, as suggested by Andresen et al. [10], may not be optimal, especially if the screening tool is to be used in clinical settings, which are already overburdened in South Africa. Indeed, if a cut-off of 10 were used, nearly half of the Zulu sample and one third of the Afrikaans sample would be considered at high risk for depression. Also, misclassification of individuals into high or low risk for depression in the Zulu sample would increase from 27.4% (at a cut-off of 12) to 40.8%, and from 5.9% (at a cut-off of 13) to 12.9% in the Xhosa sample. The difference in misclassification would be less striking in the Afrikaans sample, but would still increase from 15.9% (at a cut off of 11) to 18.0%.
The CES-D-10 performed well in relation to the PHQ-9 and the WHODAS 2.0, suggesting adequate concurrent validity. The internal consistency of the CES-D-10 was also acceptable in the Afrikaans and Xhosa samples, though slightly lower in the Zulu sample. The internal reliability and exploratory factor analysis both suggest that the two positive affect items do not fit well with the other items in the tool, and constitute a second Table 4 Optimal cut-off scores on the CES-D-10 and PHQ-9 for the detection of major depressive disorder dimension. This supports previous evidence on the internal structure of the CES-D-10 among adolescents [15] and the older population in Asia [12,13], suggesting that the CES-D-10 consists of a depressed affect dimension (including somatic symptoms) and positive affect dimension. In addition, item 5 (hopefulness) consistently performed poorly in comparison to item 8 in the present study. This was also reported in Bradley et al [15]'s validation study among adolescents, where they cautioned about conceptualising hopefulness as a positive affect concept. The order of the questions may also have explained the difference in the performance of the two items, as participants may have been confused by the first positive statement, after having answered a series of negative statements.
In comparison to the original 20-item version of the CES-D, the CES-D-10 has clear benefits. The present study suggests that the shorter version of the tool is still reliable and valid in assessing clinically significant depressive symptoms among the general Xhosa, Zulu and Afrikaans populations in South Africa. As a shorter instrument, the CES-D-10 is less time consuming to administer and therefore more feasible to use in both research and clinical settings, such as part of larger screening activities integrated in health services to identify and refer at-risk individuals.
Overall, the PHQ-9 also performed well across the three samples. A cut-off of 7 or 8 on the PHQ-9 in the present study was lower than the cut-off of 9 suggested among chronic care patients in the Northwest Province of south Africa [26] and a cut-off of 10 identified among patients attending a high HIVburdened primary care clinic in Johannesburg [27]. Altogether, the psychometric properties of the CES-D-10 and PHQ-9 were similar across all three samples. Though the AUROC was only slightly higher for the CES-D-10 in the Afrikaans sample, the cutoffs identified on the CES-D-10 consistently generated higher sensitivity, specificity and PPV compared to the PHQ-9 cut-offs. Results therefore suggest that the performance of the CES-D-10 as a screening tool is on par with, if not slightly stronger than the PHQ-9.
Given that both instruments comprise of 10 items and should take the same time to administer, the authors recommend using either screening tool to assess depressive symptoms in the future. Despite the CES-D-10 having slightly stronger psychometric properties than the PHQ-9 in the present study, the poorer performance of the two CES-D-10 positive affect items may lead researchers and clinicians to give preference to the PHQ-9, however.

Limitations
Limitations of the study should be noted. First, it was not possible to assess the cultural differences in the CES-D-10's performance among the Afrikaans-speaking population in the Western Cape, given the small number of non-coloured individuals recruited. The difficulty in recruiting from the white population was also noted in previous waves of NIDS, and is not specific to the present study. However, given the differences in sociodemographic and economic characteristics found across the different Afrikaans-speaking groups in this study, the exclusion of non-coloured participants from the analysis meant that the remaining Afrikaans sample was more culturally homogeneous, and stronger interpretations of the findings could be made for this specific population.
The different cut-offs identified across the three samples suggest that generalising the present findings to other linguistic groups within South Africa may not be possible. Great care was taken in the translation and back-translation of the tool, so it is unlikely that any differences found between the three samples were due to translation errors. Instead, these are likely to reflect differences in the perception and experience of depression, and in the idioms of distress used among different South African populations [38].
Second, the prevalence of depression measured by the MINI 6.0 was relatively low in the Zulu and Xhosa samples, which weakens the inferences that can be made based on the results. Nonetheless, the results corroborate previous evidence on the internal structure of the scale, suggesting that psychometric properties of the CES-D-10 in these samples are reasonably robust.
Finally, despite the sampling methodology, gender was not proportionally distributed, and samples consisted predominantly of women. Results suggest women reported higher CES-D-10 scores and were more likely to be depressed compared to men in the Afrikaans and Xhosa samples. This finding corroborates results from the SASH study suggesting that women had a higher lifetime prevalence of depression or other mood disorders in South Africa [4]. This likely inflated the overall sample and estimated population prevalence of depression reported in this study, and may have affected the performance of the CES-D-10 in relation to the MINI. However, the gender distribution were relatively consistent across the three samples, so this cannot explain the disproportion in the prevalence of major depression reported among the Zulu, Xhosa and Afrikaans samples.

Conclusion
The findings suggest that the CES-D-10 has good psychometric properties in Zulu, coloured Afrikaans and Xhosa-speaking populations, similar to those of the PHQ-9. The CES-D-10 is therefore an adequate tool to