
Factor structure and measurement invariance of the Chinese version of the Center for Epidemiological Studies Depression (CES-D) scale among undergraduates and clinical patients



The Center for Epidemiologic Studies Depression scale (CESD) is widely used to screen for depressive symptoms. The purpose of the current study was to investigate the factor structure and measurement invariance of the CESD across genders and groups in a sample of Chinese undergraduates and clinical patients.


Participants included 3093 undergraduates from Hunan province and 336 patients from psychological clinics. The structure of the CESD scale was analyzed by confirmatory factor analysis (CFA). Multiple sets of CFAs were used to test measurement invariance across genders among undergraduates and clinical patients. Internal consistency reliability was also evaluated.


The five-factor model achieved satisfactory fit (in the undergraduate sample: WLSMVχ2 = 1662.385, df = 160, CFI = 0.973, TLI = 0.968, RMSEA = 0.055; in the clinical patients: WLSMVχ2 = 502.089, df = 160, CFI = 0.962, TLI = 0.955, RMSEA = 0.072). The measurement invariance of the five-factor model across genders was fully supported at each successive level of invariance. The CESD also showed acceptable internal consistency.


Due to its sound structure and measurement invariance, the five-factor model of the CESD is best suited for testing in Chinese mainland college students and clinical patients.



Depression is a common mental illness and can lead to functional impairment, disability, and even suicide [1, 2]. According to the World Health Organization (WHO), depression has become the fourth most common mental illness in the world, and it has been named the “disease of the century” [3]. Undergraduates are at high risk of depression due to pressures of employment, interpersonal communication, social pressures, lack of emotional feedback, and homework [4, 5]. A meta-analysis of studies published between January 1990 and October 2010 on depression among undergraduates and medical students reported an average depression prevalence rate of 30.6% worldwide [6]. Suicide and self-mutilation caused by depression are common in undergraduates and may be increasing [7].

The Center for Epidemiologic Studies Depression scale (CES-D), developed in 1977 by Radloff [8], is one of the most widely used self-report scales for assessing depressive symptoms. The scale items cover the major components of depressive symptoms and were designed to measure the current level of depression [9]. Thus, the CESD has been widely used in research on children, adolescents, and elderly populations, as well as on physically ill and mentally ill populations [9,10,11,12,13]. The CESD has shown good reliability (Cronbach α = 0.70–0.95, test-retest r = 0.71–0.85) and good validity in different countries [13,14,15]. The Chinese version of the CESD has been reported to be useful for assessing depression in large samples of adolescents and adults [9].

The original version of the CES-D consisted of 20 items, categorized into 4 factors: depressed affect (DA; seven items); somatic complaints (SC; seven items); interpersonal problems (IP; two items); and positive affect (PA; four items) [8]. However, others have proposed two- [16], three- [9, 17], and five-factor [15, 18] models. In 2006, Shafer conducted four separate meta-analyses based on factor analysis studies of the CES-D including a total of 22,000 participants and found that the original four-factor structure was the most suitable [19]. However, another meta-analysis by Kim et al. (2011) found that the four-factor structure of the CES-D was not appropriate among Asian participants [15]. In contrast, the four-factor model has been shown to be the best-fitting model in several Chinese factor analytic studies [10, 11]. Wang et al., however, found that a three-factor model (depressed affect, somatic complaints, and positive affect) was the best-fitting model among Chinese adolescents [9]. A confirmatory factor analysis indicated that another three-factor structure (positive affect, interpersonal problems, and depressive mood and somatic symptoms combined) had good fit in rural Chinese [20]. Thus, the best factor structure of the CES-D among Chinese participants has not yet been determined. It is important to confirm the best-fitting model of the CESD in different Chinese samples.

Given the best-fitting model, another essential issue that requires further study is whether the CESD has the same structure in different groups and whether its items have the same meaning across different groups. Previous studies have found differences in CESD scores between male and female college students [12]. A longitudinal study found that a higher percentage of male students endured different degrees of depression compared to female students [21]. In these comparative studies, it is presumed that the measurement of the construct is comparable between males and females. However, since the meaning of items may differ for males and females, it is necessary to establish the measurement invariance of the CESD between the two. Measurement invariance is defined as “a given factorial defined construct has the same measurement parameters across two or more samples (i.e. the loading, intercepts and residual matrix are equal among different groups)” [22, 23]. Without evidence of measurement invariance, it cannot be concluded that a group difference in depression reflects a true difference between groups, as the difference may be due to item bias in the scale [23]. A previous study demonstrated that the measurement invariance of the CESD across gender was acceptable in a non-clinical sample [9], but the result has not been generalized to clinical populations.

Thus, the aims of the present study were to test the factor structure and internal consistency reliability of the CESD in undergraduates and clinical patients and to explore measurement invariance of the CESD across genders among the two samples.



The undergraduate participants came from Central South University in Changsha. We recruited participants through posters and advertisements. Students who had a history of a mental disorder, a neurological disorder, or intellectual disability were excluded. A total of 3158 university students were surveyed, 10 of whom were excluded due to mental disorders and 55 of whom were excluded due to missing data. The final sample included 3093 students (57% males, 43% females), aged 18 to 22 years [Mean = 19.5, standard deviation (SD) = 1.04].

The clinical sample was drawn from 353 outpatients who had been referred for assessment and treatment in a psychological clinic of the Second Xiangya Hospital. Patients who could not understand the questions well were excluded. A total of 336 patients completed the questions, including 139 (42%) males and 197 (58%) females, aged 16 to 33 years (Mean = 24; SD = 5.7). The diagnoses in the clinical sample as a whole were major depressive disorder (38.5%), schizophrenia (10%), obsessive-compulsive disorder (11.8%), a personality disorder (7.4%), an anxiety disorder (14.7%), and other mental disorders (16.9%). The corresponding frequencies were 31.1%, 13.1%, 13.1%, 5.5%, 16.4%, and 19.1% for males and 44.4%, 7.6%, 10.7%, 8.9%, 13.3%, and 15.1% for females. There was a significant age difference between the undergraduate participants and the clinical sample (t = −9.79, p < 0.01).

The data were collected by trained psychology postgraduate researchers. All participants provided informed consent, and the Ethics Committee of the Second Xiangya Hospital of Central South University approved the study. In both groups, there were no significant demographic differences between participants who completed the CES-D and those who did not.



The CES-D consists of 20 items, including 16 negative items (“I felt depressed”, “I felt lonely”) and 4 positive items (“I was happy”, “I enjoyed life”). The four positive affect items were inversely scored when calculating the total score. Items are rated on a 4-point scale from 0 (rarely; less than 1 day) to 3 (most or all of the time; 5–7 days). Higher scores on the CES-D indicate more depressive symptoms. The Chinese version of the CES-D has been widely used in China and has been validated in previous Chinese studies [11, 20, 24].
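The scoring rule described above can be sketched in a few lines of Python. This is only an illustration: the function name is hypothetical, and the positions of the four positive items (4, 8, 12, and 16) are an assumption taken from the standard CES-D item order, since the article does not list them.

```python
# Hypothetical helper illustrating the scoring rule; not the authors' code.
# ASSUMPTION: positive-affect items sit at positions 4, 8, 12 and 16,
# as in the standard 20-item CES-D (not stated in this article).
POSITIVE_ITEMS = (4, 8, 12, 16)


def cesd_total(responses):
    """Return the CES-D total score for one respondent.

    `responses` maps item number (1-20) to a raw answer coded 0-3.
    """
    if set(responses) != set(range(1, 21)):
        raise ValueError("expected answers for items 1 through 20")
    total = 0
    for item, raw in responses.items():
        if raw not in (0, 1, 2, 3):
            raise ValueError(f"item {item}: answer must be 0, 1, 2 or 3")
        # Positive items are inversely scored so that a higher value
        # always means more depressive symptoms.
        total += 3 - raw if item in POSITIVE_ITEMS else raw
    return total


# A respondent answering 0 on every item: each positive item contributes 3
# after reversal, each negative item contributes 0.
all_zero = {item: 0 for item in range(1, 21)}
print(cesd_total(all_zero))  # prints 12
```

Under this 0 to 3 coding the total ranges from 0 to 60.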

Data analysis

Step 1: confirmatory factor analysis (CFA)

The CFAs were conducted with Mplus 7.11 to identify the best-fitting factor model of the CES-D. Given that the items have only four response categories, the robust weighted least squares estimator with mean and variance adjustment (WLSMV) was used [23, 25, 26]. Several model fit indices were used to evaluate goodness of fit: the Tucker-Lewis Index (TLI), the comparative fit index (CFI), and the root mean-square error of approximation (RMSEA) [9, 27]. According to conventional guidelines, CFI and TLI values ≥ .90 indicate acceptable model fit and values ≥ .95 indicate adequate model fit, while RMSEA values ≤ .08 indicate acceptable model fit and values ≤ .05 indicate good model fit [28, 29].
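As a rough illustration of how these cutoffs are applied, the sketch below labels each index separately. The function name and the label strings for values outside the cutoffs are assumptions made for this example; the article does not say how the three indices are combined into a single verdict, so no combined rule is attempted here.

```python
def label_fit_indices(cfi, tli, rmsea):
    """Label each fit index against the cutoffs quoted in the text.

    Hypothetical helper: the labels for out-of-range values
    ("below cutoff", "above cutoff") are illustrative wording only.
    """
    def label_incremental(value):
        # CFI and TLI: larger is better.
        if value >= 0.95:
            return "adequate"
        if value >= 0.90:
            return "acceptable"
        return "below cutoff"

    def label_rmsea(value):
        # RMSEA: smaller is better.
        if value <= 0.05:
            return "good"
        if value <= 0.08:
            return "acceptable"
        return "above cutoff"

    return {
        "CFI": label_incremental(cfi),
        "TLI": label_incremental(tli),
        "RMSEA": label_rmsea(rmsea),
    }


# Five-factor model in the undergraduate sample, as reported in the abstract.
print(label_fit_indices(cfi=0.973, tli=0.968, rmsea=0.055))
# prints {'CFI': 'adequate', 'TLI': 'adequate', 'RMSEA': 'acceptable'}
```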

Six alternative models of the CES-D that fitted well in previous studies were chosen for comparison. Model A is the original four-factor model proposed by Radloff [8], in which items load on four factors: depressed affect, somatic complaints, interpersonal problems, and positive affect. The four-factor model has been shown to be the best-fitting model in several Chinese factor analytic studies. Model B is a two-factor model comprising depressed affect and positive affect [12]: all negatively worded items are combined into the depressed affect factor, and the remaining positively worded items form the positive affect factor. This model has recently also been shown to fit well in a Chinese population. Model C is a three-factor model (depressed affect, positive affect, and interpersonal problems) that was put forward, following Kuo’s study, from an analysis of the factor structure in Chinese Americans; its results for Chinese Americans were superior to those of the three-factor model in Kuo’s study [13]. Model D is another three-factor model, proposed by Wang et al., which includes depressed affect, positive affect, and somatic complaints factors [9]. It has been shown to be the best-fitting model in Chinese adolescents. Model E and Model F are five-factor models proposed by Kim et al. for Asian populations on the basis of a meta-analysis using exploratory factor analysis (EFA) and CFA, respectively [15]. EFA is a data-driven approach, whereas CFA is a model-driven approach [15]. We therefore included two five-factor models (Model E and Model F) derived from different analytical methods. Model E contains one additional factor (alienation, AI) compared with the original four-factor structure. In Model F, one additional factor representing sorrow/grief appears that is distinct from the original depression factor. Both alienation and sorrow/grief were factors unique to the Asian population in the meta-analysis (see Table 1).

Table 1 Item Mapping for Tested Models

Step 2: internal consistency reliability

In the current study, Cronbach’s alphas (α), mean inter-item correlations (MIC) and McDonald’s Omega coefficient were used to evaluate internal consistency reliability. A Cronbach’s α coefficient above 0.70 (> 0.60 in some cases) was considered acceptable. An optimal range of 0.10–0.40 was set for the MIC.
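For readers unfamiliar with these statistics, the following sketch computes Cronbach's α and the mean inter-item correlation from a respondents-by-items matrix using only the Python standard library. The function names and the toy data are invented for illustration; this is not the software or data used in the study, and McDonald's Omega is omitted because it requires a fitted factor model.

```python
from itertools import combinations
from statistics import mean, variance


def cronbach_alpha(rows):
    """Cronbach's alpha; `rows` holds one list of item scores per respondent."""
    k = len(rows[0])
    items = list(zip(*rows))  # transpose: one tuple of scores per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - item_var_sum / total_var)


def pearson(x, y):
    """Pearson correlation between two equally long score sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def mean_inter_item_correlation(rows):
    """Average Pearson correlation over all distinct pairs of items."""
    items = list(zip(*rows))
    return mean(pearson(a, b) for a, b in combinations(items, 2))


# Invented toy data: 5 respondents, 3 items scored 0-3.
toy = [
    [0, 1, 0],
    [1, 1, 1],
    [2, 2, 1],
    [3, 2, 3],
    [1, 0, 1],
]
print(round(cronbach_alpha(toy), 3))
print(round(mean_inter_item_correlation(toy), 3))
```

Both functions assume complete data and items that vary across respondents; an item with zero variance would make the correlation undefined.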

Step 3: measurement invariance

After the most appropriate factor model was identified, Mplus 7.11 was used to analyze the model’s measurement invariance across genders. Multi-group CFA (MGCFA) was used to test invariance with a series of nested models. The MGCFA method typically considers four levels of measurement invariance: configural, weak (metric), strong (scalar), and strict. Configural invariance tests whether the latent variables have the same pattern of indicators across groups (Model 1). Weak invariance builds on the configural model and tests whether the factor loadings are equal across groups (Model 2). Strong invariance builds on the metric model and tests whether the item intercepts are equal across groups (Model 3). Strict invariance builds on the scalar model and tests whether the error variances are equal across groups (Model 4) [22]. Because tests of the change in CFI are reported to be superior to chi-square difference tests of nested models, as they are not affected by sample size [29, 30], the current study compared nested models on the basis of changes in fit indices. Measurement invariance was considered established when at least two of the following criteria were satisfied: a change in TLI < 0.01, a change in CFI < 0.01, and a change in RMSEA < 0.015 [25].
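The decision rule for moving from one invariance level to the next can be written down compactly. The sketch below is an interpretation, not the authors' procedure: the function name is hypothetical, and comparing the absolute value of each change against its cutoff is an assumption, because the text does not state how the sign of a change is treated.

```python
def invariance_supported(delta_cfi, delta_tli, delta_rmsea):
    """Return True when at least two of the three change criteria are met.

    ASSUMPTION: changes are compared in absolute value.
    """
    checks = [
        abs(delta_cfi) < 0.01,     # change in CFI
        abs(delta_tli) < 0.01,     # change in TLI
        abs(delta_rmsea) < 0.015,  # change in RMSEA
    ]
    return sum(checks) >= 2


# Changes reported in the Results for the weak-invariance step
# in the undergraduate sample.
print(invariance_supported(0.000, 0.002, -0.001))  # prints True
```

Each level is tested against the model of the previous level, so the function would be called once per step (weak, strong, strict) with the corresponding changes in fit.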

Step 4: difference test

T-tests were used to explore differences between males and females and between the clinical and non-clinical samples on the total CES-D score and each factor score. P-values < 0.05 were considered significant.
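A minimal sketch of such a comparison is given below, assuming an independent-samples t-test with pooled variance and Cohen's d based on the pooled standard deviation. The article does not specify which t-test variant or which d formula was used, so both choices, the function name, and the toy numbers are assumptions for illustration.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t_and_d(group_a, group_b):
    """Return (t, df, d) for two independent groups.

    ASSUMPTION: equal-variance (Student) t-test and pooled-SD Cohen's d.
    """
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(group_a)
                  + (n_b - 1) * variance(group_b)) / df
    diff = mean(group_a) - mean(group_b)
    t = diff / sqrt(pooled_var * (1 / n_a + 1 / n_b))
    d = diff / sqrt(pooled_var)
    return t, df, d


# Invented toy scores for two small groups.
t, df, d = pooled_t_and_d([1, 2, 3], [3, 4, 5])
print(round(t, 3), df, round(d, 3))  # prints -2.449 4 -2.0
```

The p-value would then be read from a t distribution with `df` degrees of freedom using a statistics package. The sign of t and d depends only on which group is listed first, so effect sizes are often reported in absolute value.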


Descriptive statistics

In the undergraduate sample, CES-D scores ranged from 20 to 68 (Mean = 32.37, SD = 7.81). In the clinical sample, CES-D scores ranged from 21 to 80 (Mean = 54.30, SD = 12.32).

CFA of the CES-D scale based on the hypothesized model

As shown in Table 2, Model B, Model D, and Model E fitted the data well (CFIs > 0.90, TLIs > 0.90, RMSEAs < 0.08) in the clinical sample. Model E (five-factor model) provided the best fit for the data (WLSMVχ2 = 502.089, df = 160, CFI = 0.962, TLI = 0.955, RMSEA = 0.072) in the clinical sample. As can be seen in Table 3, Model E also fitted the data well in the undergraduate sample (WLSMVχ2 = 1662.385, df = 160, CFI = 0.973, TLI = 0.968, RMSEA = 0.055). All items had factor loadings ≥ 0.40 and loaded significantly on the proposed latent factors (p < 0.01; Table 4).

Table 2 Goodness-of-fit indices of the compared models in clinical patients
Table 3 Goodness-of-fit indices of the compared models in undergraduates
Table 4 Model E (Five-Factor Model) Factor Loading

Internal consistency reliability

In both samples, the Cronbach’s α values were > 0.8 for the whole scale and > 0.6 for each dimension (Table 5). All MICs were between 0.10 and 0.40, except for the PA and IP subscales in the undergraduate sample and the DA subscale in the clinical sample. The McDonald’s Omega coefficients were > 0.9 for the whole scale and > 0.6 for each dimension (Table 5).

Table 5 Cronbach’s α values, mean inter-item correlations and McDonald’s Omega of the CESD

Measurement invariance across genders among undergraduates and clinical patients

As the five-factor model (Model E) fitted the data best in both the undergraduate and clinical samples, we chose the five-factor model to estimate measurement invariance across gender.

In the undergraduate sample, the following goodness-of-fit indices were obtained from the configural invariance test: TLI = 0.934, CFI = 0.944, RMSEA (90% CI) = 0.043 (0.040, 0.045) (see Table 6). All indices met the requirements for configural invariance. Thus, configural invariance was established and the model was used as the baseline model for the next analysis. To verify whether factor loadings were equal across gender, the weak invariance model was specified on the basis of the baseline model. All indices met the requirements for weak invariance (see Table 6). In addition, the ∆CFI, ∆TLI, and ∆RMSEA (0.000, 0.002, and −0.001, respectively) were all less than 0.01. On the basis of the previous steps, the strong invariance model was specified. All requirements for the goodness-of-fit indices in the strong invariance test were met (see Table 6). In addition, the ∆CFI, ∆TLI, and ∆RMSEA (−0.006, −0.003, and 0.001, respectively) were all less than 0.01. The strict invariance model was specified on the basis of the third step. All changes in fit indices in the strict invariance test were less than 0.01 (∆CFI = 0.000, ∆TLI = 0.003, and ∆RMSEA = −0.001); therefore, strict invariance was established in the undergraduate sample (see Table 6).

Table 6 Measurement invariance of the CESD across gender

In the clinical sample, all parameters were freely estimated in the configural invariance test, and the following fit indices were obtained: TLI = 0.948, CFI = 0.956, RMSEA (90% CI) = 0.079 (0.071, 0.086). The fit indices met the requirements, and the baseline model was established. Based on the baseline model, the changes in CFI, TLI, and RMSEA (∆CFI < 0.010, ∆TLI < 0.010, ∆RMSEA < 0.015) supported weak, strong, and strict invariance (see Table 6). Thus, the measurement invariance of the CES-D across gender in the clinical sample was established.

Difference test

In the clinical patients, females scored significantly higher than males on the AI score (t = −2.956, p < 0.01, Cohen’s d = 0.770) (Table 7). In the undergraduates, males scored significantly higher than females on the total CES-D score and on the SC, IP, and PA scores (total score: t = 2.033, p < 0.05, Cohen’s d = 0.074; SC score: t = 5.599, p < 0.01, Cohen’s d = 0.208; IP score: t = 4.092, p < 0.001, Cohen’s d = 0.150; PA score: t = 3.005, p < 0.05, Cohen’s d = 0.011). Compared with the undergraduate group, the clinical group had significantly higher scores on the total CES-D (t = −32.274, p < 0.001, Cohen’s d = 2.127) and on all five subscales (t = −18.767 to −31.676, all p < 0.001, Cohen’s d = 1.257 to 2.101) (Table 8).

Table 7 Comparison between Male and Female (Means±SD)
Table 8 Comparison between Undergraduates and clinical patients (Means±SD)


The current study aimed to explore the best factor structure and measurement invariance of the Chinese version of the CESD among undergraduates and clinical patients. The CFA results suggested that the five-factor model was best suited to the two samples. Moreover, gender invariance was well established among undergraduates and clinical patients. To our knowledge, this is the first study to explore the measurement invariance of the Chinese version of the CESD across gender in clinical patients. The CES-D also showed acceptable internal consistency in the two samples.

The two-factor, three-factor, four-factor, and five-factor models of the CES-D proposed in previous studies were all tested by CFA in the present study. In the original psychometric testing of the CES-D scale, Radloff proposed a four-factor structure comprising DA (depressed affect), PA (positive affect), SC (somatic/vegetative complaints), and IP (interpersonal problems) [8]. The current results showed that the five-factor model (Model E: DA, PA, IP, SC, and AI) had the best fit. This five-factor model differs from the original four-factor model by changing the previous factor structure and proposing a new factor, alienation (items 10, 14, and 17). Alienation is a condition in social relationships reflected by a low degree of integration or common values and a high degree of distance or isolation between individuals, or between an individual and a group of people in a community or work environment. This factor may reflect impairments in interpersonal relationships. A previous study found that the higher participants scored on thinking that others were out to harm or exploit them (alienation), the more likely they were to experience a co-occurring mood disorder. School maladjustment in relations with teachers and peers and in learning activities had indirect effects on students’ suicidal ideation through alienation and depression [31].

Prior studies have shown that the original four-factor structure of the CES-D is not suitable for Asian populations [15]. In addition, a recent study suggested that ethnic and cultural factors can lead to different CES-D factor structures [32]. Differences in the understanding of words or in culture may play an important role in producing different model structures. The population tested may also play an important role. The current study included college student participants, and the results may reflect the unique psychological characteristics, high level of education, and sensitivity of college students. This population is more likely to experience feelings of loneliness and alienation [32, 33].

The reliability and validity of the CES-D scale have been studied previously, and a three-factor structure was proposed as the most suitable model. However, the five-factor model was not included in that study. In the current study, which included both the three- and five-factor models, the CFA showed that the five-factor structure had the best fit. Accordingly, we concluded that the five-factor CES-D scale is an effective and reliable screening tool for depression.

Based on the five-factor model, we examined gender invariance among undergraduates and clinical patients. Our MGCFA confirmed good configural, weak, strong, and strict invariance of the Chinese CESD across gender in the undergraduate sample. Configural equivalence is the precondition for testing the other forms of equivalence. With the configural model as the baseline, each further equivalence test uses a nested model produced by constraining the corresponding parameters, and the test at the next higher level can proceed only if equivalence at the previous level has been established. In this study, the configural invariance of the CESD was supported, so the model could be used for the next step of equivalence testing. The establishment of weak equivalence shows that the relationship between the CESD observed indicators and the latent traits has the same meaning for men and women; that is, each item has the same unit of measurement in both groups. Moreover, the establishment of strong equivalence shows that the CESD intercepts are invariant between men and women, which means that all CESD items have the same reference point in the two groups. Finally, strict equivalence is tested on the basis of strong equivalence, and its establishment indicates that the measurement error variances are equivalent across genders. Therefore, measurement invariance between males and females among undergraduates was achieved. In the clinical sample, configural, weak, strong, and strict invariance were also supported. Thus, the results of this study confirm that the Chinese CESD has strict equivalence, indicating that the scale is effective and interpretable across gender groups among undergraduates and clinical patients.

Since the CES-D achieved measurement invariance across gender among undergraduates and clinical patients, this study further compared gender differences in CES-D total and subscale scores. Females scored significantly higher than males on the AI subscale among clinical patients, whereas males scored significantly higher than females on the SC and IP subscales among college students. According to previous research, there are some sex differences in interpersonal problems [34]. For instance, girls have been reported to have better speech skills than boys [35]. Regarding somatic complaints, Hyde found that boys are more physically and verbally aggressive than girls [36]. Boys also show more domineering, controlling, and independent behaviors and are more vindictive [35, 37]. In 2000, the American Psychiatric Association reported that boys exhibit higher rates of antisocial, narcissistic, obsessive-compulsive, paranoid, and aggression-related disorders than girls. Taken together, these findings may explain why males scored higher on SC and IP.

College males experience more pressure than college females in areas such as daily life, personal ambition, studying, love, job hunting, earning money, and interpersonal relationships [32, 35]. In addition, male college students consume more alcohol than female college students, which may lead to greater depression [38]. Moreover, studies have shown that in traditional Chinese culture, men are the economic backbone of the family and thus experience greater economic pressure [7].

While the current study provides valuable data on the factor structure and measurement invariance of the CESD, it is not without limitations. First, all undergraduate participants were from a university in Changsha, and as such the results may not fully reflect depression in college students across China. Second, a cross-sectional design was employed with no long-term follow-up. Third, only Han students were included in the sample, and therefore the results may not apply to minority students. Lastly, we considered measurement invariance only across genders, and thus measurement invariance across other factors, such as age and religion, remains unknown.


The CESD has good psychometric characteristics and measurement invariance across genders among undergraduates and clinical patients. The present study suggests that the CESD may provide reliable and valid self-reported assessments of depression among Chinese undergraduates and clinical patients.

Availability of data and materials

The datasets generated and analyzed during the current study are not publicly available due to no permission from participants to share anonymized participant data publicly but are available from the corresponding author on reasonable request.


  1. Heo M, Murphy CF, Fontaine KR, Bruce ML, Alexopoulos GS. Population projection of US adults with lifetime experience of depressive disorder by age and sex from year 2005 to 2050. Int J Geriatr Psychiatry. 2008;23(12):1266–70.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Vos T, Flaxman A, Naghavi M. Years lived with disability (YLDs) for 1160 sequelae of 289 diseases and injuries 1990–2010: a systematic analysis for the global burden of disease study 2010. Lancet. 2012;380(9859):2163–96.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Moussavi S, Chatterji S, Verdes E, Tandon A, Patel V, Ustun B. Depression, chronic diseases, and decrements in health: results from the world health surveys. Lancet. 2007;370(9590):851–8.

    Article  PubMed  Google Scholar 

  4. Auerbach RP, Mortier P, Bruffaerts R, Alonso J, Benjet C, Cuipers P, et al. WHO world mental health surveys international college student project: prevalence and distribution of mental disorders. J Abnorm Psychol. 2018;127(7):623–38.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Liu Y, Zhang N, Bao GY, Huang YB, Ji BY, Wu YL, et al. Predictors of depressive symptoms in college students: a systematic review and meta-analysis of cohort studies. J Affect Disord. 2019;244:196–208.

    Article  CAS  PubMed  Google Scholar 

  6. Ibrahim AK, Kelly SJ, Adams CE, Glazebrook C. A systematic review of studies of depression prevalence in university students. J Psychiatr Res. 2013;47(3):391–400.

    Article  PubMed  Google Scholar 

  7. Zhao SB, Zhang J. The association between depression, suicidal ideation and psychological strains in college students: a cross-National Study. Cult Med Psychiatry. 2018;42(4):914–28.

    Article  PubMed  Google Scholar 

  8. Radloff LS. The CES-D scale: a self-report depression scale for research in the general population. Appl Psychol Meas. 1977;3(1):385–401.

  9. Wang MC, Armour C, Wu Y, Ren F, Zhu XZ, Yao SQ. Factor structure of the CES-D and measurement invariance across gender in mainland Chinese adolescents. J Clin Psychol. 2013;69(9):966–79.

    Article  PubMed  Google Scholar 

  10. Li HCW, Chung OKJ, Ho KY. Center for Epidemiologic Studies Depression Scale for children: psychometric testing of the Chinese version. J Adv Nurs. 2010;66:2583–91.

    Google Scholar 

  11. Lee SW, Stewart SM, Byrne BM, Wong JPS, Ho SY, Lam TH. Factor structure of the Center for Epidemiological Studies Depression Scale in Hong Kong adolescents. J Pers Assess. 2008;90(2):175–84.

    Article  PubMed  Google Scholar 

  12. Verhoeven M, Sawyer. and Spence, S. H. The factorial invariance of the CES-D during adolescence: are symptom profiles for depression stable across gender and time? J Adolesc. 2013;36(1):181–90.

    Article  PubMed  Google Scholar 

  13. Arbona C, Burridge A, Olvera N. The Center for Epidemiological Studies Depression Scale (CES-D): measurement equivalence across gender groups in Hispanic college students. J Affect Disord. 2017;219:112–8.

    Article  PubMed  Google Scholar 

  14. Chin WY, Choi EPH, Chan KTY, Wong CKH. The psychometric properties of the Center for Epidemiologic Studies Depression Scale in Chinese primary care patients: factor structure, construct validity, reliability, sensitivity and responsiveness. PLoS One. 2015;10(8):e0135131.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Kim G, DeCoster J, Huang CH, Chiriboga DA. Race/ethnicity and the factor structure of the Center for Epidemiologic Studies Depression Scale: a meta-analysis. Cultur Divers Ethnic Minor Psychol. 2011;17(4):381–96.

    Article  Google Scholar 

  16. Rivera-Medina CL, Caraballo JN, Rodriguez-Cordero ER, Bernal G, Davila-Marrero E. Factor structure of the CES-D and measurement invariance across gender for low-income Puerto Ricans in a probability sample. J Consult Clin Psychol. 2010;78(3):398–408.

    Article  PubMed  Google Scholar 

  17. Ying YW. Depressive symptomatology among Chinese-Americans as measured by the CES-D. J Clin Psychol. 1988;44(5):739–46.

  18. Kim JH, Park EY. The factor structure of the center for epidemiologic studies depression scale in stroke patients. Top Stroke Rehabil. 2012;19(1):54–62.

    Article  PubMed  Google Scholar 

  19. Shafer AB. Meta-analysis of the factor structures of four depression questionnaires: Beck, CES-D, Hamilton, and Zung. J Clin Psychol. 2006;62(1):123–46.

    Article  PubMed  Google Scholar 

  20. Zhang J, Sun W, Kong Y, Wang C. Reality and validity of the CES-D scale in two special adult samples from rural China. Compr Psychiatry. 2012;53(8):1243–51.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Gao WJ, Ping SQ, Liu XQ. Gender differences in depression, anxiety, and stress among college students: a longitudinal study from China. J Affect Disord. 2019;263:292–300.

    Article  Google Scholar 

  22. Little TD. Mean and covariance structures (MACS) analyses of cross-cultural data: practical and theoretical issues. Multivar Behav Res. 1997;32(1):53–76.

    Article  CAS  Google Scholar 

  23. He J, Zhong X, Yao S. Factor structure of the geriatric depression scale and measurement invariance across gender among Chinese elders. J Affect Disord. 2018;238:136–41.

    Article  PubMed  Google Scholar 

  24. Jiang LJ, Wang Y, Zhang YN, Li R, Wu HL, Li CY, et al. The reliability and validity of the Center for Epidemiologic Studies Depression Scale (CES-D) for Chinese University students. Front Psychiatry. 2019;10:315.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Flora DB, Curran PJ. An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data. Psychol Methods. 2004;9(4):466–91.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Wu W, Lu Y, Tan F, Yao S, Steca P, Abela JRZ, et al. Assessing measurement invariance of the children's depression inventory in Chinese and Italian primary school student samples. Assessment. 2012;19(4):506–16.

    Article  PubMed  Google Scholar 

  27. Kline RB. Principles and practice of structural equation modeling. 3rd ed. New York: Guilford Press; 2010.

    Google Scholar 

  28. Hu L, Bentler PM. Evaluating model fifit. In: Hoyle RH, editor. Structural equation modeling: concepts, issues, and applications. Newbury Park: Sage; 1993. p. 16–99.

    Google Scholar 

  29. Cheung GW, Rensvold RB. Evaluating goodness-of-fifit indexes for testing measurement invariance. Struct Equ Model. 2002;9(2):233–55.

    Article  Google Scholar 

  30. Meade, A.W, Johnson, E.C, Braddy, P.W. (2008). Power and sensitivity of alternative fit indices in tests of measurement invariance. J Appl Psychol 93, 568–592, 3, doi:

  31. Lee YJ, Chung M. Effects of Adolescent’s Alienation,Depression, Family Environment and School Maladjustment on Suicidal Ideation. Fam Environ Res. 2010;48(8):27–37.

  32. Skriner LC, Rutgers BCC. Supplemental material for cross-ethnic measurement invariance of the SCARED and CES–D in a youth sample. Psychol Assess. 2014;26(1):332–7.

  33. Moeller RW, Seehuus M. Loneliness as a mediator for college students' social skills and experiences of depression and anxiety. J Adolesc. 2019;73:1–13.

  34. Gurtman MB, Lee DL. Sex differences in interpersonal problems: a circumplex analysis. Psychol Assess. 2009;21(4):515–27.

  35. Feingold A. Gender differences in personality: a meta-analysis. Psychol Bull. 1994;116:329–456.

  36. Hyde JS. New directions in the study of gender similarities and differences. Curr Dir Psychol Sci. 2016;16(5):259–26.

  37. Lippa R. Gender-related individual differences and psychological adjustment in terms of the big five and Circumplex model. J Pers Soc Psychol. 1995;69(6):1184–202.

  38. Dvorak RD, Lamis DA, Malone PS. Alcohol use, depressive symptoms, and impulsivity as risk factors for suicide proneness among college students. J Affect Disord. 2013;149(1–3):326–34.


We thank all the leaders, teachers, and staff who assisted with this study. Without their help, this paper would have been much harder to complete.


This study was supported by the National Natural Science Foundation of China (Grant No. 81471384), which funded data collection and analysis, and by the Fundamental Research Funds for the Central Universities of Central South University (Grant No. 2020zzts284), which funded the interpretation of data and the writing of the manuscript.

Author information

SY and XW supervised the study. LN and JH performed the analysis and wrote the paper. CC contributed to the analysis. JY substantially revised the manuscript. All co-authors revised and approved the version to be published.

Corresponding authors

Correspondence to Xiang Wang or Shuqiao Yao.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the ethics committee of Second Xiangya Hospital, Central South University. Informed consent was obtained from all subjects or, if subjects are under 18, from a parent and/or legal guardian. All methods were carried out in accordance with relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Niu, L., He, J., Cheng, C. et al. Factor structure and measurement invariance of the Chinese version of the Center for Epidemiological Studies Depression (CES-D) scale among undergraduates and clinical patients. BMC Psychiatry 21, 463 (2021).
