Skip to main content

Using health administrative data to model associations and predict hospital admissions and length of stay for people with eating disorders



Eating disorders are serious mental illnesses requiring a whole of health approach. Routinely collected health administrative data has clinical utility in describing associations and predicting health outcome measures. This study aims to develop models to assess the clinical utility of health administrative data in adult eating disorder emergency presentations and length of stay.


Retrospective cohort study on health administrative data in adults with eating disorders from 2014 to 2020 in Sydney Local Health District. Emergency and admitted patient data were collected with all clinically important variables available. Multivariable regression models were analysed to explore associations and to predict admissions and length of stay.


Emergency department modelling describes some clinically important associations such as decreased odds of admission for patients with Bulimia Nervosa compared to Anorexia Nervosa (Odds Ratio [OR] 0.31, 95% Confidence Interval [95%CI] 0.10 to 0.95; p = 0.04). Admitted data included more predictors and therefore further significant associations including an average of 0.96 days increase in length of stay for each additional count of diagnosis/comorbidities (95% Confidence Interval [95% CI] 0.37 to 1.55; p = 0.001) with a valid prediction model (R2 = 0.56).


Health administrative data has clinical utility in adult eating disorders with valid exploratory and predictive models describing associations and predicting admissions and length of stay. Utilising health administrative data this way is an efficient process for assessing impacts of multiple factors on patient care and predicting health care outcomes.

Peer Review reports


Eating disorders (ED) are serious mental illnesses with a complex range of medical complications and mental health comorbidities [1]. Adults with ED can present to emergency departments with acute medical instability and/or acute behavioural and psychosocial deterioration. Liaison and interface between medical and mental health care, to treat aspects of the illness and prevent mortality, is essential [2]. All EDs have an elevated mortality risk, highest with AN [3]. It is estimated that over a million Australians are living with ED, with reported incidence ranging from 4% [4] to 16.3% [5]. Furthermore, ED are frequently chronic or relapsing [6], with an average duration of illness of 15 years [4]. Therefore ED “burden of disease” and health system expenditure are high [4], exacerbated by an increasing trend of people with ED accessing the health system [7]. Furthermore, ED are complex with significant clinical heterogeneity and may be a multidimensional disorder which includes several subtypes with different neurobiological underpinnings, and often associated with several other psychiatric disorders as well as suicide risk [8, 9].

There is limited evidence for highly efficacious interventions and optimal treatment options and pathways of care [10, 11]. Health care outcomes such as length of stay and mode of separation, which is the way in which a person leaves the hospital or service setting, are widely reported and used to guide services and clinicians on clinical decision, opportunities for improving clinical care provision and resource allocation. Whilst there is a growing body of literature in ED, there remains a lack of evidence and studies on defining clear and consistent measures for outcome, severity, remission and recovery [12].

Hospital data from New South Wales (NSW) in Australia shows a progressive, greater than 2-fold increase in emergency department triage and hospital admissions for people with ED since 2011 [13]. Furthermore, the impacts from the COVID-19 pandemic are apparent in this data with a disproportionate increase in admissions to hospital in particular for under 16-year-olds since the beginning of the pandemic [13]. Recent studies confirm this ongoing trend, on the impact of COVID-19 and associated public health response on people with ED and other mental health conditions [14, 15]. There is a clear need for effective and efficient use of the current services and resources available to provide integrated and optimal care and support for people with ED, their families and carers.

In 2013, the NSW Service Plan for People with Eating Disorders was established, providing a framework for multidisciplinary health professionals to support the delivery of treatment and care for people with or at risk of developing an eating disorder. This guides the Sydney Local Health District (SLHD) Eating Disorder Service Plan (2019–2024) [16] aimed at developing and integrating mental health and medical services across the district. SLHD has an established local and tertiary eating disorder service, providing statewide outpatient and inpatient treatment. This follows a case formulation driven framework, determining the level of support required for the person at the point of care. People with ED in SLHD who present with medical instability are assessed and triaged via the emergency department and admitted to a medical ward. Following a medical admission, a patient may be admitted to the Peter Beumont Unit (Specialist ED Ward) located at Royal Prince Alfred Hospital (RPA) or attend outpatient eating disorder services or other local services determined by a multidisciplinary team. Adults with ED may also be admitted to general Mental Health wards in SLHD.

In NSW Health, there is routinely collected health administrative data, which could be utilised to examine health care outcomes, namely, the NSW Emergency Department Data Collection (EDDC) recording all emergency department presentations and NSW Admitted Patient Data Collection (APDC) recording inpatient separations (i.e. referrals post-discharge). There are limitations with health administrative data in relation to varied accuracy, validity, and coding. However, the main advantages of utilising administrative data are efficiency, representative of all admitted patients, readily available and routinely collected with large quantities and breadth of data with numerous variables collected.

While it is well established that health administrative data can be useful in predicting and identifying associated factors on health system outcomes and care needs [17,18,19,20], this has yet to be explored in ED. The overall aim of the study is to assess the clinical utility of routinely collected health administrative data. To do this, initial exploratory multivariable regression models will be analysed to identify and describe clinically plausible factors associated with admissions in emergency department and inpatient length of stay for admitted patients to hospital. Furthermore, prediction models will also be developed and validated on these health care outcome measures of patients with ED.


This is a retrospective cohort study on routinely collected health administrative data in people ≥ 16 years of age with ED. The search criteria included all consecutively selected ED primary or other diagnosis in emergency or admissions to the hospitals in SLHD from 2014 to 2020. ED diagnosis is in accordance with ICD-10 codes for admitted data or SNOMED codes with confirmed ED diagnosis in emergency department data. SNOMED codes specifies Anorexia Nervosa and Bulimia Nervosa but describes other eating disorders diagnoses as generalized “Eating Disorder” and is descriptive in nature. The years 2014 to 2020 was chosen to capture the time from stage 1 NSW Service Plan for People with Eating Disorders. The emergency department data had additional data from 2011 so this is also included. As this data was extracted in August 2021, it allowed for inclusion of all patients admitted in 2020 including people discharged in 2021.

A data variables list was developed from the data dictionaries, NSW Emergency Department Data Collection (EDDC) and NSW Admitted Patient Data Collection (APDC) datasets that are available on the CHeReL website [21]. CHeReL is managed by the NSW Ministry of Health and its function is to carry out linkage of health-related data and provide a mechanism of access to these linked data. This information and search criteria were provided to SLHD data custodians for data extraction from SLHD electronic medical records (Cerner). A Statutory Query Language (SQL) search was used to extract SLHD emergency department data (EDD) and SLHD admitted patient data (APD). The investigators then conducted data and statistical quality assurance process’ including data cleaning and manipulation prior to analysis.

This study was approved by the Human Ethics Review Committee (Royal Prince Alfred Hospital Zone) of SLHD, Australia, protocol number X21-0059.

Outcome Measures

Mode of separation is the way in which a person leaves the hospital or service setting, and this is utilised as the main outcome measure for a patient admitted or not admitted from the EDD. For the APD, the outcome measure is defined as an episode of care with length of stay to be the total length of stay from when a person is admitted to when they were discharged. It is determined that this would be the most appropriate and clinically relevant definition for this study to explore factors that impact on the total duration of a patient’s whole episode of care.

Explanatory/Predictor Variables

All variables available from routinely collected health data were considered for inclusion in the analysis. Initial data collection from the EDD and APD were reduced to clinically significant variables based on consideration of descriptive statistics and clinical relevance/importance. The APD included diagnostic variables so initially, broad diagnostic groupings or index scores such as the Charlson Comorbidities Index and Elixahauser Index were considered, however from careful exploration of the weighting and the evidence [22, 23] in the clinical context of ED, it was decided that broad weighted groupings and indexing would not be clinically appropriate or meaningful. Therefore, diagnoses were grouped into diagnostic categories. Each diagnostic category was revised multiple times taking into account descriptive statistics, ICD-10-AM categorisations and clinical context/meaning. The final diagnostic groupings consisted of 10 mental health comorbidities and 9 physical health comorbidities. (Supplementary 1)

Statistical Analysis

Data was analysed using STATA Version 15 (StataCorp, College Station, TX, USA). Alpha of 0.05 was considered statistically significant. A full model for exploratory purposes as well as a main effects model for prediction were developed for both the EDD with the binary outcome of admission or discharge and for the APD with length of stay as outcome. A multivariable logistic regression model was chosen for modelling the odds of an admission from emergency department from the EDD and a multivariable linear regression model was chosen for modelling length of stay from the APD. Due to skewness of the APD data, further sensitivity analyses were conducted using a multivariable negative binomial regression model.

Model Building Strategy

For both multivariable initial exploratory models, variable selection was based on clinical context, appropriateness and importance of categorisation and transformation of covariates in the context of statistical analyses. For both final prediction models, the model building strategy followed the “Purposeful Selection of Covariates” as outlined by Hosmer et al. [24] The general principle of the model building strategy involved careful exploration and selection of each of the covariates based on clinical relevance and importance followed by statistical analyses in a cyclical iterative process. All potential interactions were considered clinically and then statistical analyses of potential interactions were explored. Investigators representing different disciplines with expertise in their area reviewed each step of this process, until a consensus was reached.

Validation of the final main effects models for prediction included tests on residuals, heteroscedasticity, goodness of fit tests, variation inflation factors testing for multicollinearity and exploration of outliers. A regular bootstrap procedure was followed for internal validation of the predictions in both prediction models. A regular bootstrap including 100% of the sample, with 1000 repetitions was used for internal validation [25, 26].


Patient Characteristics (Modelling Admissions in Emergency)

There were 228 presentations from the EDD identified as diagnosed with ED between 2011 and 2020. The mean age is 30.0 years (SD = 14.7, range 16 to 87). Referrals were from self, family or friends (70.2%) or ‘other’ health, aged care and community services (29.8%). The most presentations to emergency department occurred in 2019, with almost half of presentations identified with a diagnosis of Anorexia Nervosa. Most patients were female (92.1%) and presented to RPA (84.6%) compared to other hospitals in SLHD. Patient characteristics are available in Table 1.

Table 1 Characteristics of Patients for modelling admissions in emergency (n = 228)

Initial Exploratory Model (Modelling Admissions in Emergency)

The multivariable model (Table 2) shows weak evidence of a difference in odds of admission across year (p = 0.089). Odds of admission in 2020 ([OR] 0.18, 95% Confidence Interval [95%CI] 0.05 to 0.64; p = 0.008) and 2012 ([OR] 0.26, 95% Confidence Interval [95% CI] 0.08 to 0.85; p = 0.026) are lower compared to 2018. This year was chosen as the reference year as data collection and policy changes were implemented in 2016, improving consistency in the years 2017 to 2019, so the median from these years was chosen. This is consistent with descriptive statistics (Table 1) and univariate analyses (Supplementary 2). There is also weak evidence of a difference in odds of admission for varying diagnosis, and the odds of admission for patients diagnosed with Bulimia Nervosa is lower ([OR] 0.31, 95% Confidence Interval [95% CI] 0.10 to 0.95; p = 0.040) compared to Anorexia Nervosa. For every unit increase in age there is increase in odds of admission ([OR] 1.04, 95% Confidence Interval [95% CI] 1.01 to 1.08; p = 0.015). Across triage categories (increasing urgency) there is decrease in odds of admission ([OR] 0.37, 95% Confidence Interval [95% CI] 0.21 to 0.66; p = 0.001) for each level of category declining in urgency. There is also difference in odds of an admission dependent on source of referral (self/family or other) and if arriving by ambulance as opposed to self-presenting to Emergency.

Table 2 Initial exploratory model (multivariable logistic regression) with outcome as odds ratio of admissions

For testing goodness of fit, the Hosmer Lemeshow Test had a p-value of 0.784, indicating the model fits well. When compared to the minimal model there is a significant difference to the multivariable model with p < 0.001, which demonstrates a better fit. These tests are also supported by the area under the receiver operating characteristic curve (AUC) of 0.788 which indicates acceptable discrimination. The pseudo R2 is 0.201 which suggests approximately 20.1% proportional improvement compared to the minimal model.

Final Prediction Model (Modelling Admissions in Emergency)

A final prediction model for predicting ED admissions in emergency department was developed with results of the model in Table 3. This has a pseudo R2 of 0.135 which suggests approximately 13.5% proportional improvement compared to the minimal model. For goodness of fit, the Hosmer Lemeshow Test has a p-value of 0.249, which provides evidence the model fits well. Furthermore, this final prediction model is internally validated by bootstrapping with 1000 repetitions. (Supplementary 3) Also, compared to the minimal model there is a significant difference to the multivariable model with p < 0.001. These tests are also supported by the AUC of 0.724 which indicates acceptable discrimination. Although not statistically significant at the 5% level of significance as a predictor (p = 0.09), ED diagnoses are retained as a variable as it is considered a clinically important variable.

Table 3 Final prediction model (multivariable logistic regression)

Patient Characteristics (Modelling Length of Stay)

There were 613 patients from the APD identified as diagnosed with ED between 2014 and 2020. The mean age is 30.7 years (SD = 13.4, range 16 to 86). Peak number of admissions were in 2019 with over two thirds of presentations identified with a diagnosis of AN. Most patients were recorded as female (93.3%) and presented to RPA (81.2%). A number of diagnostic categories were identified from grouping ICD-10-AM codes and whilst some had low prevalence, all clinically important and relevant diagnostic categories were retained for analysis. The complete characteristics are in Table 4.

Table 4 Characteristics of Patients for modelling length of stay (n = 613)

Initial Exploratory Model (Modelling Length of Stay)

There is strong evidence of a difference in average length of stay across admission years in the initial exploratory model (p = 0.003). Compared to 2018, there is evidence of an average increase of 14.29 days length of stay in 2014 (95% Confidence Interval [95% CI] 2.78 to 25.80; p = 0.015). Also, the average length of stay increased by 0.96 (95% Confidence Interval [95% CI] 0.37 to 1.55; p = 0.001) for every unit increase in diagnosis count. Increased length of stay is also associated with admissions that included a mental health ward or Specialist ED Ward. There is also evidence of a difference in length of stay dependent on the ED diagnoses (p = 0.023) and mode of separation (p < 0.001).

There is an increase in average length of stay if a patient has delirium/dementias 21.83 days (95% Confidence Interval [95% CI] 5.81 to 37.86; p = 0.008), effects of chronic malnutrition 13.77 days (95% Confidence Interval [95% CI] 8.03 to 19.50; p < 0.001), refeeding syndrome markers (hypophosphatemia/oedema) 10.93 days (95% Confidence Interval [95% CI] 3.90 to 17.96; p = 0.002), hypoglycemia 6.77 days (95% Confidence Interval [95% CI] 1.00 to 12.53; p = 0.021), and anxiety disorders 5.68 days (95% Confidence Interval [95% CI] 0.10 to 11.26; p = 0.046). In contrast there is decreased average length of stay if a patient has adjustment disorders − 11.6 days (95% Confidence Interval [95% CI] -20.06 to -3.13; p = 0.007), substance disorders − 6.66 days (95% Confidence Interval [95% CI] -11.45 to -1.88; p = 0.006), and personality disorders − 6.02 (95% Confidence Interval [95% CI] -11.45 to -0.58; p = 0.030).

The R2 for this initial exploratory model (Table 5) is 0.587, which indicates that 58.7% of variability in length of stay is explained by the variables in this model. It is observed from the validation tests that there is heteroscedasticity and some minor departure from normality of errors. However, with a large sample (n > 500), it has been shown that even the most skewed data can be analysed using least squares regression [27]. Sensitivity analyses were also performed comparing this model to negative binomial models. (Supplementary 5) Despite some differences in output, the majority of the output were similar, further supporting the linear model.

Table 5 Initial exploratory model (multivariable linear regression) with outcome as length of stay

Final Prediction Model (Modelling Length of Stay)

A final prediction model for length of stay for patients admitted with ED was developed with results of the model in Table 6. The R2 for this final prediction model is 0.562, which indicates that 56.2% of variability in length of stay is explained by inclusion of all these variables in this final prediction model. This final prediction model is internally validated by bootstrapping. (Supplementary 6) Furthermore, the same validation process and sensitivity analyses were conducted as the initial exploratory model with similar results providing support for a linear model. (Supplementary 7)


To assess the utility of routinely collected health administrative data to model associations and predict admissions and length of stay of patients being treated for ED, investigators built two models from each data set. A full multivariable model to explore associations on patients admitted from emergency (Table 2) and length of stay of admitted patients (Table 4) from the SLHD Emergency Department Data (EDD) and SLHD Admitted Patient Data (APD) respectively. Following this, a main effects model for prediction was developed (Tables 3 and 6) by process of “Purposeful Selection of Covariates”. The results were mixed, consistent with several other papers [28, 29]. There is some utility in modelling admissions in emergency with factors associated with admissions in the initial exploratory model. However, there is more utility in modelling length of stay both on associations with the initial exploratory model as well as the final prediction model.

Table 6 Final prediction model (multivariable linear regression)

This is the first of further analyses and models that will be developed to provide health services with a potentially innovative and cost-effective way to better understand risk factors and predict health care outcomes of people with ED. In turn, this could lead to standardisation of data collection and improve the utility of routinely collected health administrative data. This can provide valuable information to a more predictive approach to healthcare, effective and efficient allocation of resources and guide improvements in service delivery for people with ED, their families, and carers. It is expected that these models will be the first steps in exploring routinely collected health administrative data in eating disorders and evaluate its clinical utility by developing, evaluating, and validating exploratory and prediction models.

Modelling Admissions in Emergency

In this study modelling emergency department admissions utilising EDD, the initial exploratory model consists of 10 variables and after purposeful selection of covariates, the final prediction model consists of 4 variables. In other studies on prediction models on emergency department admissions, variables included in these models varied significantly such as diagnoses, functional status and quality of life [28, 30]. The source of data utilised also varied, including administrative, clinical and survey data [31,32,33]. It has been found that administrative or clinical data have greater predictive ability compared to survey data [28].

In the initial exploratory model (Table 2), ED diagnoses, age, triage category, source of referral and mode of arrival are observed to be statistically significant factors impacting on admission. For example, with every year increase in age there is 4% greater odds of admission (95% Confidence Interval [95% CI] 1.01 to 1.08; p = 0.015). Whilst there are limitations with the validation by Hosmer Lemeshow test, results from the model are clinically plausible and valid both in magnitude and direction. Investigators believe this contributes to the validity of the model and provides evidence administrative data can be utilised to understand factors that impact on ED patient admissions in emergency.

However, the same cannot be inferred from the final prediction model (Table 3) as it consists of only 4 variables, ED diagnoses, age, triage category and mode of arrival. Whilst each variable is clinically valid, they would not be considered adequate to predict admissions, supported by the pseudo R2 of 0.135, which is relatively low. Several other studies have shown multiple comorbidities and polypharmacy have a significant impact of health care utilisation [34, 35], which were not available in the EDD and may have improved the utility/validity of the final prediction model.

Modelling Length of Stay

The initial exploratory model for length of stay has clinically plausible and valid factors that impact on the outcome of length of stay. This initial exploratory model includes 34 variables in total in which 19 are diagnostic categorisations based on ICD-10-AM. It is clear that the APD contains much further information than EDD, including the important variable of diagnoses, which is not available in the EDD.

Diagnoses are important variables in health care outcomes and utilisation. A large number of studies on length of stay have utilised Charlson Comorbidity Index and other methods of indexing diagnoses with mixed results [31]. There are numerous adaptations of Charlson Comorbidity Index [36,37,38] however none that exist for ED and given clinical heterogeneity in this population, utilising such an index would require adaptation and validation. Therefore, diagnoses were included in a number of ways including diagnosis counts, eating disorder diagnoses and clinically appropriate and meaningful groupings and categorisations of comorbid diagnoses. Furthermore, from the initial exploratory model in the results Table 5, there is further evidence of clinical utility as a number of clinically plausible variables are associated with length of stay in addition to diagnoses related variables including admission year, type of wards admitted and mode of separation.

In the final prediction model in Table 6, diagnoses of substance disorder, personality disorder, anxiety disorder, adjustment disorder, refeeding syndrome markers (hypophosphatemia/oedema) and effects of chronic malnutrition provide predictive value. As well as the diagnoses, predictors for length of stay include diagnosis counts, ED diagnoses, type of wards admitted, marital status and socioeconomic disadvantage. The variables in the final prediction model for length of stay are all considered to be clinically plausible and valid and this is again supported statistically by the model with R2 of 0.562 as well as internal validation via bootstrapping (Supplementary 6). Based on the R2, this model performed relatively well compared to numerous other studies on prediction models for length of stay, which had R2 between 0.3 and 0.6 [29]. Many studies are now utilising machine learning models used for predicting length of stay, however the model in this study performed better than most of the machine learning models thus far [39].


The purpose of this study is to assess the clinical utility of health administrative data but there are several limitations in using these data. These data are not purposefully collected for this study or for clinical purposes. It is collected for resource utilisation and allocations with data entry initially, by administrative staff and clinicians with limited training on data entry. There is a lack of standardisation in data collection in EDD in contrast to APD, where data collection is conducted by highly trained coders reviewing all the documentation. This, in part, explains the improved validity and utility of modelling length of stay compared to admissions in emergency.

For modelling admissions in emergency, the sample size was much smaller than anticipated as the search criteria was based on SNOMED diagnosis codes. It is known that patients with ED present to emergency department for numerous physical or mental health issues such as electrolyte disturbance or self-harm, which would not have been identified and this is a known limitation of SNOMED codes [40]. Therefore, the population was defined as patients with confirmed ED diagnoses on presentation to emergency for this study to better understand limitations of EDD data with the view to inform follow up studies, further analyses and models.

From modelling length of stay, the diagnostic groupings were based on ICD-10-AM coding, which is not consistent with diagnostic criteria utilised in clinical practice and did not include severity. This could potentially impact predictive value for length of stay. However, some limitations were mitigated by meticulously reviewing each of the ICD-10-AM codes with these diagnostic groupings both statistically and clinically, with multidisciplinary clinical input in a cyclical iterative process.

Conclusions and Future Directions

Health administrative data can be utilised to describe factors that impact on health outcome measures as well as predict health outcome measures in adults with ED. Both the initial exploratory models provide useful and clinically valid information. The final prediction model for admissions in emergency is not statistically or clinically valid for prediction, but there is potential with inclusion of further variables such as diagnoses. This is demonstrated by the final prediction model for length of stay, which performed well in comparison to other studies on prediction models for length of stay. Further exploration and evaluation of this type of data with inclusion of more information from data linkage and clinical information evaluating various statistical methods are the next steps currently in progress following this study. Furthermore, future studies will also include external validation, translation and implementation with prospective studies to further validate and optimise the final prediction models.

It is also evident from this study that there would be numerous benefits to inclusion of further variables and standardising data collection. Utilising health administrative data in this way is an efficient process for assessing impacts of multiple factors on patient care and predicting their health care outcomes, which has significant potential in not only guiding effective resource allocation and utilisation but also improving care provided to people with a lived experience of ED, their families and carers. It is even more important now to use all available data to optimise outcomes and provide efficacious and cost-effective services to meet the growing demand for treatment and care in the community, especially given the ongoing increasing prevalence and impacts of COVID-19 in mental health and eating disorders.

Data Availability

The datasets generated and/or analysed during the current study are not publicly available due to privacy and ethical restrictions as per NSW Health Policy but may be available from the corresponding author on reasonable request.



Diagnostic and Statistical Manual of Mental Disorders (version 5).


Anorexia Nervosa.


Bulimia Nervosa.


Avoidant/Restrictive Food Intake Disorder.


Unspecified Feeding or eating disorder.


Other Specified Feeding or eating disorder.


Sydney Local Health District.


NSW Emergency Department Data Collection.


NSW Admitted Patient Data Collection.


SLHD Emergency Department Data.


SLHD Admitted Patient Data.


International Classification of Diseases, Tenth Revision (Australian Modification).


Socio-Economic Indexes for Australia, 2016.

5. References

  1. NSW Service. Plan for People with Eating Disorders (2013–2018)

  2. MARSIPAN: Management of Really Sick Patients with Anorexia, College Report CR189., (2014)

  3. Arcelus J, Mitchell AJ, Wales J, Nielsen S. Mortality rates in patients with anorexia nervosa and other eating disorders. A meta-analysis of 36 studies. Arch Gen Psychiatry. 2011 Jul;68(7):724–31.

  4. Deloitte on behalf of Butterfly Foundation for Eating Disorders. (2012), “Paying the Price: The economic and social impact of eating disorders in Australia.&#8221

  5. Hay P, Girosi F, Mond J. (2015) “Prevalence and sociodemographic correlates of DSM-5 eating disorders in the Australian population.” 3:19, DOI

  6. Russell J, Mulvey B, Bennett H, Donnelly B, Frig E. Harm minimization in severe and enduring anorexia nervosa. Int Rev Psychiatry. 2019 Jun;31(4):391–402.

  7. Source, NSW Admitted Patient Data Collection (APDC)., NSW Emergency Department Data Collection (EDDC) and NSW Mental Health Ambulatory Data Collection from 2011 to 2020.

  8. Carano A, De Berardis D, Campanella D, Serroni N, Ferri F, Di Iorio G, Acciavatti T, Mancini L, Mariani G, Martinotti G, Moschetta FS, Di Giannantonio M. “Alexithymia and Suicide Ideation in a Sample of Patients with Binge Eating Disorder.”Journal of Psychiatric Practice 2012 January; 18(1):p5–11, doi:

  9. Fornaro M, Daray FM, Hunter F, Anastasia A, Stubbs B, De Berardis D, Shin JI, Husain MI, Dragioti E, Fusar-Poli P, Solmi M. The prevalence, odds and predictors of lifespan comorbid eating disorder among people with a primary diagnosis of bipolar disorders, and vice-versa: systematic review and meta-analysis. Journal of Affective Disorders. 2021 Feb 1;280:409 – 31. doi:

  10. NICE (National Institute for Clinical Excellence) guidelines. (2017) Eating disorders: recognition and treatment

  11. Murray SB, Quintana DS, Loeb KL, Griffiths S, Le Grange D. Treatment outcomes for anorexia nervosa: a systematic review and meta-analysis of randomized controlled trials. Psychol Med. 2019 Mar;49(4):535–44.

  12. Wade TD. Developing consensus on the definition of remission and recovery for research. Int J Eat Disord. 2020;53(8):1204–8.

    Article  PubMed  Google Scholar 

  13. Report. Prepared by InforMH, System Information and Analytics Branch NSW Health. “Eating Disorder Dashboard NSW”. September 2021.

  14. Koreshe JMiskovic-WhealteyE, Kim M, Simeone R. Sarah Maguire “The impact of the COVID-19 pandemic on people with lived experience of an eating disorder: An Australian perspective” Australia and New Zealand Academy for Eating Disorders 2021 Perth Conference.

  15. Fisher J, Tran T, Hammarberg K, Nguyen H, Stocker R, Rowe H, Sastri J, Popplestone S, Kirkman M. “Quantifying the mental health burden of the most severe covid-19 restrictions: A natural experiment.” J Affect Disord. 2021 Oct 1;293:406–414. doi: 10.1016/j.jad.2021.06.060. Epub 2021 Jul 2. PMID: 34246949; PMCID: PMC8264352.

  16. SLHD Eating. Disorder Service Plan (2019–2024)

  17. Coleman EA, Wagner EH, Grothaus LC, Hecht J, Savarino J, Buchner DM. Predicting hospitalization and functional decline in older health plan enrollees: are administrative data as accurate as self-report? J Am Geriatr Soc. 1998;46(4):419–25.

    Article  CAS  PubMed  Google Scholar 

  18. Tropea J, LoGiudice D, Liew D, Gorelik A, Caroline Brand. Poorer outcomes and greater healthcare costs for hospitalised older people with dementia and delirium: a retrospective cohort study. Int J Geriatr Psychiatry. 2017;32(5):539–47.

    Article  PubMed  Google Scholar 

  19. Sricharoen P, Phinyo P, Patumanond J, Piyayotai D, Sittichanbuncha Y, Yuksen C. Khanchit Likittanasombat, and Ekpaiboon Cheuathonghua. “Clinical predictors influencing the length of stay in emergency department patients presenting with acute heart failure.“ Medicina 56, no. 9 (2020):434.

  20. Kaltsidis G, Bamvita J-M, Grenier G, Marie-Josée F. “Predictors of Frequent Emergency Department Utilization for Mental Health Reasons.“Journal of Behavioral Health Services & Research48, no. 2 (2021).

  21. Centre for Health Record. Linkage (CHeReL), NSW Health, accessed August 2021.

  22. Sundararajan V, Henderson T, Perry C, Muggivan A, Quan H, William A. Ghali. “New ICD-10 version of the Charlson comorbidity index predicted in-hospital mortality. J Clin Epidemiol. 2004;57(12):1288–94.

    Article  PubMed  Google Scholar 

  23. Elixhauser A, Steiner C, Robert Harris D, Rosanna M. Coffey. “Comorbidity measures for use with administrative data.“Medical care(1998):8–27.

  24. David W, Hosmer S, Lemeshow, Rodney X. Sturdivant. “Applied logistic regression. New York: Wiley; 2000.

    Google Scholar 

  25. Ewout W, Steyerberg. Clinical prediction models: a practical Approach to Development, Validation, and updating. Springer; 2019.

  26. Steyerberg EW, Frank E, Harrell JJM Jr, Borsboom MJC, Eijkemans Y, Vergouwe, Dik J. Habbema. “Internal validation of predictive models: efficiency of some procedures for logistic regression analysis. J Clin Epidemiol. 2001;54(8):774–81.

    Article  CAS  PubMed  Google Scholar 

  27. Lumley T, Diehr P, Emerson S, Chen Lu. The importance of the normality assumption in large public health data sets. Annu Rev Public Health. 2002;23(1):151–69.

    Article  PubMed  Google Scholar 

  28. Wallace E, Stuart E, Vaughan N, Bennett K, Fahey T, Susan M. Smith. “Risk prediction models to predict emergency hospital admission in community-dwelling adults: a systematic review.“ Medical care 52, no.8(2014): 751.

  29. Lu M, Sajobi T, Lucyk K, Lorenzetti D, Quan H. “Systematic review of risk adjustment models of hospital length of stay (LOS).“ Medical care 53, no. 4 (2015):355–365.

  30. Monahan AC, Sue S, Feldman. JMIR Med Inf. 2021;9(9):e30022. “Models Predicting Hospital Admission of Adult Patients Utilizing Prehospital Data: Systematic Review Using PROBAST and CHARMS.“.

  31. Roos NP, Roos LL, Mossey J, et al. Using administrative data to predict important health outcomes. Entry to hospital, nursing home, and death. Med Care. 1988;26:221–39.

    Article  CAS  PubMed  Google Scholar 

  32. Freedman JD, Beck A, Robertson B, et al. Using a mailed survey to predict hospital admission among patients older than 80. J Am Geriatr Soc. 1996;44:689–92.

    Article  CAS  PubMed  Google Scholar 

  33. Mazzaglia G, Roti L, Corsini G, et al. Screening of older community-dwelling people at risk for death and hospitalization: the Assistenza Socio-Sanitaria in Italia project. J Am Geriatr Soc. 2007;55:1955–60.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Salisbury C, Johnson L, Purdy S, et al. Epidemiology and impact of multimorbidity in primary care: a retrospective cohort study. Br J Gen Pract. 2011;61:e12–e21.

    Article  PubMed  Google Scholar 

  35. Barnett K, Mercer SW, Norbury M, et al. The epidemiology of multimorbidity in a large cross-sectional dataset: implications for health care, research and medical education. Lancet. 2012;380:37–43.

    Article  PubMed  Google Scholar 

  36. D’Hoore W, Bouckaert A, Tilquin C. Practical considerations on the use of the Charlson comorbidity index with administrative data bases. J Clin Epidemiol. 1996;49(12):1429–33.

    Article  PubMed  Google Scholar 

  37. Charlson ME, Charlson RE, Peterson JC, Spyridon S, Marinopoulos WM, Briggs. Hollenberg. “The Charlson comorbidity index is adapted to predict costs of chronic disease in primary care patients. J Clin Epidemiol. 2008;61(12):1234–40.

    Article  PubMed  Google Scholar 

  38. Hemmelgarn BR, Braden J, Manns H, Quan. Ghali. “Adapting the Charlson Comorbidity Index for use in patients with ESRD. Am J Kidney Dis. 2003;42(1):125–32.

    Article  PubMed  Google Scholar 

  39. Lequertier V, Wang T, Fondrevelle J, Augusto V, Duclos A. “Hospital Length of Stay Prediction Methods: A Systematic Review " Medical Care. 2021;59(10):929–38.

    PubMed  Google Scholar 

  40. Giannangelo K, Millar J. “Mapping SNOMED CT to ICD-10.“ In MIE, pp.83–87. 2012.

Download references


The authors would like to acknowledge the Performance Unit in Sydney Local Health District for their support and working collaboratively on the data search and collection.


This research received no specific grant from any funding agency, commercial or not-for-profit sectors.

Author information

Authors and Affiliations



M.K. initiated and lead the study involved in all aspects of the project from initiation, literature review, ethics application, forming, leading and coordinating the research team, statistical analysis, writing manuscript and submission. E.K. contributed to initiation, ethics application and review of study process. A.S. and M.H. contributed to clinical review of protocol/design, content/manuscript and analysis. K.M. contributed to study design, literature review and supervision of research with focus on statistical analysis. J.MW contributed to all aspects of the project in a supervisory role from initiation to submission. All authors reviewed and edited the manuscript.

Corresponding author

Correspondence to Marcellinus Kim.

Ethics declarations

Competing Interests

There are no competing or conflicts of interest.

Ethics approval and consent to participate

The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008. As outlined in the study protocol and ethics application, informed consent will not be attained and a waiver of consent was requested as part of the ethics application process as per the SLHD compliance process as this study is a retrospective cohort study on health administrative data. This study was approved by the Human Ethics Review Committee (Royal Prince Alfred Hospital Zone) of Sydney Local Health District (SLHD), Australia, protocol number X21-0059.

Consent for publication

Not applicable.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kim, M., Holton, M., Sweeting, A. et al. Using health administrative data to model associations and predict hospital admissions and length of stay for people with eating disorders. BMC Psychiatry 23, 326 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: