Abstract
Background: Cardiac surgery is resource intensive and often requires multidisciplinary involvement to facilitate discharge. To facilitate evidence-based resource planning, we derived and validated clinical models to predict postoperative hospital length of stay (LOS).
Methods: We used linked, population-level databases with information on all Ontario residents and included patients aged 18 years or older who underwent coronary artery bypass grafting, valvular or thoracic aorta surgeries between October 2008 and September 2019. The primary outcome was hospital LOS. The models were derived by using patients who had surgery before Sept. 30, 2016, and validated after that date. To address the rightward skew in LOS data and to identify top-tier resource users, we used logistic regression to derive a model to predict the likelihood of LOS being more than the 98th percentile (> 30 d), and γ regression in the remainder to predict continuous LOS in days. We used backward stepwise variable selection for both models.
Results: Among 105 193 patients, 2422 (2.3%) had an LOS of more than 30 days. Factors predicting prolonged LOS included age, female sex, procedure type and urgency, comorbidities including frailty, high-risk acute coronary syndrome, heart failure, reduced left ventricular ejection fraction and psychiatric and pulmonary circulatory disease. The C statistic was 0.92 for the prolonged LOS model and the mean absolute error was 2.4 days for the continuous LOS model.
Interpretation: We derived and validated clinical models to identify top-tier resource users and predict continuous LOS with excellent accuracy. Our models could be used to benchmark clinical performance based on expected LOS, rationally allocate resources and support patient-centred operative decision-making.
Cardiac surgery is resource intensive. It carries a higher burden of complications, requires intensive postoperative monitoring and involves an often longer hospital length of stay (LOS) as compared with non-cardiac surgery.1 With steady improvements in surgical technique and perioperative care, cardiac surgery is increasingly being offered to frail and complex patients with higher resource needs.2,3 The drive by many organizations for operational efficiency and competing capacity needs during the COVID-19 pandemic makes evidence-based triaging and resource allocation, founded on real-world data, an urgent priority. Prediction of intensive care unit (ICU) LOS after cardiac surgery4–6 is important but does not fully reflect the extent of resources needed. Nonetheless, although risk factors have been identified for prolonged postoperative hospital LOS, few models are available to predict this important metric, and none are able to estimate continuous postoperative LOS with accuracy. Further, though existing models include those from the Society of Thoracic Surgeons and the EuroSCORE data sets,7–9 they were developed to predict perioperative death and end organ morbidity and were only later validated in single-centre data sets for the purpose of predicting prolonged LOS in a binary fashion instead of estimation of continuous LOS. To better inform health resource planning, we derived and externally validated clinical models using population-based data to identify top-tier resource users and to predict continuous hospital LOS after cardiac surgery.
Methods
We conducted a population-based, retrospective cohort study in Ontario between Oct. 1, 2008, and Sept. 30, 2019. Ontario is the most populous province in Canada, with about 14.6 million residents, and is ethnically diverse.10 Ontario operates a publicly funded health care system that reimburses all covered services and clinicians. All centres in Ontario performing cardiac surgical procedures were included.
The conduct and reporting of this study is in accordance with TRIPOD (Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis) Statement11 reporting guideline.
Participants
The population included all adult patients aged 18 years or older who underwent coronary artery bypass grafting (CABG), aortic, mitral or tricuspid valve, or thoracic aorta surgery. These procedures represent the majority of cardiac surgeries performed in the United States and are frequently the target for risk prediction models in the cardiac surgical literature.12 For patients with multiple cardiac procedures during the study period, only the index procedure was included in the analyses.
Data source
The data set from this study is held securely in coded form at ICES (formerly the Institute for Clinical Evaluative Sciences), which holds multiple population-based health databases of Ontario residents. CorHealth Ontario maintains a prospective registry of all patients who undergo invasive cardiac procedures in Ontario and regularly undergoes selected chart audits and core laboratory validation.13
Data sets were linked deterministically using unique confidential identifiers and were analyzed at ICES.14–17 These included the CorHealth Ontario registry (patient and procedural details) with population-level administrative health care databases including the Canadian Institute for Health Information Discharge Abstract Database (comorbidities, hospital admissions and in-hospital procedures), the Ontario Health Insurance Plan database (physician service claims) and the Registered Persons Database (vital statistics).17–20 These administrative databases have been validated for many outcomes, exposures and comorbidities, including heart failure, chronic obstructive pulmonary disease, asthma, hypertension and diabetes.21–24
Covariates
Potential covariates considered in the analyses included patient sociodemographics, comorbidities and hospital characteristics. We estimated each patient’s socioeconomic status by using the neighbourhood median income from the Canadian Census,25 and determined residence status (rural v. urban) using Statistics Canada definitions.26 The type of hospital (teaching and community) was determined by the Public Hospitals Act, where teaching hospitals had formal agreements with a university and postgraduate training accreditation body.27 Further variables included physiologic, anatomic and comorbidity data, as well as procedure-specific information (operative priority status, redo sternotomy, type of surgery and surgery duration). We obtained data on height, weight, operative priority and information pertaining to left ventricular ejection fraction, valvular disease and coronary anatomy from the CorHealth Ontario registry. In addition, we identified comorbidities from the CorHealth Ontario registry, which we supplemented with data from the Discharge Abstract Database and the Ontario Health Insurance Plan database using the enhanced Canadian version of the International Classification of Diseases and Related Health Problems, 10th Revision codes28 within 5 years before surgery, according to validated algorithms.21,23,29,30 Frailty is defined by the Hospital Frailty Risk Score, a multidimensional frailty instrument that was validated using routinely collected hospital admissions data of patients aged 75 years or older from the United Kingdom.31,32 This score ranges between 0 and 173.1 points and is stratified by low risk (< 5), intermediate risk3–13 and high risk (> 15).31
Outcome
The primary outcome was postoperative hospital LOS, from the date of surgery to the date of index hospital discharge. We derived 2 separate models: the binary outcome model to identify the top-tier resource users (i.e., LOS exceeding the 98th percentile value of > 30 d) and the continuous outcome model to predict the actual LOS in days in the remainder of the cohort. The rationale for our modelling approach was twofold. First, as LOS data are invariably right-skewed with extreme values in patients with prolonged stay,33,34 the choice of 98th percentile (LOS > 30 d) as the cut-off point eliminates extreme values and increases the precision of continuous LOS prediction. Second, patients with LOS of more than 30 days are among the highest resource users and are most likely to have impaired quality of life after eventual discharge.13,35–38 Thus, the ability to predict prolonged LOS will allow patients to be informed partners in the therapeutic decision-making process.
Statistical analysis
We compared continuous variables using a 2-sample t test or Wilcoxon rank-sum test where appropriate, and categorical variables using a χ2 test. Between-group differences were considered to be small if the absolute standardized difference for a given variable was less than 10%.39
Missing data
The left ventricular ejection fraction was missing in 3582 (3.4%), rurality status in 87 (0.08%), income quintile in 272 (0.26%), glomerular filtration rate in 4671 (4.4%), body mass index (BMI) in 5583 (5.3%), surgery duration in 1317 (1.2%) and operative priority in 12 060 (11.5%) patients. We imputed these missing values once within the SAS PROC MI framework, where they were predicted drawing on all candidate covariates using predictive mean matching for continuous variables and logistic regression for categorical variables.40
Model development and validation
We split the cohort temporally into derivation and validation data sets, such that the cohort who underwent cardiac surgery before Sept. 30, 2016, was used to derive the models and the remainder of the cohort was used to externally validate these models. We predicted prolonged hospital LOS using logistic regression and continuous hospital LOS using γ regression. For each model, we selected predictor variables using a backward-stepwise algorithm with a significance threshold (p < 0.10 for entry and p < 0.05 for retention in the model).41 For continuous variables, we examined their association with the outcome using cubic splines. Most of these variables were entered into the models as continuous values, whereas BMI violated the linearity assumption and was entered as a spline term.
Model evaluation
The discrimination of the binary outcome model was evaluated using the C statistic, and its discrimination and calibration was assessed using the Brier score,42 as well as plots of observed compared with predicted rates within vigintiles of predicted risk in the validation cohort. The performance of the continuous outcome model was assessed using the mean absolute error (MAE), as well as plots of mean observed compared with predicted LOS in days within each vigintile of predicted LOS in patients from the validation cohort with high likelihood of LOS of more than 30 days (i.e., probability exceeding the optimal cut-off value as determined by the Youden Index).
We performed the analysis using SAS version 9.4 (SAS Institute) and defined statistical significance by a 2-sided p value of less than 0.05.
Ethics approval
An independent, nonprofit research institute, ICES holds legal status under Ontario’s health information privacy law, which allows it to collect and analyze health care and demographic data, without consent, for health system evaluation and improvement. The use of data was authorized under section 45 of Ontario’s Personal Health Information Protection Act, which does not require review by a research ethics board.13
Results
Among 105 193 patients (73 870 comprising the derivation cohort and 28 901 comprising the validation cohort), 2422 (2.3%) had prolonged hospital LOS of more than 30 days (Table 1). The distribution of hospital LOS in the derivation and validation cohorts are illustrated in Appendix 1, available at www.cmajopen.ca/content/11/1/E180/suppl/DC1. The median LOS was 46 (interquartile range [IQR] 37–66) days for patients with prolonged LOS and 6 (IQR 5–8) days for the remainder of the cohort. Patient characteristics were notably different between groups (Table 1). A minority of the patients in either group were female, though females were more likely than males to have prolonged hospital LOS. Patients with prolonged hospital LOS were more likely to be older, with a 6-year median age difference noted, and reside in a lower income neighbourhood, and were more likely to present urgently and emergently for complex procedures (re-do sternotomy, CABG + valve[s], multiple valves and thoracic aorta surgery) of longer surgical duration and at teaching hospitals. They were more likely to have a higher multimorbidity burden, including frailty, in addition to cardiovascular comorbidities such as recent myocardial infarction, reduced left ventricular ejection fraction and higher New York Heart Association classification. In addition, prolonged hospital LOS was associated with longer mechanical ventilation and ICU LOS, as well as higher rates of surgical re-exploration, extracorporeal membrane oxygenation, surgical site infection, ventilator-associated pneumonia, acute kidney injury, myocardial infarction, stroke, and physician visits and readmission to ICU and hospital within 30 days of discharge (Appendix 2, available at www.cmajopen.ca/content/11/1/E180/suppl/DC1).
Predictors of length of stay
The binary model for prolonged LOS consisted of 16 variables (Table 2) with the continuous model consisting of 28 variables (Table 3). The characteristics common to both models are procedure type and duration, age, rural residence, BMI, frailty, Canadian Cardiovascular Society and New York Heart Association classification status, left ventricular ejection fraction, glomerular filtration rate, valvular disease, diabetes requiring treatment, anemia, cerebrovascular disease, malignancy and depression. The continuous outcome model additionally included sex, presenting at a community hospital, operative priority, atrial fibrillation, endocarditis, peripheral arterial disease, chronic obstructive pulmonary disease, pulmonary circulatory disease, alcoholism, dementia and psychosis.
Model performance
Binary outcome model
The C statistic was 0.92 in both derivation and validation data sets, demonstrating excellent discrimination. The model was well calibrated, with a Brier score of 0.016 and the observed and predicted risks of prolonged LOS being very similar across all probability vigintiles in the calibration data set (Figure 1). At the optimal cut-off value of 0.0166, the binary model had a sensitivity of 88.7% and specificity of 81.2%.
Continuous outcome model
The continuous model had an MAE of 2.3 days in the derivation data set. The MAE was 2.0 in the validation data set, indicating good predictive accuracy. The distribution of MAE according to vigintiles of hospital LOS in the derivation and validation cohorts are presented in Appendix 3 (available at www.cmajopen.ca/content/11/1/E180/suppl/DC1). The calibration plot in Figure 2 shows that the mean observed and predicted hospital LOS within each LOS vigintile were nearly identical in the validation cohort.
Interpretation
Operative decision-making, performance benchmarking and postoperative resource planning may be enhanced by objective tools. Traditional statistical models are dated; they are limited to the prediction of prolonged LOS of varying durations and fail to predict continuous LOS.43–46 Although continuous hospital LOS is beginning to be explored using machine learning techniques in isolated CABG patients, it is based on small, single-centre data sets and lacks generalizability in the broader health care setting.47,48 The ability to predict continuous hospital LOS is advantageous to hospital administrators, patients and caregivers. Patients often ask about their projected LOS as it sets realistic expectations of the costs and benefits of surgery as a part of the informed consent and therapeutic decision-making process. For caregivers, an estimated LOS will more effectively facilitate personal and professional arrangements to care for loved ones after surgery. At the hospital level, the widening use of electronic medical records may lend itself to institution-based, self-quality assessments, and to rationally allocate telemonitoring and other postdischarge care and follow-up resources to reduce morbidity and readmissions. At the system level, these models could be used to generate risk-adjusted observed compared with expected LOS to systematically evaluate performance.
Our models were pragmatically designed for operational capacity planning and were derived and validated in a large and representative population to overcome these limitations. CorHealth Ontario produced an independent report addressing risk-adjusted average hospital LOS for the 2011–2016 fiscal years.49 Of note, this report had trimmed hospital LOS at the 99th percentile to remove extreme observations, much like the methodology employed in our analysis to isolate toptier resource users who are at the highest risk of complications, worsening frailty, functional decline and loss of personal freedom and independence after surgery.13,36,37,50–52 The ability to identify patients at risk for prolonged LOS allows for better decision-making from the health care system as well as the patient. At the system level, this ability, coupled with actual LOS prediction, will facilitate data-driven clinical scheduling to increase throughput, facilitate targeted interventions such as prehabilitation, Enhanced Recovery After Surgery and early referral to continuing care facilities. As prolonged LOS has also been implicated with increased health care cost7 and disability after discharge,13,35–38 our predictive models will inform effective provider–patient discussions and encourage patient-centred operative decision-making. Practically, this could be implemented by first identifying patients at risk of prolonged LOS followed by using the continuous model to facilitate actual LOS prediction in patients whose LOS would not likely exceed 30 days.
Notably, our binary outcome model demonstrated excellent performance with a C statistic of 0.92 and outperforms existing models. Comparatively, the EuroScore had a C statistic of 0.71 (IQR 0.69–0.72) for predicting prolonged hospital LOS (> 12 d) when validated in a monocentric setting,9 and the Society of Thoracic Surgeon’s model had a C statistic of 0.716–0.732 for predicting a short LOS of 5 days or less and 0.739–0.796 for predicting a prolonged LOS of more than 5 days, depending on the type of surgery performed.53 It should also be noted that these models rely on designated staff for data collection, which constitutes further health care resource demands and is not feasible at all Canadian centres.
Our continuous outcome model was able to predict LOS with an MAE of 2 days, which is accepted in a publicly funded health care system given LOS could be influenced by the availability of postdischarge continuing care facilities and home-based caregivers rather than medical indications alone. Importantly, our ability to predict continuous LOS enables precision-based hospital capacity planning, as well as quality benchmarking and incentivized allocation of health care funding. Incorporation of the models into tools, such as the province-wide CorHealth information system, could also help individual providers to understand bed requirements at the time of intervention, allowing for more accurate resource planning.
Many risk factors from our LOS models are consistent with those published in the literature.11,54 In addition, we were able to incorporate frailty as a defining element of perioperative outcomes and recovery,55,56 as well as anemia, hospital type and a variety of sociodemographic factors to ensure that all patient groups are equally represented. A further novel component was the inclusion of psychiatric comorbidities such as depression and psychosis, which have been demonstrated in other settings to be associated with increased hospital admission, LOS and death.57,58 The apparent protective influence of rurality on outcomes is likely explained by observations from other Ontario population-based studies, that rural residents are more likely to be higher income earners, younger and less likely to be frail or have comorbidities such as anemia and renal disease.59 The variables included in our models are routinely collected and readily available to facilitate their adoption at most institutions.
Notable strengths of our models include their generalizability across the scope of the most frequently performed cardiac surgical procedures in a large and representative population, as well as their relevance to clinicians, administrators, policy-makers and patients.
Limitations
Limitations include those inherent to health administrative data sets including the lack of certain detailed physiologic measures, such as natriuretic peptide, in the databases used. Additional limitations include limited application in patients who undergo minimally invasive cardiac procedures and less frequently performed procedures such as cardiac transplant and ventricular assist device placement. Further validation may be needed in other health care jurisdictions.
Conclusion
We derived and validated a set of clinical prediction models to identify top-tier resource users (hospital LOS > 30 d) and actual LOS after cardiac surgery with excellent accuracy. Care, outcomes and patient satisfaction may be substantially improved if clinical judgment is supported by objective quantification in the planning of care. Shared decision-making through a thorough discussion of evidence-based estimates of risk and benefits of treatment options with patients and caregivers is vital to patient-centred care. Our research provides scientific evidence to inform shared clinical decision-making. Also, being based on an unbiased population-based sample, these models could be combined with established ICU LOS7 and waitlist60 management tools to provide evidence-based triaging decision support, to conserve system capacity, enhance operational efficiency, and to benchmark performance.
Acknowledgement
The authors acknowledge the usage of data compiled and provided by the Canadian Institute for Health Information. These data sets were linked using unique encoded identifiers and analyzed at ICES. The analyses, conclusions, opinions and statements expressed in the manuscript are those of the authors and do not necessarily reflect those of the above agencies.
Footnotes
Competing interests: None declared.
This article has been peer reviewed.
Contributors: Louise Sun contributed to the conception and design of the work. Anan Bader Eddeen and Louise Sun contributed to the acquisition and analysis of the data. Alexandra Fottinger, Anan Bader Eddeen and Louise Sun interpreted the data. Alexandra Fottinger and Louise Sun drafted the manuscript. All authors revised the manuscript critically for important intellectual content, gave final approval of the version to be published and agreed to be accountable for all aspects of the work.
Funding: This study is supported by the COVID-19 Response Fund from the University of Ottawa. Louise Sun was the National New Investigator at the Heart and Stroke Foundation of Canada. Douglas Lee is the Ted Rogers Chair in Heart Function Outcomes, University Health Network, University of Toronto.
Data sharing: The data set from this study is held securely in coded form at ICES. Although legal data sharing agreements between ICES and data providers (e.g., health care organizations and government) prohibit ICES from making the data set publicly available, access may be granted to those who meet prespecified criteria for confidential access, available at https://www.ices.on.ca/DAS (email: das{at}ices.on.ca). The full data set creation plan and underlying analytic code are available from the authors on request, understanding that the computer programs may rely on coding templates or macros that are unique to ICES and are therefore either inaccessible or may require modification.
Disclaimer: This study was supported by ICES, which is funded by an annual grant from the Ontario Ministry of Health (MOH) and the Ministry of Long-Term Care (MLTC). Parts of this material are based on data and information compiled and provided by the Canadian Institute for Health Information (CIHI). The authors acknowledge that the clinical registry data used in this analysis is from participating hospitals through CorHealth Ontario, which serves as an advisory body to the MOH, is funded by the MOH and is dedicated to improving the quality, efficiency and access and equity in the delivery of the continuum of adult cardiac and stroke care in Ontario. The analyses, conclusions, opinions and statements expressed herein are solely those of the authors and do not reflect those of the funding or data sources; no endorsement is intended or should be inferred.
Supplemental information: For reviewer comments and the original submission of this manuscript, please see www.cmajopen.ca/content/11/1/E180/suppl/DC1.
This is an Open Access article distributed in accordance with the terms of the Creative Commons Attribution (CC BY-NC-ND 4.0) licence, which permits use, distribution and reproduction in any medium, provided that the original publication is properly cited, the use is noncommercial (i.e., research or educational use) and no modifications or adaptations are made. See: https://creativecommons.org/licenses/by-nc-nd/4.0/
References
- © 2023 CMA Impact Inc. or its licensors