Rxivist logo

Regional performance variation in external validation of four prediction models for severity of COVID-19 at hospital admission: An observational multi-centre cohort study

By Kristin Wickstrom, Valeria Vitelli, Ewan Carr, Aleksander Rygh Holten, Rebecca Bendayan, Andrew Henry Reiner, Daniel Bean, Tom Searle, Anthony Shek, Zeljko Kraljevic, James T Teo, Richard Dobson, Kristian Tonby, Alvaro Köhn-Luque, Erik Koldberg Amundsen

Posted 26 Mar 2021
medRxiv DOI: 10.1101/2021.03.26.21254390

Background: Several prediction models for coronavirus disease-19 (COVID-19) have been published. Prediction models should be externally validated to assess their performance before implementation. This observational cohort study aimed to validate published models of severity for hospitalized patients with COVID-19 using clinical and laboratory predictors. Methods: Prediction models fitting relevant inclusion criteria were chosen for validation. The outcome was either mortality or a composite outcome of mortality and ICU admission (severe disease). 1295 patients admitted with symptoms of COVID-19 at Kings Cross Hospital (KCH) in London, United Kingdom, and 307 patients at Oslo University Hospital (OUH) in Oslo, Norway were included. The performance of the models was assessed in terms of discrimination and calibration. Results: We identified two models for prediction of mortality (referred to as Xie and Zhang1) and two models for prediction of severe disease (Allenbach and Zhang2). The performance of the models was variable. For prediction of mortality Xie had good discrimination at OUH with an area under the receiver-operating characteristic (AUROC) 0.87 [95 % confidence interval (CI) 0.79-0.95] and acceptable discrimination at KCH, AUROC 0.79 [0.76-0.82]. In prediction of severe disease, Allenbach had acceptable discrimination (OUH AUROC 0.81 [0.74-0.88] and KCH AUROC 0.72 [0.68-0.75]). The Zhang models had moderate to poor discrimination. Initial calibration was poor for all models but improved with recalibration. Conclusions: The performance of the four prediction models was variable. The Xie model had the best discrimination for mortality, while the Allenbach model had acceptable results for prediction of severe disease.

Download data

  • Downloaded 152 times
  • Download rankings, all-time:
    • Site-wide: 158,598
    • In infectious diseases: 6,603
  • Year to date:
    • Site-wide: 94,834
  • Since beginning of last month:
    • Site-wide: 114,170

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide