Predicting COVID-19 related death using the OpenSAFELY platform
The OpenSAFELY Collaborative,
Elizabeth J Williamson,
Helen I McDonald,
Alex J Walker,
Sebastian CJ Bacon,
Helen J Curtis,
Caroline E Morton,
Nicholas G Davies,
Nicholas J DeVito,
Ian J Douglas,
Christopher T Rentsch,
Angel YS Wong,
David A Harrison,
Ewout W. Steyerberg,
Rosalind M Eggo,
Stephen JW Evans,
Posted 01 Mar 2021
medRxiv DOI: 10.1101/2021.02.25.21252433
Objectives: To compare approaches for obtaining relative and absolute estimates of risk of 28-day COVID-19 mortality for adults in the general population of England in the context of changing levels of circulating infection.

Design: Three designs were compared: (A) case-cohort, which does not explicitly account for the time-changing prevalence of COVID-19 infection; (B) 28-day landmarking, a series of sequential overlapping sub-studies incorporating time-updating proxy measures of the prevalence of infection; and (C) daily landmarking. Regression models were fitted to predict 28-day COVID-19 mortality.

Setting: Working on behalf of NHS England, we used clinical data from adult patients from all regions of England held in the TPP SystmOne electronic health record system, linked to Office for National Statistics (ONS) mortality data, using the OpenSAFELY platform.

Participants: Eligible participants were adults aged 18 or over, registered at a general practice using TPP software on 1st March 2020 with recorded sex, postcode and ethnicity. 11,972,947 individuals were included, and 7,999 participants experienced a COVID-19 related death. The study period lasted 100 days, ending 8th June 2020.

Predictors: A range of demographic characteristics and comorbidities were used as potential predictors. Local infection prevalence was estimated with three proxies: modelled based on local prevalence and other key factors; rate of A&E COVID-19 related attendances; and rate of suspected COVID-19 cases in primary care.

Main outcome measures: COVID-19 related death.

Results: All models discriminated well between patients who did and did not experience COVID-19 related death, with C-statistics ranging from 0.92 to 0.94. Accurate estimates of absolute risk required data on local infection prevalence, with modelled estimates providing the best performance.

Conclusions: Reliable estimates of absolute risk need to incorporate changing local prevalence of infection. Simple models can provide very good discrimination and may simplify implementation of risk prediction tools in practice.
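The landmarking designs described in the abstract can be illustrated with a minimal sketch of how sequential, overlapping 28-day sub-studies might be laid out over the 100-day study period. This is not the authors' code: the function names, the inclusive-day window convention, the spacing between landmark dates (`step_days`) and the rule for excluding people who died before a landmark are all assumptions made for illustration. Only the study dates and the 28-day horizon come from the abstract.

```python
from datetime import date, timedelta

# Dates and horizon taken from the abstract; everything else is assumed.
STUDY_START = date(2020, 3, 1)
STUDY_END = date(2020, 6, 8)
HORIZON_DAYS = 28


def landmark_windows(start, end, horizon_days, step_days):
    """Return (landmark_date, window_end) pairs that fit inside [start, end].

    Windows are inclusive of both endpoints (an assumption), so each
    one covers exactly `horizon_days` calendar days.
    """
    windows = []
    landmark = start
    span = timedelta(days=horizon_days - 1)
    while landmark + span <= end:
        windows.append((landmark, landmark + span))
        landmark += timedelta(days=step_days)
    return windows


def outcome_in_window(death_date, landmark, window_end):
    """Classify one person within one sub-study (hypothetical rule).

    Returns None if the person died before the landmark (not at risk),
    1 if they died within the window, and 0 otherwise.
    """
    if death_date is not None and death_date < landmark:
        return None
    if death_date is not None and death_date <= window_end:
        return 1
    return 0


if __name__ == "__main__":
    daily = landmark_windows(STUDY_START, STUDY_END, HORIZON_DAYS, step_days=1)
    print(len(daily), daily[0], daily[-1])
```

Under these assumptions, a one-day step yields 73 overlapping windows, the first running from 1 March to 28 March 2020 and the last starting on 12 May 2020. In a full analysis, each window would also carry the time-updated infection prevalence proxy measured at its landmark date, which this sketch omits.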