On representing the prognostic value of continuous gene expression biomarkers with the restricted mean survival curve
Metrics: PDF 484 views | HTML 706 views | ?
Kevin H. Eng1, Emily Schiller1 and Kayla Morrell1
1 Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute, Buffalo, NY, USA
Kevin H. Eng, email:
Keywords: dichotomization, model diagnostics, prognostic marker, restricted mean survival, survival analysis
Received: June 26, 2015 Accepted: September 12, 2015 Published: October 14, 2015
Motivation. Researchers developing biomarkers for cancer prognosis from quantitative gene expression data are often faced with an odd methodological discrepancy: while Cox’s proportional hazards model, the appropriate and popular technique, produces a continuous and relative risk score, it is hard to cast the estimate in clear clinical terms like median months of survival and percent of patients affected. To produce a familiar Kaplan-Meier plot, researchers commonly make the decision to dichotomize a continuous (often unimodal and symmetric) score. It is well known in the statistical literature that this procedure induces significant bias.
Results. We illustrate the liabilities of common techniques for categorizing a risk score and discuss alternative approaches. We promote the use of the restricted mean survival (RMS) and the corresponding RMS curve that may be thought of as an analog to the best fit line from simple linear regression.
Conclusions. Continuous biomarker workflows should be modified to include the more rigorous statistical techniques and descriptive plots described in this article. All statistics discussed can be computed via standard functions in the Survival package of the R statistical programming language. Example R language code for the RMS curve is presented in the appendix.
All site content, except where otherwise noted, is licensed under a Creative Commons Attribution 3.0 License.