Automated Survival Prediction in Metastatic Cancer Patients Using High-Dimensional Electronic Medical Record Data

Michael F. Gensheimer, A. Solomon Henry, Douglas J. Wood, Trevor J. Hastie, Sonya Aggarwal, Sara A. Dudley, Pooja Pradhan, Imon Banerjee, Eunpi Cho, Kavitha Ramchandran, Erqi Pollom, Albert C. Koong, Daniel L. Rubin, Daniel T. Chang

Research output: Contribution to journal (Article)

Abstract

Background: Oncologists use patients' life expectancy to guide decisions and may benefit from a tool that accurately predicts prognosis. Existing prognostic models generally use only a few predictor variables. We used an electronic medical record dataset to train a prognostic model for patients with metastatic cancer.

Methods: The model was trained and tested using 12 588 patients treated for metastatic cancer in the Stanford Health Care system from 2008 to 2017. Data sources included provider note text, labs, vital signs, procedures, medication orders, and diagnosis codes. Patients were divided randomly into a training set used to fit the model coefficients and a test set used to evaluate model performance (80%/20% split). A regularized Cox model with 4126 predictor variables was used. A landmarking approach was used due to the multiple observations per patient, with t0 set to the time of metastatic cancer diagnosis. Performance was also evaluated using 399 palliative radiation courses in test set patients.

Results: The C-index for overall survival was 0.786 in the test set (averaged across landmark times). For palliative radiation courses, the C-index was 0.745 (95% confidence interval [CI] = 0.715 to 0.775) compared with 0.635 (95% CI = 0.601 to 0.669) for a published model using performance status, primary tumor site, and treated site (two-sided P < .001). Our model's predictions were well-calibrated.

Conclusions: The model showed high predictive performance, which will need to be validated using external data. Because it is fully automated, the model can be used to examine providers' practice patterns and could be deployed in a decision support tool to help improve quality of care.
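
The C-index values reported above summarize how often a model ranks patients' risk in the same order as their observed survival. As a rough illustration only, the sketch below computes Harrell's concordance index for right-censored data in plain Python. The abstract does not state which C-index variant the authors used, and it does not reproduce their averaging across landmark times, so the function name `concordance_index`, its arguments, and the handling of tied times are all assumptions made for this example.

```python
from itertools import combinations


def concordance_index(times, events, risk_scores):
    """Harrell's C-index for right-censored survival data (illustrative sketch).

    times       -- observed follow-up time for each patient
    events      -- 1 if death was observed, 0 if the patient was censored
    risk_scores -- model output; a higher score means shorter predicted survival
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        if times[i] == times[j]:
            continue  # simplification (assumption): pairs with tied times are skipped
        # Let a be the patient with the shorter observed time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        if not events[a]:
            continue  # shorter time is censored, so the true ordering is unknown
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0  # higher risk assigned to the patient who died first
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which is the scale on which the reported 0.786 and 0.745 should be read.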

Original language: English (US)
Pages (from-to): 568-574
Number of pages: 7
Journal: Journal of the National Cancer Institute
Volume: 111
Issue number: 6
DOI: https://doi.org/10.1093/jnci/djy178
State: Published - Jun 1 2019
Externally published: Yes

ASJC Scopus subject areas

  • Oncology
  • Cancer Research

Cite this

Gensheimer, M. F., Henry, A. S., Wood, D. J., Hastie, T. J., Aggarwal, S., Dudley, S. A., Pradhan, P., Banerjee, I., Cho, E., Ramchandran, K., Pollom, E., Koong, A. C., Rubin, D. L., & Chang, D. T. (2019). Automated Survival Prediction in Metastatic Cancer Patients Using High-Dimensional Electronic Medical Record Data. Journal of the National Cancer Institute, 111(6), 568-574. https://doi.org/10.1093/jnci/djy178