One algorithm may not fit all: How selection bias affects machine learning performance

Alice C. Yu, John Eng

Research output: Contribution to journalArticlepeer-review

Abstract

Machine learning (ML) algorithms have demonstrated high diagnostic accuracy in identifying and categorizing disease on radiologic images. Despite the results of initial research studies that report ML algorithm diagnostic accuracy similar to or exceeding that of radiologists, the results are less impressive when the algorithms are installed at new hospitals and are presented with new images. This phenomenon is potentially the result of selection bias in the data that were used to develop the ML algorithm. Selection bias has long been described by clinical epidemiologists as a key consider-ation when designing a clinical research study, but this concept has largely been unaddressed in the medical imaging ML literature. The authors discuss the importance of selection bias and its rel-evance to ML algorithm development to prepare the radiologist to critically evaluate ML literature for potential selection bias and un-derstand how it might affect the applicability of ML algorithms in real clinical environments.

Original languageEnglish (US)
Pages (from-to)1932-1937
Number of pages6
JournalRadiographics
Volume40
Issue number7
DOIs
StatePublished - Nov 1 2020

ASJC Scopus subject areas

  • Radiology Nuclear Medicine and imaging

Fingerprint Dive into the research topics of 'One algorithm may not fit all: How selection bias affects machine learning performance'. Together they form a unique fingerprint.

Cite this