TY - JOUR
T1 - Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence
AU - Bai, Xiang
AU - Wang, Hanchen
AU - Ma, Liya
AU - Xu, Yongchao
AU - Gan, Jiefeng
AU - Fan, Ziwei
AU - Yang, Fan
AU - Ma, Ke
AU - Yang, Jiehua
AU - Bai, Song
AU - Shu, Chang
AU - Zou, Xinyu
AU - Huang, Renhao
AU - Zhang, Changzheng
AU - Liu, Xiaowu
AU - Tu, Dandan
AU - Xu, Chuou
AU - Zhang, Wenqing
AU - Wang, Xi
AU - Chen, Anguo
AU - Zeng, Yu
AU - Yang, Dehua
AU - Wang, Ming Wei
AU - Holalkere, Nagaraj
AU - Halin, Neil J.
AU - Kamel, Ihab R.
AU - Wu, Jia
AU - Peng, Xuehua
AU - Wang, Xiang
AU - Shao, Jianbo
AU - Mongkolwat, Pattanasak
AU - Zhang, Jianjun
AU - Liu, Weiyang
AU - Roberts, Michael
AU - Teng, Zhongzhao
AU - Beer, Lucian
AU - Sanchez, Lorena E.
AU - Sala, Evis
AU - Rubin, Daniel L.
AU - Weller, Adrian
AU - Lasenby, Joan
AU - Zheng, Chuangsheng
AU - Wang, Jianming
AU - Li, Zhen
AU - Schönlieb, Carola
AU - Xia, Tian
N1 - Funding Information:
This study was supported by the HUST COVID-19 Rapid Response Call (grant nos. 2020kfyXGYJ021, 2020kfyXGYJ031, 2020kfyXGYJ093 and 2020kfyXGYJ094), the National Natural Science Foundation of China (grant nos. 61703171 and 81771801), National Cancer Institute, National Institutes of Health (grant no. U01CA242879) and Thammasat University Research fund under the NRCT (contract no. 25/2561) for the Digital platform for sustainable digital economy development project based on the RUN Digital Cluster collaboration scheme. H.W. acknowledges support from Cambridge Trust, Kathy Xu Fellowship, Centre for Advanced Photonics and Electronics and the Cambridge Philosophical Society. M.R. acknowledges support from AstraZeneca, Intel and the DRAGON consortium. A.W. acknowledges support from a Turing AI Fellowship under grant no. EP/V025379/1, The Alan Turing Institute and the Leverhulme Trust via CFI. C.B.S. acknowledges support from DRAGON, Intel, the Philip Leverhulme Prize, the Royal Society Wolfson Fellowship, the EPSRC grants EP/S026045/1 and EP/ T003553/1, EP/N014588/1, EP/T017961/1, the Wellcome Innovator Award RG98755, the Leverhulme Trust project Unveiling the invisible, the European Union Horizon 2020 research and innovation programme under the Marie Skodowska-Curie grant agreement no. 777826 NoMADS, the Cantab Capital Institute for the Mathematics of Information and the Alan Turing Institute. The funders did not participate in the research or review any details of this study.
Publisher Copyright:
© 2021, The Author(s).
PY - 2021/12
Y1 - 2021/12
N2 - Artificial intelligence provides a promising solution for streamlining COVID-19 diagnoses; however, concerns surrounding security and trustworthiness impede the collection of large-scale representative medical data, posing a considerable challenge for training a well-generalized model in clinical practices. To address this, we launch the Unified CT-COVID AI Diagnostic Initiative (UCADI), where the artificial intelligence (AI) model can be distributedly trained and independently executed at each host institution under a federated learning framework without data sharing. Here we show that our federated learning framework model considerably outperformed all of the local models (with a test sensitivity/specificity of 0.973/0.951 in China and 0.730/0.942 in the United Kingdom), achieving comparable performance with a panel of professional radiologists. We further evaluated the model on the hold-out (collected from another two hospitals without the federated learning framework) and heterogeneous (acquired with contrast materials) data, provided visual explanations for decisions made by the model, and analysed the trade-offs between the model performance and the communication costs in the federated training process. Our study is based on 9,573 chest computed tomography scans from 3,336 patients collected from 23 hospitals located in China and the United Kingdom. Collectively, our work advanced the prospects of utilizing federated learning for privacy-preserving AI in digital health.
AB - Artificial intelligence provides a promising solution for streamlining COVID-19 diagnoses; however, concerns surrounding security and trustworthiness impede the collection of large-scale representative medical data, posing a considerable challenge for training a well-generalized model in clinical practices. To address this, we launch the Unified CT-COVID AI Diagnostic Initiative (UCADI), where the artificial intelligence (AI) model can be distributedly trained and independently executed at each host institution under a federated learning framework without data sharing. Here we show that our federated learning framework model considerably outperformed all of the local models (with a test sensitivity/specificity of 0.973/0.951 in China and 0.730/0.942 in the United Kingdom), achieving comparable performance with a panel of professional radiologists. We further evaluated the model on the hold-out (collected from another two hospitals without the federated learning framework) and heterogeneous (acquired with contrast materials) data, provided visual explanations for decisions made by the model, and analysed the trade-offs between the model performance and the communication costs in the federated training process. Our study is based on 9,573 chest computed tomography scans from 3,336 patients collected from 23 hospitals located in China and the United Kingdom. Collectively, our work advanced the prospects of utilizing federated learning for privacy-preserving AI in digital health.
UR - http://www.scopus.com/inward/record.url?scp=85121384145&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85121384145&partnerID=8YFLogxK
U2 - 10.1038/s42256-021-00421-z
DO - 10.1038/s42256-021-00421-z
M3 - Article
AN - SCOPUS:85121384145
SN - 2522-5839
VL - 3
SP - 1081
EP - 1089
JO - Nature Machine Intelligence
JF - Nature Machine Intelligence
IS - 12
ER -