Deep Convolutional Neural Network-Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts with Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths

Christoph Germann; Giuseppe Marbach; Francesco Civardi; Sandro F. Fucentese; Jan Fritz; Reto Sutter; Christian W.A. Pfirrmann; Benjamin Fritz

doi:10.1097/RLI.0000000000000664

School of Medicine

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

Objectives The aim of this study was to clinically validate a Deep Convolutional Neural Network (DCNN) for the detection of surgically proven anterior cruciate ligament (ACL) tears in a large patient cohort and to analyze the effect of magnetic resonance examinations from different institutions, varying protocols, and field strengths.

Materials and Methods After ethics committee approval, this retrospective analysis of prospectively collected data was performed on 512 consecutive subjects, who underwent knee magnetic resonance imaging (MRI) in a total of 59 different institutions followed by arthroscopic knee surgery at our institution. The DCNN and 3 fellowship-trained full-time academic musculoskeletal radiologists evaluated the MRI examinations for full-thickness ACL tears independently. Surgical reports served as the reference standard. Statistics included diagnostic performance metrics, including sensitivity, specificity, area under the receiver operating characteristic curve ("AUC ROC"), and kappa statistics. P values less than 0.05 were considered to represent statistical significance.

Results Anterior cruciate ligament tears were present in 45.7% (234/512) and absent in 54.3% (278/512) of the subjects. The DCNN had a sensitivity of 96.1%, which was not significantly different from the readers (97.5%-97.9%; all P ≥ 0.118), but significantly lower specificity of 93.1% (readers, 99.6%-100%; all P < 0.001) and "AUC ROC" of 0.935 (readers, 0.989-0.991; all P < 0.001) for the entire cohort. Subgroup analysis showed a significantly lower sensitivity, specificity, and "AUC ROC" of the DCNN for outside MRI (92.5%, 87.1%, and 0.898, respectively) than in-house MRI (99.0%, 94.4%, and 0.967, respectively) examinations (P = 0.026, P = 0.043, and P < 0.05, respectively). There were no significant differences in DCNN performance for 1.5-T and 3-T MRI examinations (all P ≥ 0.753).

Conclusions Deep Convolutional Neural Network performance of ACL tear diagnosis can approach performance levels similar to fellowship-trained full-time academic musculoskeletal radiologists at 1.5 T and 3 T; however, the performance may decrease with increasing MRI examination heterogeneity.
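
For readers less familiar with the metrics named in the abstract, the following is a minimal sketch of how sensitivity and specificity are conventionally computed from a 2x2 confusion matrix. It is not the authors' analysis code. The function names are illustrative, and the confusion-matrix counts in the example (90, 10, 80, 20) are hypothetical toy values, not study data. Only the cohort counts 234, 278, and 512 are taken from the abstract.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of reference-standard positives that were detected."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of reference-standard negatives that were correctly ruled out."""
    return true_neg / (true_neg + false_pos)


def percent(numerator: int, denominator: int) -> float:
    """Percentage rounded to one decimal place, as reported in the abstract."""
    return round(100 * numerator / denominator, 1)


# Cohort counts reported in the abstract.
TOTAL_SUBJECTS = 512
SUBJECTS_WITH_TEAR = 234
SUBJECTS_WITHOUT_TEAR = 278

tear_present_pct = percent(SUBJECTS_WITH_TEAR, TOTAL_SUBJECTS)
tear_absent_pct = percent(SUBJECTS_WITHOUT_TEAR, TOTAL_SUBJECTS)

# Hypothetical toy counts (NOT from the study), used only to show the formulas.
toy_sensitivity = sensitivity(true_pos=90, false_neg=10)
toy_specificity = specificity(true_neg=80, false_pos=20)

print(f"Tear present: {tear_present_pct}%, absent: {tear_absent_pct}%")
print(f"Toy sensitivity: {toy_sensitivity}, toy specificity: {toy_specificity}")
```

The percentage helper reproduces the reported cohort split from the stated counts. The per-reader and DCNN confusion matrices are not given in the abstract, so they are not reconstructed here.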

Original language	English (US)
Pages (from-to)	499-506
Number of pages	8
Journal	Investigative radiology
Volume	55
Issue number	8
DOIs	https://doi.org/10.1097/RLI.0000000000000664
State	Published - Aug 1 2020

Keywords

anterior cruciate ligament injuries
artificial intelligence
knee injuries
magnetic resonance imaging
neural networks (computer)

ASJC Scopus subject areas

General Medicine

Cite this

Germann, C., Marbach, G., Civardi, F., Fucentese, S. F., Fritz, J., Sutter, R., Pfirrmann, C. W. A., & Fritz, B. (2020). Deep Convolutional Neural Network-Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts with Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths. Investigative radiology, 55(8), 499-506. https://doi.org/10.1097/RLI.0000000000000664

@article{a64159e11b1c41a3bc6bef808e142a5a,

title = "Deep Convolutional Neural Network-Based Diagnosis of Anterior Cruciate Ligament Tears: Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts with Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths",

abstract = "Objectives The aim of this study was to clinically validate a Deep Convolutional Neural Network (DCNN) for the detection of surgically proven anterior cruciate ligament (ACL) tears in a large patient cohort and to analyze the effect of magnetic resonance examinations from different institutions, varying protocols, and field strengths. Materials and Methods After ethics committee approval, this retrospective analysis of prospectively collected data was performed on 512 consecutive subjects, who underwent knee magnetic resonance imaging (MRI) in a total of 59 different institutions followed by arthroscopic knee surgery at our institution. The DCNN and 3 fellowship-trained full-time academic musculoskeletal radiologists evaluated the MRI examinations for full-thickness ACL tears independently. Surgical reports served as the reference standard. Statistics included diagnostic performance metrics, including sensitivity, specificity, area under the receiver operating curve ({"}AUC ROC{"}), and kappa statistics. P values less than 0.05 were considered to represent statistical significance. Results Anterior cruciate ligament tears were present in 45.7% (234/512) and absent in 54.3% (278/512) of the subjects. The DCNN had a sensitivity of 96.1%, which was not significantly different from the readers (97.5%-97.9%; all P ≥ 0.118), but significantly lower specificity of 93.1% (readers, 99.6%-100%; all P < 0.001) and {"}AUC ROC{"} of 0.935 (readers, 0.989-0.991; all P < 0.001) for the entire cohort. Subgroup analysis showed a significantly lower sensitivity, specificity, and {"}AUC ROC{"} of the DCNN for outside MRI (92.5%, 87.1%, and 0.898, respectively) than in-house MRI (99.0%, 94.4%, and 0.967, respectively) examinations (P = 0.026, P = 0.043, and P < 0.05, respectively). There were no significant differences in DCNN performance for 1.5-T and 3-T MRI examinations (all P ≥ 0.753, respectively).
Conclusions Deep Convolutional Neural Network performance of ACL tear diagnosis can approach performance levels similar to fellowship-trained full-time academic musculoskeletal radiologists at 1.5 T and 3 T; however, the performance may decrease with increasing MRI examination heterogeneity.",

keywords = "anterior cruciate ligament injuries, artificial intelligence, knee injuries, magnetic resonance imaging, neural networks (computer)",

author = "Christoph Germann and Giuseppe Marbach and Francesco Civardi and Fucentese, {Sandro F.} and Jan Fritz and Reto Sutter and Pfirrmann, {Christian W.A.} and Benjamin Fritz",

year = "2020",

month = aug,

day = "1",

doi = "10.1097/RLI.0000000000000664",

language = "English (US)",

volume = "55",

pages = "499--506",

journal = "Investigative radiology",

issn = "0020-9996",

publisher = "Lippincott Williams and Wilkins",

number = "8",

}

TY - JOUR

T1 - Deep Convolutional Neural Network-Based Diagnosis of Anterior Cruciate Ligament Tears

T2 - Performance Comparison of Homogenous Versus Heterogeneous Knee MRI Cohorts with Different Pulse Sequence Protocols and 1.5-T and 3-T Magnetic Field Strengths

AU - Germann, Christoph

AU - Marbach, Giuseppe

AU - Civardi, Francesco

AU - Fucentese, Sandro F.

AU - Fritz, Jan

AU - Sutter, Reto

AU - Pfirrmann, Christian W.A.

AU - Fritz, Benjamin

PY - 2020/8/1

Y1 - 2020/8/1

N2 - Objectives The aim of this study was to clinically validate a Deep Convolutional Neural Network (DCNN) for the detection of surgically proven anterior cruciate ligament (ACL) tears in a large patient cohort and to analyze the effect of magnetic resonance examinations from different institutions, varying protocols, and field strengths. Materials and Methods After ethics committee approval, this retrospective analysis of prospectively collected data was performed on 512 consecutive subjects, who underwent knee magnetic resonance imaging (MRI) in a total of 59 different institutions followed by arthroscopic knee surgery at our institution. The DCNN and 3 fellowship-trained full-time academic musculoskeletal radiologists evaluated the MRI examinations for full-thickness ACL tears independently. Surgical reports served as the reference standard. Statistics included diagnostic performance metrics, including sensitivity, specificity, area under the receiver operating curve ("AUC ROC"), and kappa statistics. P values less than 0.05 were considered to represent statistical significance. Results Anterior cruciate ligament tears were present in 45.7% (234/512) and absent in 54.3% (278/512) of the subjects. The DCNN had a sensitivity of 96.1%, which was not significantly different from the readers (97.5%-97.9%; all P ≥ 0.118), but significantly lower specificity of 93.1% (readers, 99.6%-100%; all P < 0.001) and "AUC ROC" of 0.935 (readers, 0.989-0.991; all P < 0.001) for the entire cohort. Subgroup analysis showed a significantly lower sensitivity, specificity, and "AUC ROC" of the DCNN for outside MRI (92.5%, 87.1%, and 0.898, respectively) than in-house MRI (99.0%, 94.4%, and 0.967, respectively) examinations (P = 0.026, P = 0.043, and P < 0.05, respectively). There were no significant differences in DCNN performance for 1.5-T and 3-T MRI examinations (all P ≥ 0.753, respectively).
Conclusions Deep Convolutional Neural Network performance of ACL tear diagnosis can approach performance levels similar to fellowship-trained full-time academic musculoskeletal radiologists at 1.5 T and 3 T; however, the performance may decrease with increasing MRI examination heterogeneity.

KW - anterior cruciate ligament injuries

KW - artificial intelligence

KW - knee injuries

KW - magnetic resonance imaging

KW - neural networks (computer)

UR - http://www.scopus.com/inward/record.url?scp=85088207831&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85088207831&partnerID=8YFLogxK

U2 - 10.1097/RLI.0000000000000664

DO - 10.1097/RLI.0000000000000664

M3 - Article

C2 - 32168039

AN - SCOPUS:85088207831

SN - 0020-9996

VL - 55

SP - 499

EP - 506

JO - Investigative radiology

JF - Investigative radiology

IS - 8

ER -
