TY - GEN
T1 - Automatic Labeling of Vertebrae in Long-Length Intraoperative Imaging with a Multi-View, Region-Based CNN
AU - Huang, Y.
AU - Jones, Craig
AU - Zhang, X.
AU - Johnston, A.
AU - Aygun, Nafi
AU - Witham, Timothy F
AU - Helm, P. A.
AU - Siewerdsen, J. H.
AU - Uneri, Ali
N1 - Funding Information:
This research was supported by research collaboration with Medtronic. The system and algorithms shown in this work were for research purposes only and are not available for sale or commercial use.
Publisher Copyright:
© 2022 SPIE.
PY - 2022
Y1 - 2022
N2 - Purpose: A recent imaging method (viz., Long-Film) for capturing long-length images of the spine was enabled on the Oarm™ system. Proposed work uses a custom, multi-perspective, region-based convolutional neural network (R-CNN) for labeling vertebrae in Long-Film images and evaluates approaches for incorporating long contextual information to take advantage of the extended field-of-view and improve the labeling accuracy. Methods: Evaluated methods for incorporating contextual information include: (1) a recurrent network module with long short-term memory (LSTM) added after R-CNN classification; and (2) a post-processing, sequence-sorting step based on the label confidence scores. The models were trained and validated on 11,805 Long-Film images simulated from projections of 370 CT images and tested on 50 Long-Film images of 14 cadaveric specimens. Results: The multi-perspective R-CNN with LSTM module achieved 91.7% vertebrae level identification rate, compared to 72.4% when used without LSTM, thus demonstrating the improvement of incorporating contextual information. While sequence sorting achieved 89.4% in labeling accuracy, it failed to handle errors during detection and did not provide additional improvements when applied following the LSTM module. Conclusions: The proposed LSTM module significantly improved the labeling accuracy upon the base model through effective contextual information incorporation and training in an end-to-end fashion. Compared to sequence sorting, it showed more flexibility towards false positives and false negatives in vertebrae detection. The proposed model offers the potential to provide a valuable check for target localization and forms the basis for automatic measurement of spinal curvature changes in interventional settings.
AB - Purpose: A recent imaging method (viz., Long-Film) for capturing long-length images of the spine was enabled on the Oarm™ system. Proposed work uses a custom, multi-perspective, region-based convolutional neural network (R-CNN) for labeling vertebrae in Long-Film images and evaluates approaches for incorporating long contextual information to take advantage of the extended field-of-view and improve the labeling accuracy. Methods: Evaluated methods for incorporating contextual information include: (1) a recurrent network module with long short-term memory (LSTM) added after R-CNN classification; and (2) a post-processing, sequence-sorting step based on the label confidence scores. The models were trained and validated on 11,805 Long-Film images simulated from projections of 370 CT images and tested on 50 Long-Film images of 14 cadaveric specimens. Results: The multi-perspective R-CNN with LSTM module achieved 91.7% vertebrae level identification rate, compared to 72.4% when used without LSTM, thus demonstrating the improvement of incorporating contextual information. While sequence sorting achieved 89.4% in labeling accuracy, it failed to handle errors during detection and did not provide additional improvements when applied following the LSTM module. Conclusions: The proposed LSTM module significantly improved the labeling accuracy upon the base model through effective contextual information incorporation and training in an end-to-end fashion. Compared to sequence sorting, it showed more flexibility towards false positives and false negatives in vertebrae detection. The proposed model offers the potential to provide a valuable check for target localization and forms the basis for automatic measurement of spinal curvature changes in interventional settings.
UR - http://www.scopus.com/inward/record.url?scp=85131959986&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85131959986&partnerID=8YFLogxK
U2 - 10.1117/12.2611912
DO - 10.1117/12.2611912
M3 - Conference contribution
AN - SCOPUS:85131959986
T3 - Progress in Biomedical Optics and Imaging - Proceedings of SPIE
BT - Medical Imaging 2022
A2 - Linte, Cristian A.
A2 - Siewerdsen, Jeffrey H.
PB - SPIE
T2 - Medical Imaging 2022: Image-Guided Procedures, Robotic Interventions, and Modeling
Y2 - 21 March 2022 through 27 March 2022
ER -