Dense Depth Estimation in Monocular Endoscopy with Self-Supervised Learning Methods

Xingtong Liu; Ayushi Sinha; Masaru Ishii; Gregory D. Hager; Austin Reiter; Russell H. Taylor; Mathias Unberath

doi:10.1109/TMI.2019.2950936

Research output: Contribution to journal › Article › peer-review

13 Scopus citations

Abstract

We present a self-supervised approach to training convolutional neural networks for dense depth estimation from monocular endoscopy data without a priori modeling of anatomy or shading. Our method only requires monocular endoscopic videos and a multi-view stereo method, e.g., structure from motion, to supervise learning in a sparse manner. Consequently, our method requires neither manual labeling nor patient computed tomography (CT) scan in the training and application phases. In a cross-patient experiment using CT scans as groundtruth, the proposed method achieved submillimeter mean residual error. In a comparison study to recent self-supervised depth estimation methods designed for natural video on in vivo sinus endoscopy data, we demonstrate that the proposed approach outperforms the previous methods by a large margin. The source code for this work is publicly available online at https://github.com/lppllppl920/EndoscopyDepthEstimation-Pytorch.

Original language	English (US)
Article number	8889760
Pages (from-to)	1438-1447
Number of pages	10
Journal	IEEE Transactions on Medical Imaging
Volume	39
Issue number	5
DOIs	https://doi.org/10.1109/TMI.2019.2950936
State	Published - May 2020

Keywords

Endoscopy
depth estimation
self-supervised learning
unsupervised learning

ASJC Scopus subject areas

Software
Radiological and Ultrasound Technology
Computer Science Applications
Electrical and Electronic Engineering

Cite this

@article{a6ca4ea3b9b54c27b31b07477bdcb937,
  title = "Dense Depth Estimation in Monocular Endoscopy with Self-Supervised Learning Methods",
  abstract = "We present a self-supervised approach to training convolutional neural networks for dense depth estimation from monocular endoscopy data without a priori modeling of anatomy or shading. Our method only requires monocular endoscopic videos and a multi-view stereo method, e.g., structure from motion, to supervise learning in a sparse manner. Consequently, our method requires neither manual labeling nor patient computed tomography (CT) scan in the training and application phases. In a cross-patient experiment using CT scans as groundtruth, the proposed method achieved submillimeter mean residual error. In a comparison study to recent self-supervised depth estimation methods designed for natural video on in vivo sinus endoscopy data, we demonstrate that the proposed approach outperforms the previous methods by a large margin. The source code for this work is publicly available online at https://github.com/lppllppl920/EndoscopyDepthEstimation-Pytorch.",
  keywords = "Endoscopy, depth estimation, self-supervised learning, unsupervised learning",
  author = "Xingtong Liu and Ayushi Sinha and Masaru Ishii and Hager, {Gregory D.} and Austin Reiter and Taylor, {Russell H.} and Mathias Unberath",
  note = "Publisher Copyright: {\textcopyright} 1982-2012 IEEE.",
  year = "2020",
  month = may,
  doi = "10.1109/TMI.2019.2950936",
  language = "English (US)",
  volume = "39",
  pages = "1438--1447",
  journal = "IEEE Transactions on Medical Imaging",
  issn = "0278-0062",
  publisher = "Institute of Electrical and Electronics Engineers Inc.",
  number = "5",
}

TY - JOUR

T1 - Dense Depth Estimation in Monocular Endoscopy with Self-Supervised Learning Methods

AU - Liu, Xingtong

AU - Sinha, Ayushi

AU - Ishii, Masaru

AU - Hager, Gregory D.

AU - Reiter, Austin

AU - Taylor, Russell H.

AU - Unberath, Mathias

PY - 2020/5

Y1 - 2020/5

N2 - We present a self-supervised approach to training convolutional neural networks for dense depth estimation from monocular endoscopy data without a priori modeling of anatomy or shading. Our method only requires monocular endoscopic videos and a multi-view stereo method, e.g., structure from motion, to supervise learning in a sparse manner. Consequently, our method requires neither manual labeling nor patient computed tomography (CT) scan in the training and application phases. In a cross-patient experiment using CT scans as groundtruth, the proposed method achieved submillimeter mean residual error. In a comparison study to recent self-supervised depth estimation methods designed for natural video on in vivo sinus endoscopy data, we demonstrate that the proposed approach outperforms the previous methods by a large margin. The source code for this work is publicly available online at https://github.com/lppllppl920/EndoscopyDepthEstimation-Pytorch.

AB - We present a self-supervised approach to training convolutional neural networks for dense depth estimation from monocular endoscopy data without a priori modeling of anatomy or shading. Our method only requires monocular endoscopic videos and a multi-view stereo method, e.g., structure from motion, to supervise learning in a sparse manner. Consequently, our method requires neither manual labeling nor patient computed tomography (CT) scan in the training and application phases. In a cross-patient experiment using CT scans as groundtruth, the proposed method achieved submillimeter mean residual error. In a comparison study to recent self-supervised depth estimation methods designed for natural video on in vivo sinus endoscopy data, we demonstrate that the proposed approach outperforms the previous methods by a large margin. The source code for this work is publicly available online at https://github.com/lppllppl920/EndoscopyDepthEstimation-Pytorch.

KW - Endoscopy

KW - depth estimation

KW - self-supervised learning

KW - unsupervised learning

UR - http://www.scopus.com/inward/record.url?scp=85084461418&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85084461418&partnerID=8YFLogxK

U2 - 10.1109/TMI.2019.2950936

DO - 10.1109/TMI.2019.2950936

M3 - Article

C2 - 31689184

AN - SCOPUS:85084461418

SN - 0278-0062

VL - 39

SP - 1438

EP - 1447

JO - IEEE Transactions on Medical Imaging

JF - IEEE Transactions on Medical Imaging

IS - 5

M1 - 8889760

ER -
