A large and growing corpus of synchronized kinematic and video recordings of robot-assisted surgery has the potential to facilitate training and subtask automation. One of the challenges in segmenting such multi-modal trajectories is that demonstrations vary spatially and temporally, and contain random noise and loops (repetitions until the desired result is achieved). Segments of task trajectories are often less complex and less variable than whole trajectories, and allow for easier detection of outliers. As manual segmentation can be tedious and error-prone, we propose a new segmentation method that combines hybrid dynamical systems theory and Bayesian non-parametric statistics to automatically segment demonstrations. Transition State Clustering (TSC) models demonstrations as noisy realizations of a switched linear dynamical system, and learns spatially and temporally consistent transition events across demonstrations. TSC uses a hierarchical Dirichlet Process Gaussian Mixture Model to avoid having to select the number of segments a priori. After a series of merging and pruning steps, the algorithm adaptively optimizes the number of segments. In a synthetic case study with two linear dynamical regimes, where demonstrations are corrupted with noise and temporal variations, TSC finds a segmentation up to 20% more accurate than GMM-based alternatives. On 67 recordings of surgical needle passing and suturing tasks from the JIGSAWS surgical training dataset, supplemented with manually annotated visual features, TSC finds 83% of the needle passing segments and 73% of the suturing segments found by human experts. Qualitatively, TSC also identifies transitions overlooked by human annotators.
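The core non-parametric idea above — clustering candidate transition states without fixing the number of clusters a priori — can be illustrated with a truncated Dirichlet Process GMM. This is a minimal sketch on synthetic 2-D "transition state" coordinates, not the paper's implementation; it uses scikit-learn's `BayesianGaussianMixture` with a stick-breaking (Dirichlet process) prior, which leaves the weights of unneeded mixture components near zero.

```python
# Hypothetical sketch: DP-GMM clustering of candidate transition states,
# so the number of spatial clusters is inferred rather than preset.
import numpy as np
from sklearn.mixture import BayesianGaussianMixture

rng = np.random.default_rng(0)
# Synthetic transition states: two well-separated spatial clusters,
# standing in for transition events pooled across demonstrations.
cluster_a = rng.normal(loc=[0.0, 0.0], scale=0.1, size=(50, 2))
cluster_b = rng.normal(loc=[1.0, 1.0], scale=0.1, size=(50, 2))
states = np.vstack([cluster_a, cluster_b])

# Truncated DP mixture: up to 10 components, but the stick-breaking
# prior concentrates the weight on the components the data support.
dpgmm = BayesianGaussianMixture(
    n_components=10,
    weight_concentration_prior_type="dirichlet_process",
    weight_concentration_prior=0.1,
    random_state=0,
).fit(states)

labels = dpgmm.predict(states)
n_effective = len(set(labels))
print("effective clusters:", n_effective)
```

For these well-separated toy clusters the model assigns the two groups to different components; on real kinematic data the paper's hierarchical variant additionally clusters transitions in time, which this sketch omits.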