TY - GEN
T1 - The CoSTAR Block Stacking Dataset
T2 - 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2019
AU - Hundt, Andrew
AU - Jain, Varun
AU - Lin, Chia Hung
AU - Paxton, Chris
AU - Hager, Gregory D.
N1 - Funding Information:
We thank Chunting Jiao for his assistance with data collection. This material is based upon work supported by the National Science Foundation under NSF NRI Grant Award No. 1637949.
Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
AB - A robot can now grasp an object more effectively than ever before, but once it has the object what happens next? We show that a mild relaxation of the task and workspace constraints implicit in existing object grasping datasets can cause neural network based grasping algorithms to fail on even a simple block stacking task when executed under more realistic circumstances. To address this, we introduce the JHU CoSTAR Block Stacking Dataset (BSD), where a robot interacts with 5.1 cm colored blocks to complete an order-fulfillment style block stacking task. It contains dynamic scenes and real time-series data in a less constrained environment than comparable datasets. There are nearly 12,000 stacking attempts and over 2 million frames of real data. We discuss the ways in which this dataset provides a valuable resource for a broad range of other topics of investigation. We find that hand-designed neural networks that work on prior datasets do not generalize to this task. Thus, to establish a baseline for this dataset, we demonstrate an automated search of neural network based models using a novel multiple-input HyperTree MetaModel, and find a final model which makes reasonable 3D pose predictions for grasping and stacking on our dataset. The CoSTAR BSD, code, and instructions are available at sites.google.com/site/costardataset.
UR - http://www.scopus.com/inward/record.url?scp=85081160051&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85081160051&partnerID=8YFLogxK
U2 - 10.1109/IROS40897.2019.8967784
DO - 10.1109/IROS40897.2019.8967784
M3 - Conference contribution
AN - SCOPUS:85081160051
T3 - IEEE International Conference on Intelligent Robots and Systems
SP - 1797
EP - 1804
BT - 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2019
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 3 November 2019 through 8 November 2019
ER -