TY - GEN
T1 - Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers
AU - Li, Zhaoshuo
AU - Liu, Xingtong
AU - Drenkow, Nathan
AU - Ding, Andy
AU - Creighton, Francis X.
AU - Taylor, Russell H.
AU - Unberath, Mathias
N1 - Publisher Copyright:
© 2021 IEEE
PY - 2021
Y1 - 2021
N2 - Stereo depth estimation relies on optimal correspondence matching between pixels on epipolar lines in the left and right images to infer depth. In this work, we revisit the problem from a sequence-to-sequence correspondence perspective to replace cost volume construction with dense pixel matching using position information and attention. This approach, named STereo TRansformer (STTR), has several advantages: It 1) relaxes the limitation of a fixed disparity range, 2) identifies occluded regions and provides confidence estimates, and 3) imposes uniqueness constraints during the matching process. We report promising results on both synthetic and real-world datasets and demonstrate that STTR generalizes across different domains, even without fine-tuning.
AB - Stereo depth estimation relies on optimal correspondence matching between pixels on epipolar lines in the left and right images to infer depth. In this work, we revisit the problem from a sequence-to-sequence correspondence perspective to replace cost volume construction with dense pixel matching using position information and attention. This approach, named STereo TRansformer (STTR), has several advantages: It 1) relaxes the limitation of a fixed disparity range, 2) identifies occluded regions and provides confidence estimates, and 3) imposes uniqueness constraints during the matching process. We report promising results on both synthetic and real-world datasets and demonstrate that STTR generalizes across different domains, even without fine-tuning.
UR - http://www.scopus.com/inward/record.url?scp=85118850161&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85118850161&partnerID=8YFLogxK
U2 - 10.1109/ICCV48922.2021.00614
DO - 10.1109/ICCV48922.2021.00614
M3 - Conference contribution
AN - SCOPUS:85118850161
T3 - Proceedings of the IEEE International Conference on Computer Vision
SP - 6177
EP - 6186
BT - Proceedings - 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 18th IEEE/CVF International Conference on Computer Vision, ICCV 2021
Y2 - 11 October 2021 through 17 October 2021
ER -