Publication Details

Towards Deep Learning-based 6D Bin Pose Estimation in 3D Scans

GAJDOŠECH Lukáš, KOCUR Viktor, STUCHLÍK Martin, HUDEC Lukáš and MADARAS Martin. Towards Deep Learning-based 6D Bin Pose Estimation in 3D Scans. In: Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4 VISAPP: VISAPP. Setubal: SciTePress - Science and Technology Publications, 2022, pp. 545-552. ISBN 978-989-758-555-5. Available from: https://www.scitepress.org/Link.aspx?doi=10.5220/0010878200003124
Czech title
Směrem k bázi hlubokého učení 6D BIN založeného na 3D skenování
Type
conference paper
Language
english
Authors
Gajdošech Lukáš, Mgr. (FMFI UK)
Kocur Viktor, Ing., Ph.D. (DCGM FIT BUT)
Stuchlík Martin, Mgr. (Skeletex Research, s.r.o.)
Hudec Lukáš, Ing., Ph.D. (FIIT STU)
Madaras Martin, Ing., Ph.D. (FMFI UK)
URL
Keywords

Computer Vision, Bin Pose Estimation, 6D Pose Estimation, Deep Learning, Point Clouds

Abstract

An automated robotic system needs to be as robust as possible and fail-safe in general while having relatively high precision and repeatability. Although deep learning-based methods are becoming research standard on how to approach 3D scan and image processing tasks, the industry standard for processing this data is still analytically-based. Our paper claims that analytical methods are less robust and harder for testing, updating, and maintaining. This paper focuses on a specific task of 6D pose estimation of a bin in 3D scans. Therefore, we present a high-quality dataset composed of synthetic data and real scans captured by a structured-light scanner with precise annotations. Additionally, we propose two different methods for 6D bin pose estimation, an analytical method as the industrial standard and a baseline data-driven method. Both approaches are cross-evaluated, and our experiments show that augmenting the training on real scans with synthetic data improves our proposed data-driven neural model. This position paper is preliminary, as proposed methods are trained and evaluated on a relatively small initial dataset which we plan to extend in the future.

Published
2022
Pages
545-552
Proceedings
Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4 VISAPP: VISAPP
Conference
17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP), Online, AT
ISBN
978-989-758-555-5
Publisher
SciTePress - Science and Technology Publications
Place
Setubal, PT
DOI
UT WoS
000777569400058
BibTeX
@INPROCEEDINGS{FITPUB12867,
   author = "Luk\'{a}\v{s} Gajdo\v{s}ech and Viktor Kocur and Martin Stuchl\'{i}k and Luk\'{a}\v{s} Hudec and Martin Madaras",
   title = "Towards Deep Learning-based 6D Bin Pose Estimation in 3D Scans",
   pages = "545--552",
   booktitle = "Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4 VISAPP: VISAPP",
   year = 2022,
   location = "Setubal, PT",
   publisher = "SciTePress - Science and Technology Publications",
   ISBN = "978-989-758-555-5",
   doi = "10.5220/0010878200003124",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/12867"
}
Back to top