Full text

Turn on search term navigation

Copyright © 2025 Yiyang Dong et al. Journal of Engineering published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License (the “License”), which permits use, distribution and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/

Abstract

Pose estimation of objects is one of the main tasks for robotics to understand their environments and for monitoring tasks of grasping and manipulation of objects. In this paper, we present an experimental study of CAD-based object pose estimation to detect object locations and estimate their orientations using the prior defined models. Specifically, our study pipeline is developed for RGB-D sensors and consists of three steps. First, we incorporate an objection detection method using RGB images, which can result in the definition of the bounding boxes, instance masks, and class labels of detected objects with missing pose information. Then, we leverage the depth values of the masked pixels and known camera intrinsics to generate point clouds of objects. Finally, we align CAD models, defined in canonical poses, to the scan objects, achieving pose estimation and complete representation for the objects. Given that there may exist many challenges for such alignment task such as scale differences, partial overlap, noise, and outliers, we introduce two alignment approaches, namely, scale iterative closest point (SICP) and coherent point drift (CPD), and present a comprehensive experimental study of their accuracy, robustness, and computational efficiency. In particular, we observe that the methods are sensitive to the initial relative poses of objects. To address this problem, we introduce a multipose initialization scheme to improve their robustness. Our experimental results show that both methods can achieve accurate alignment; however, scale ICP (SICP) is time-efficient, while CPD is more robust to noise and occlusions. Our study demonstrates the feasibility of using RGB-D sensors, an object detection module, and point cloud alignment methods for accurate object detection and pose estimation.

Details

Title
Experimental Study of CAD-Based Scaled Alignment and Object Pose Estimation for RGB-D Sensor
Author
Dong, Yiyang 1   VIAFID ORCID Logo  ; Liang, Minghui 1   VIAFID ORCID Logo  ; Payandeh, Shahram 1   VIAFID ORCID Logo 

 Networked Robotics and Sensing Laboratory School of Engineering Science Simon Fraser University Burnaby, V5A 1S6 Canada 
Editor
Weili Zeng
Publication year
2025
Publication date
2025
Publisher
John Wiley & Sons, Inc.
ISSN
23144912
e-ISSN
23144904
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3169861595
Copyright
Copyright © 2025 Yiyang Dong et al. Journal of Engineering published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License (the “License”), which permits use, distribution and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/