Full Text

Turn on search term navigation

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

This paper addresses the problem of instance-level 6DoF pose estimation from a single RGBD image in an indoor scene. Many recent works have shown that a two-stage network, which first detects the keypoints and then regresses the keypoints for 6d pose estimation, achieves remarkable performance. However, the previous methods concern little about channel-wise attention and the keypoints are not selected by comprehensive use of RGBD information, which limits the performance of the network. To enhance RGB feature representation ability, a modular Split-Attention block that enables attention across feature-map groups is proposed. In addition, by combining the Oriented FAST and Rotated BRIEF (ORB) keypoints and the Farthest Point Sample (FPS) algorithm, a simple but effective keypoint selection method named ORB-FPS is presented to avoid the keypoints appear on the non-salient regions. The proposed algorithm is tested on the Linemod and the YCB-Video dataset, the experimental results demonstrate that our method outperforms the current approaches, achieves ADD(S) accuracy of 94.5% on the Linemod dataset and 91.4% on the YCB-Video dataset.

Details

Title
A 3D Keypoints Voting Network for 6DoF Pose Estimation in Indoor Scene
Author
Liu, Huikai 1 ; Liu, Gaorui 2 ; Zhang, Yue 1 ; Linjian Lei 3   VIAFID ORCID Logo  ; Xie, Hui 1   VIAFID ORCID Logo  ; Li, Yan 1 ; Sun, Shengli 2 

 Shanghai Institute of Technical Physics, Chinese Academy of Sciences, Shanghai 200083, China; [email protected] (H.L.); [email protected] (G.L.); [email protected] (Y.Z.); [email protected] (L.L.); [email protected] (H.X.); [email protected] (Y.L.); School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory of Intelligent Infrared Perception, Chinese Academy of Sciences, Shanghai 200083, China 
 Shanghai Institute of Technical Physics, Chinese Academy of Sciences, Shanghai 200083, China; [email protected] (H.L.); [email protected] (G.L.); [email protected] (Y.Z.); [email protected] (L.L.); [email protected] (H.X.); [email protected] (Y.L.); Key Laboratory of Intelligent Infrared Perception, Chinese Academy of Sciences, Shanghai 200083, China 
 Shanghai Institute of Technical Physics, Chinese Academy of Sciences, Shanghai 200083, China; [email protected] (H.L.); [email protected] (G.L.); [email protected] (Y.Z.); [email protected] (L.L.); [email protected] (H.X.); [email protected] (Y.L.); Key Laboratory of Intelligent Infrared Perception, Chinese Academy of Sciences, Shanghai 200083, China; School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China 
First page
230
Publication year
2021
Publication date
2021
Publisher
MDPI AG
e-ISSN
20751702
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2584401341
Copyright
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.