Full text

Turn on search term navigation

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

In this paper, a mutually enhanced modeling method (MEMe) is presented for human pose estimation, which focuses on enhancing lightweight model performance, but with low complexity. To obtain higher accuracy, a traditional model scale is largely expanded with heavy deployment difficulties. However, for a more lightweight model, there is a large performance gap compared to the former; thus, an urgent need for a way to fill it. Therefore, we propose a MEMe to reconstruct a lightweight baseline model, EffBase transferred intuitively from EfficientDet, into the efficient and effective pose (EEffPose) net, which contains three mutually enhanced modules: the Enhanced EffNet (EEffNet) backbone, the total fusion neck (TFNeck), and the final attention head (FAHead). Extensive experiments on COCO and MPII benchmarks show that our MEMe-based models reach state-of-the-art performances, with limited parameters. Specifically, in the same conditions, our EEffPose-P0 with 256 × 192 can use only 8.98 M parameters to achieve 75.4 AP on the COCO val set, which outperforms HRNet-W48, but with only 14% of its parameters.

Details

Title
MEMe: A Mutually Enhanced Modeling Method for Efficient and Effective Human Pose Estimation
Author
Li, Jie 1   VIAFID ORCID Logo  ; Wang, Zhixing 2 ; Qi, Bo 1   VIAFID ORCID Logo  ; Zhang, Jianlin 1   VIAFID ORCID Logo  ; Hu, Yang 3 

 Key Laboratory of Optical Engineering, Chinese Academy of Sciences, Chengdu 610209, China; [email protected] (J.L.); [email protected] (Z.W.); [email protected] (J.Z.); [email protected] (H.Y.); Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu 610209, China; University of Chinese Academy of Sciences, Beijing 100039, China 
 Key Laboratory of Optical Engineering, Chinese Academy of Sciences, Chengdu 610209, China; [email protected] (J.L.); [email protected] (Z.W.); [email protected] (J.Z.); [email protected] (H.Y.); Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu 610209, China; University of Chinese Academy of Sciences, Beijing 100039, China; School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 610209, China 
 Key Laboratory of Optical Engineering, Chinese Academy of Sciences, Chengdu 610209, China; [email protected] (J.L.); [email protected] (Z.W.); [email protected] (J.Z.); [email protected] (H.Y.); Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu 610209, China 
First page
632
Publication year
2022
Publication date
2022
Publisher
MDPI AG
e-ISSN
14248220
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2621377033
Copyright
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.