Full text

Turn on search term navigation

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

The segmentation of surface defects in lithium batteries is crucial for enhancing the overall quality of the production process. However, the severe foreground–background imbalance in surface images of lithium batteries, along with the irregular shapes and random distribution of foreground regions, poses significant challenges for defect segmentation. Based on these observations, this paper focuses on the separation of foreground and background in surface defect images of lithium batteries and proposes a novel Mask Space Optimization Transformer (MSOFormer) for semantic segmentation of these images. Specifically, the Mask Boundary Loss (MBL) module in our model provides more efficient supervision during training to enhance the accuracy of the mask computation within the mask attention mechanism, thereby improving the model’s performance in separating foreground and background. Additionally, the Dynamic Spatial Query (DSQ) module allocates spatial information of the image to each query, enhancing the model’s sensitivity to the positions of small foreground targets in various scenes. The Efficient Pixel Decoder (EPD) ensures deformable receptive fields for irregularly shaped foregrounds while further improving the model’s performance and efficiency. Experimental results demonstrate that our method outperforms other state-of-the-art methods in terms of mean Intersection over Union (mIoU). Specifically, our approach achieves an mIoU of 84.18% on the lithium battery surface defect test set and 85.53% and 87.05% mIoUs on two publicly available defect test sets with similar defect characteristics to lithium batteries.

Details

Title
Mask-Space Optimized Transformer for Semantic Segmentation of Lithium Battery Surface Defect Images
Author
Sun, Daozong 1 ; Chen, Jiasi 2   VIAFID ORCID Logo  ; Wu, Peiwen 2 ; Pan, Yucheng 2 ; Zhong, Hongsheng 2   VIAFID ORCID Logo  ; Deng, Zihao 2 ; Xue, Xiuyun 1 

 College of Electronic Engineering (College of AI), South China Agricultural University, Guangzhou 510642, China; [email protected] (D.S.); [email protected] (J.C.); [email protected] (P.W.); [email protected] (Y.P.); [email protected] (H.Z.); [email protected] (Z.D.); National Citrus Industry Technical System Machinery Research Office, Guangzhou 510642, China; Guangzhou Agricultural Information Acquisition and Application Key Laboratory, Guangzhou 510642, China; Guangdong Provincial Agricultural Information Monitoring Engineering Technology Research Center, Guangzhou 510642, China 
 College of Electronic Engineering (College of AI), South China Agricultural University, Guangzhou 510642, China; [email protected] (D.S.); [email protected] (J.C.); [email protected] (P.W.); [email protected] (Y.P.); [email protected] (H.Z.); [email protected] (Z.D.) 
First page
3627
Publication year
2024
Publication date
2024
Publisher
MDPI AG
e-ISSN
22277390
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3133312661
Copyright
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.