Abstract

Crop disease detection and crop baking stage judgement require large image data to improve accuracy. However, the existing crop disease image datasets have high asymmetry, and the poor baking environment leads to image acquisition difficulties and colour distortion. Therefore, we explore the potential of the self-attention mechanism on crop image datasets and propose an innovative crop image data-enhancement method for recurrent generative adversarial networks (GANs) fused with the self-attention mechanism to significantly enhance the perception and information capture capabilities of recurrent GANs. By introducing the self-attention mechanism module, the cycle-consistent GAN (CycleGAN) is more adept at capturing the internal correlations and dependencies of image data, thus more effectively capturing the critical information among image data. Furthermore, we propose a new enhanced loss function for crop image data to optimise the model performance and meet specific task requirements. We further investigate crop image data enhancement in different contexts to validate the performance and stability of the model. The experimental results show that, the peak signal-to-noise ratio of the SM-CycleGAN for tobacco images and tea leaf disease images are improved by 2.13% and 3.55%, and the structural similarity index measure is improved by 1.16% and 2.48% compared to CycleGAN, respectively.

Details

Title
SM-CycleGAN: crop image data enhancement method based on self-attention mechanism CycleGAN
Author
Liu, Dian 1 ; Cao, Yang 1 ; Yang, Jing 2 ; Wei, Jianyu 3 ; Zhang, Jili 3 ; Rao, Chenglin 1 ; Wu, Banghong 1 ; Zhang, Dabin 1 

 Guizhou University, School of Mechanical Engineering, Guiyang, China (GRID:grid.443382.a) (ISNI:0000 0004 1804 268X) 
 Guizhou University, School of Mechanical Engineering, Guiyang, China (GRID:grid.443382.a) (ISNI:0000 0004 1804 268X); Guizhou University, State Key Laboratory of Public Big Data, Guiyang, China (GRID:grid.443382.a) (ISNI:0000 0004 1804 268X) 
 China Tobacco Guangxi Industrial Co., Ltd, Nanning, China (GRID:grid.468111.b) (ISNI:0000 0004 5899 6074) 
Pages
9277
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3048575775
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.