Content area

Abstract

In recent years, with the rapid advancement of deep learning technologies such as generative adversarial networks (GANs), deepfake technology has become increasingly sophisticated. As a result, the generated fake images are becoming more difficult to visually distinguish from real ones. Existing deepfake detection methods primarily rely on training models with specific datasets. However, these models often suffer from limited generalization when processing images of unknown origin or across domains, leading to a significant decrease in detection accuracy. To address this issue, this paper proposes a deepfake image-detection network based on feature aggregation and enhancement. The key innovation of the proposed method lies in the integration of two modules: the Feature Aggregation Module (FAM) and the Attention Enhancement Module (AEM). The FAM effectively aggregates both deep semantic information and shallow detail features through a multi-scale feature-fusion mechanism, overcoming the limitations of traditional methods that rely on a single-level feature. Meanwhile, the AEM enhances the network’s ability to capture subtle forgery traces by incorporating attention mechanisms and filtering techniques, significantly boosting the model’s efficiency in processing complex information. The experimental results demonstrate that the proposed method achieves significant improvements across all evaluation metrics. Specifically, on the StarGAN dataset, the model attained outstanding performance, with accuracy (Acc) and average precision (AP) both reaching 100%. In cross-dataset testing, the proposed method exhibited strong generalization ability, raising the overall average accuracy to 87.0% and average precision to 92.8%, representing improvements of 5.2% and 6.7%, respectively, compared to existing state-of-the-art methods. These results show that the proposed method can not only achieve optimal performance on data with the same distribution, but also demonstrate strong generalization ability in cross-domain detection tasks.

Details

1009240
Title
Hierarchical Feature Fusion and Enhanced Attention Mechanism for Robust GAN-Generated Image Detection
Author
Zhang, Weinan 1 ; Cui Sanshuai 1   VIAFID ORCID Logo  ; Zhang, Qi 1 ; Chen Biwei 2   VIAFID ORCID Logo  ; Zeng, Hui 3 ; Zhong Qi 1   VIAFID ORCID Logo 

 Faculty of Data Science, City University of Macau, Macau SAR, China; [email protected] (W.Z.); [email protected] (Q.Z.); [email protected] (Q.Z.) 
 Belt and Road School, Beijing Normal University at Zhuhai, Zhuhai 519088, China; [email protected] 
 School of Computer Science and Technology, Southwest University of Science and Technology, Mianyang 621010, China 
Publication title
Volume
13
Issue
9
First page
1372
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
22277390
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-04-23
Milestone dates
2025-03-03 (Received); 2025-04-16 (Accepted)
Publication history
 
 
   First posting date
23 Apr 2025
ProQuest document ID
3203209798
Document URL
https://www.proquest.com/scholarly-journals/hierarchical-feature-fusion-enhanced-attention/docview/3203209798/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-05-13
Database
ProQuest One Academic