Full text

Turn on search term navigation

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

In recent years, data augmentation techniques have become the predominant approach for addressing highly imbalanced classification problems in machine learning. Algorithms such as the Synthetic Minority Over-sampling Technique (SMOTE) and Conditional Tabular Generative Adversarial Network (CTGAN) have proven effective in synthesizing minority class samples. However, these methods often introduce distributional bias and noise, potentially leading to model overfitting, reduced predictive performance, increased computational costs, and elevated cybersecurity risks. To overcome these limitations, we propose a novel architecture, FADEL, which integrates feature-type awareness with a supervised discretization strategy. FADEL introduces a unique feature augmentation ensemble framework that preserves the original data distribution by concurrently processing continuous and discretized features. It dynamically routes these feature sets to their most compatible base models, thereby improving minority class recognition without the need for data-level balancing or augmentation techniques. Experimental results demonstrate that FADEL, solely leveraging feature augmentation without any data augmentation, achieves a recall of 90.8% and a G-mean of 94.5% on the internal test set from Kaohsiung Chang Gung Memorial Hospital in Taiwan. On the external validation set from Kaohsiung Medical University Chung-Ho Memorial Hospital, it maintains a recall of 91.9% and a G-mean of 86.7%. These results outperform conventional ensemble methods trained on CTGAN-balanced datasets, confirming the superior stability, computational efficiency, and cross-institutional generalizability of the FADEL architecture. Altogether, FADEL uses feature augmentation to offer a robust and practical solution to extreme class imbalance, outperforming mainstream data augmentation-based approaches.

Details

Title
FADEL: Ensemble Learning Enhanced by Feature Augmentation and Discretization
Author
Chuan-Sheng, Hung 1   VIAFID ORCID Logo  ; Lin Chun-Hung Richard 2   VIAFID ORCID Logo  ; Shi-Huang, Chen 3 ; You-Cheng, Zheng 4   VIAFID ORCID Logo  ; Cheng-Han, Yu 1 ; Cheng-Wei, Hung 1 ; Ting-Hsin, Huang 4   VIAFID ORCID Logo  ; Tsai Jui-Hsiu 5   VIAFID ORCID Logo 

 Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung 804, Taiwan; [email protected] (C.-S.H.); [email protected] (Y.-C.Z.); [email protected] (C.-H.Y.); [email protected] (C.-W.H.); [email protected] (T.-H.H.) 
 Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung 804, Taiwan; [email protected] (C.-S.H.); [email protected] (Y.-C.Z.); [email protected] (C.-H.Y.); [email protected] (C.-W.H.); [email protected] (T.-H.H.), Artificial Intelligence Research and Promotion Center, National Sun Yat-sen University, Kaohsiung 804, Taiwan 
 Department of Computer Science and Information Engineering, Shu-Te University, Kaohsiung 824, Taiwan; [email protected] 
 Department of Computer Science and Engineering, National Sun Yat-sen University, Kaohsiung 804, Taiwan; [email protected] (C.-S.H.); [email protected] (Y.-C.Z.); [email protected] (C.-H.Y.); [email protected] (C.-W.H.); [email protected] (T.-H.H.), Division of Cardiology, Department of Internal Medicine, Kaohsiung Chang Gung Memorial Hospital, Kaohsiung 833, Taiwan 
 School of Medicine, Tzu Chi University, Hualien 970, Taiwan, Department of Psychiatry, Dalin Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, Chia-Yi 622, Taiwan 
First page
827
Publication year
2025
Publication date
2025
Publisher
MDPI AG
e-ISSN
23065354
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3243983208
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.