Full text

Turn on search term navigation

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

The growing demand for telemedicine has highlighted the need for automated healthcare services, particularly in medical question classification. This study presents a deep learning model designed to address key challenges in telemedicine, including class imbalance and accurate routing of Arabic medical questions to the correct specialties. The model combines AraBERTv0.2-Twitter, fine-tuned for informal Arabic, with Bidirectional Long Short-Term Memory (BiLSTM) networks to capture deep semantic relationships in medical text. We used a labeled dataset of 5000 Arabic consultation records from Altibbi, covering five key medical specialties selected for their clinical relevance and frequency. The data underwent preprocessing to remove noise and normalize text. We employed stratified sampling to ensure representative distribution across the selected medical specialties. We evaluate multiple models using macro precision, macro recall, macro F1-score, weighted F1-score, and G-Mean. Our results demonstrate that DeepSMOTE combined with cross-entropy loss achieves the best performance. The findings offer statistically significant improvements and have practical implications for improving screening and patient routing in telemedicine platforms.

Details

Title
Enhancing the Classification of Imbalanced Arabic Medical Questions Using DeepSMOTE
Author
Al-Smadi, Bushra 1 ; Hammo Bassam 2   VIAFID ORCID Logo  ; Faris Hossam 3   VIAFID ORCID Logo  ; Castillo, Pedro A 4   VIAFID ORCID Logo 

 King Abdullah II School of Information Technology, The University of Jordan, Amman 11942, Jordan; [email protected] (B.A.-S.); [email protected] (H.F.) 
 King Abdullah II School of Information Technology, The University of Jordan, Amman 11942, Jordan; [email protected] (B.A.-S.); [email protected] (H.F.), King Hussein School of Computing Sciences, Princess Sumaya University for Technology, Amman 11941, Jordan 
 King Abdullah II School of Information Technology, The University of Jordan, Amman 11942, Jordan; [email protected] (B.A.-S.); [email protected] (H.F.), Department of Computer Engineering, Automatics and Robotics, Higher Technical School of Computer Sciences and Telecommunications Engineering (ETSIIT)-Communication and Information Technologies Researching Centre (CITIC), University of Granada, 18071 Granada, Spain; [email protected] 
 Department of Computer Engineering, Automatics and Robotics, Higher Technical School of Computer Sciences and Telecommunications Engineering (ETSIIT)-Communication and Information Technologies Researching Centre (CITIC), University of Granada, 18071 Granada, Spain; [email protected] 
First page
77
Publication year
2025
Publication date
2025
Publisher
MDPI AG
e-ISSN
26732688
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3194484908
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.