Full text

Turn on search term navigation

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

The integration and emerging adoption of machine learning (ML) algorithms in healthcare big data has revolutionized clinical decision-making, predictive analytics, and real-time medical diagnostics. However, the application of machine learning in healthcare big data faces computational challenges, particularly in efficiently processing and training on large-scale, high-velocity data generated by healthcare organizations worldwide. In response to these issues, this study critically reviews and examines current state-of-the-art advancements in machine learning algorithms and big data frameworks within healthcare analytics, with a particular emphasis on solutions addressing data volume and velocity. The reviewed literature is categorized into three key areas: (1) efficient techniques, arithmetic operations, and dimensionality reduction; (2) advanced and specialized processing hardware; and (3) clustering and parallel processing methods. Key research gaps and open challenges are identified based on the evaluation of the literature across these categories, and important future research directions are discussed in detail. Among the several proposed solutions are the utilization of federated learning and decentralized data processing, as well as efficient parallel processing through big data frameworks such as Apache Spark, neuromorphic computing, and multi-swarm large-scale optimization algorithms; these highlight the importance of interdisciplinary innovations in algorithm design, hardware efficiency, and distributed computing frameworks, which collectively contribute to faster, more accurate, and resource-efficient AI-driven healthcare big data analytics and applications. This research supports the UNSDG 3 (Good Health and Well-Being) and UNSDG 9 (Industry, Innovation and Infrastructure) by integration of machine learning in healthcare big data and promoting product innovation in the healthcare industry, respectively.

Details

Title
Recent Trends in Machine Learning for Healthcare Big Data Applications: Review of Velocity and Volume Challenges
Author
Khudhur, Doaa Yaseen 1 ; Shibghatullah Abdul Samad 2   VIAFID ORCID Logo  ; Shaker Khalid 3 ; Abdul Latif Aliza 2   VIAFID ORCID Logo  ; Muda, Zakaria Che 4   VIAFID ORCID Logo 

 Department of Informatics, College of Computing & Information Technology, Universiti Tenaga Nasional, Putrajaya Campus, Jalan IKRAM-UNITEN, Kajang 43000, Selangor, Malaysia, Departments of Artificial Intelligence & Information Technology, College of Computer Science and Information Technology, University of Anbar, Ramadi 31001, Iraq 
 Department of Informatics, College of Computing & Information Technology, Universiti Tenaga Nasional, Putrajaya Campus, Jalan IKRAM-UNITEN, Kajang 43000, Selangor, Malaysia 
 Departments of Artificial Intelligence & Information Technology, College of Computer Science and Information Technology, University of Anbar, Ramadi 31001, Iraq 
 Faculty of Engineering and Quantity Surveying, INTI International University, Nilai 71800, Negeri Sembilan, Malaysia; [email protected] 
First page
772
Publication year
2025
Publication date
2025
Publisher
MDPI AG
e-ISSN
19994893
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3286250375
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.