Content area
The World Health Organisation (WHO) has identified infectious diseases, particularly COVID-19, tuberculosis, malaria, and measles, as significant global health challenges in the past 5 years. The COVID-19 pandemic exposed critical limitations in traditional disease tracking systems, such as the lack of integrated data visualization, co-monitoring, and real-time analytics, leading to delayed and often ineffective public health responses. In this context, Big Data Analytics (BDA) offers significant potential for improving infectious disease mitigation through predictive modelling, mapping, tracking, and real-time monitoring. This study systematically reviews the role of BDA in monitoring and predicting epidemic and pandemic infections using the PRISMA methodology and quality appraisal techniques to provide comprehensive insights into its healthcare applications. From an initial pool of 846 articles from Scopus, PubMed, Science Direct, IEEE, ProQuest, and Springer, 30 high-quality studies were selected for in-depth analysis. The review identifies four key predictive models—epidemiological, time series, machine learning, and deep learning—and seven analytical techniques, including SIR, SEIR, regression analysis, random forest, support vector machines, auto-regressive methods, and deep learning. BDA supports infectious disease control by processing diverse healthcare data and leveraging technologies like IoT and social media to enhance diagnosis, clinical decision-making, and surveillance. However, a key limitation is predictive models’ limited reliability and generalizability in real-world settings, mainly due to low-quality, noisy, and incomplete data. For instance, during early COVID-19 phases, inconsistent case reporting hindered accurate forecasting and timely response efforts.
Details
Data analysis;
Health surveillance;
Malaria;
Scientific visualization;
Big Data;
Support vector machines;
Health care;
Prediction models;
Public health;
Disease control;
Telemedicine;
Tracking systems;
Laboratories;
Regression analysis;
Literature reviews;
Deep learning;
Machine learning;
Real time;
Decision trees;
Tuberculosis;
Reliability;
Surveillance;
Medical diagnosis;
Disease;
Mapping;
Data quality;
Delayed;
Social media;
Predictions;
Pandemics;
COVID-19;
Generalizability;
Tracking;
Responses;
Disease prevention;
Data processing;
Visualization;
Health services;
Time series;
Epidemiology;
Measles;
Decision making;
Mitigation;
Medical decision making;
Clinical decision making;
Forecasting;
Epidemics;
Machinery
1 Universiti Sains Islam Malaysia, Faculty of Science and Technology, Bandar Baru Nilai, Malaysia (GRID:grid.462995.5) (ISNI:0000 0001 2218 9236)
2 Universiti Sains Islam Malaysia, Faculty of Science and Technology, Bandar Baru Nilai, Malaysia (GRID:grid.462995.5) (ISNI:0000 0001 2218 9236); CyberSecurity and Systems (CSS) Research Unit, Faculty of Science and Technology, Bandar Baru Nilai, Malaysia (GRID:grid.462995.5)
3 Universiti Brunei Darussalam, Institute of Applied Data Analytics, Gadong, Brunei (GRID:grid.440600.6) (ISNI:0000 0001 2170 1621); Universiti Brunei Darussalam, School of Digital Science, Gadong, Brunei (GRID:grid.440600.6) (ISNI:0000 0001 2170 1621)
4 University of Birmingham, Rheumatology Research Group, Institute of Inflammation and Ageing, Birmingham, UK (GRID:grid.6572.6) (ISNI:0000 0004 1936 7486)