Content area

Abstract

In recent years, the open data initiative has led to the willingness of many governments, researchers, and organizations to share their data and make it publicly available. Healthcare, disease, and epidemiological data, such as privacy statistics on patients who have suffered from epidemic diseases such as the Coronavirus disease 2019 (COVID-19), are examples of open big data. Therefore, huge volumes of valuable data have been generated and collected at high speed from a wide variety of rich data sources. Analyzing these open big data can be of social benefit. For example, people gain a better understanding of disease by analyzing and mining disease statistics, which can inspire them to participate in disease prevention, detection, control, and combat. Visual representation further improves data understanding and corresponding results for analysis and mining, as a picture is worth a thousand words. In this paper, we present a visual data science solution for the visualization and visual analysis of large sequence data. These ideas are illustrated by the visualization and visual analysis of sequences of real epidemiological data of COVID-19. Through our solution, we enable users to visualize the epidemiological data of COVID-19 over time. It also allows people to visually analyze data and discover relationships between popular features associated with COVID-19 cases. The effectiveness of our visual data science solution in improving the user experience of visualization and visual analysis of large sequence data is demonstrated by the real-life evaluation of these sequenced epidemiological data of COVID-19.

Details

1009240
Business indexing term
Location
Title
A Machine-Learning-Based Data Science Framework for Effectively and Efficiently Processing, Managing, and Visualizing Big Sequential Data †
Author
Cuzzocrea Alfredo 1   VIAFID ORCID Logo  ; Islam, Belmerabet 2   VIAFID ORCID Logo  ; Abderraouf, Hafsaoui 2   VIAFID ORCID Logo  ; Leung, Carson K 3   VIAFID ORCID Logo 

 iDEA Lab, University of Calabria, 87036 Rende, Italy; [email protected] (I.B.); [email protected] (A.H.), Department of Computer Science, University of Paris City, 75006 Paris, France 
 iDEA Lab, University of Calabria, 87036 Rende, Italy; [email protected] (I.B.); [email protected] (A.H.) 
 Department of Computer Science, University of Manitoba, Winnipeg, MB R3T 2N2, Canada; [email protected] 
Publication title
Computers; Basel
Volume
14
Issue
7
First page
276
Number of pages
38
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
2073431X
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-07-14
Milestone dates
2025-01-31 (Received); 2025-06-11 (Accepted)
Publication history
 
 
   First posting date
14 Jul 2025
ProQuest document ID
3233123553
Document URL
https://www.proquest.com/scholarly-journals/machine-learning-based-data-science-framework/docview/3233123553/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-07-25
Database
3 databases
  • Coronavirus Research Database
  • ProQuest One Academic
  • ProQuest One Academic