Abstract

Whereas accelerated attention beclouded early stages of the coronavirus spread, knowledge of actual pathogenicity and origin of possible sub-strains remained unclear. By harvesting the Global initiative on Sharing All Influenza Data (GISAID) database (https://www.gisaid.org/), between December 2019 and January 15, 2021, a total of 8864 human SARS-CoV-2 complete genome sequences processed by gender, across 6 continents (88 countries) of the world, Antarctica exempt, were analyzed. We hypothesized that data speak for itself and can discern true and explainable patterns of the disease. Identical genome diversity and pattern correlates analysis performed using a hybrid of biotechnology and machine learning methods corroborate the emergence of inter- and intra- SARS-CoV-2 sub-strains transmission and sustain an increase in sub-strains within the various continents, with nucleotide mutations dynamically varying between individuals in close association with the virus as it adapts to its host/environment. Interestingly, some viral sub-strain patterns progressively transformed into new sub-strain clusters indicating varying amino acid, and strong nucleotide association derived from same lineage. A novel cognitive approach to knowledge mining helped the discovery of transmission routes and seamless contact tracing protocol. Our classification results were better than state-of-the-art methods, indicating a more robust system for predicting emerging or new viral sub-strain(s). The results therefore offer explanations for the growing concerns about the virus and its next wave(s). A future direction of this work is a defuzzification of confusable pattern clusters for precise intra-country SARS-CoV-2 sub-strains analytics.

Details

Title
A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction
Author
Ekpenyong Moses Effiong 1 ; Edoho, Mercy Ernest 2 ; Godwin, Inyang Udoinyang 2 ; Faith-Michael, Uzoka 3 ; Ekaidem Itemobong Samuel 4 ; Effiong, Moses Anietie 4 ; Emeje Martins Ochubiojo 5 ; Mirabeau, Tatfeng Youtchou 6 ; James, Udo Ifiok 2 ; Anwana EnoAbasi Deborah 7 ; Edem, Etim Oboso 8 ; Ikim, Geoffery Joseph 2 ; Ambrose, Dan Emmanuel 2 

 University of Uyo, Department of Computer Science, Uyo, Nigeria (GRID:grid.412960.8) (ISNI:0000 0000 9156 2260); University of Uyo, Centre for Research and Development, Uyo, Nigeria (GRID:grid.412960.8) (ISNI:0000 0000 9156 2260) 
 University of Uyo, Department of Computer Science, Uyo, Nigeria (GRID:grid.412960.8) (ISNI:0000 0000 9156 2260) 
 Mount Royal University, Department of Mathematics and Computing, Calgary, Canada (GRID:grid.411852.b) (ISNI:0000 0000 9943 9777) 
 University of Uyo, College of Health Sciences, Uyo, Nigeria (GRID:grid.412960.8) (ISNI:0000 0000 9156 2260) 
 National Institute for Pharmaceutical Research and Development (NIPRD), Abuja, Nigeria (GRID:grid.419437.c) (ISNI:0000 0001 0164 4826) 
 Niger Delta University, College of Health Sciences, Amassama, Nigeria (GRID:grid.442702.7) (ISNI:0000 0004 1763 4886) 
 University of Uyo, Department of Botany and Ecological Studies, Uyo, Nigeria (GRID:grid.412960.8) (ISNI:0000 0000 9156 2260) 
 University of Uyo, Department of Biochemistry, Uyo, Nigeria (GRID:grid.412960.8) (ISNI:0000 0000 9156 2260) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2551801564
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.