Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a highly pathogenic virus that has caused the global COVID-19 pandemic. Tracing the evolution and transmission of the virus is crucial to respond to and control the pandemic through appropriate intervention strategies. This paper reports and analyses genomic mutations in the coding regions of SARS-CoV-2 and their probable protein secondary structure and solvent accessibility changes, which are predicted using deep learning models. Prediction results suggest that mutation D614G in the virus spike protein, which has attracted much attention from researchers, is unlikely to make changes in protein secondary structure and relative solvent accessibility. Based on 6324 viral genome sequences, we create a spreadsheet dataset of point mutations that can facilitate the investigation of SARS-CoV-2 in many perspectives, especially in tracing the evolution and worldwide spread of the virus. Our analysis results also show that coding genes E, M, ORF6, ORF7a, ORF7b and ORF10 are most stable, potentially suitable to be targeted for vaccine and drug development.

Details

Title
Genomic mutations and changes in protein secondary structure and solvent accessibility of SARS-CoV-2 (COVID-19 virus)
Author
Nguyen, Thanh Thi 1   VIAFID ORCID Logo  ; Pathirana, Pubudu N 2 ; Nguyen Thin 3 ; Nguyen Quoc Viet Hung 4   VIAFID ORCID Logo  ; Bhatti Asim 5   VIAFID ORCID Logo  ; Nguyen, Dinh C 2 ; Nguyen, Dung Tien 1 ; Nguyen, Ngoc Duy 5 ; Creighton, Douglas 5 ; Abdelrazek, Mohamed 1 

 Deakin University, School of Information Technology, Victoria, Australia (GRID:grid.1021.2) (ISNI:0000 0001 0526 7079) 
 Deakin University, School of Engineering, Victoria, Australia (GRID:grid.1021.2) (ISNI:0000 0001 0526 7079) 
 Deakin University, Applied Artificial Intelligence Institute (A2I2), Victoria, Australia (GRID:grid.1021.2) (ISNI:0000 0001 0526 7079) 
 Griffith University, School of Information and Communication Technology, Queensland, Australia (GRID:grid.1022.1) (ISNI:0000 0004 0437 5432) 
 Deakin University, Institute for Intelligent Systems Research and Innovation (IISRI), Victoria, Australia (GRID:grid.1021.2) (ISNI:0000 0001 0526 7079) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2488036615
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.