Abstract

The Covid-19 pandemic, a disease transmitted by the SARS-CoV-2 virus, has already caused the infection of more than 120 million people, of which 70 million have been recovered, while 3 million people have died. The high speed of infection has led to the rapid depletion of public health resources in most countries. RT-PCR is Covid-19’s reference diagnostic method. In this work we propose a new technique for representing DNA sequences: they are divided into smaller sequences with overlap in a pseudo-convolutional approach and represented by co-occurrence matrices. This technique eliminates multiple sequence alignment. Through the proposed method, it is possible to identify virus sequences from a large database: 347,363 virus DNA sequences from 24 virus families and SARS-CoV-2. When comparing SARS-CoV-2 with virus families with similar symptoms, we obtained 0.97±0.03 for sensitivity and 0.9919±0.0005 for specificity with MLP classifier and 30% overlap. When SARS-CoV-2 is compared to other coronaviruses and healthy human DNA sequences, we obtained 0.99±0.01 for sensitivity and 0.9986±0.0002 for specificity with MLP and 50% overlap. Therefore, the molecular diagnosis of Covid-19 can be optimized by combining RT-PCR and our pseudo-convolutional method to identify DNA sequences for SARS-CoV-2 with greater specificity and sensitivity.

Details

Title
Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences
Author
Gomes, Juliana Carneiro 1 ; Masood Aras Ismael 2 ; Silva Leandro Honorato de S 3 ; da Cruz Ferreira Janderson Romário B 1 ; Freire Júnior Agostinho Antônio 1 ; Rocha Allana Laís dos Santos 1 ; de Oliveira Letícia Castro Portela 1 ; da Silva Nathália Regina Cauás 1 ; Fernandes Bruno José Torres 1 ; dos Santos Wellington Pinheiro 4   VIAFID ORCID Logo 

 Escola Politécnica da Universidade de Pernambuco, POLI-UPE, Recife, Brazil (GRID:grid.26141.30) (ISNI:0000 0000 9011 5442) 
 Sulaimani Polytechnic University, Information Technology Department, Technical College of Informatics, Sulaymaniyah, Iraq (GRID:grid.449505.9) (ISNI:0000 0004 5914 3700) 
 Escola Politécnica da Universidade de Pernambuco, POLI-UPE, Recife, Brazil (GRID:grid.26141.30) (ISNI:0000 0000 9011 5442); Ciência e Tecnologia da Paraíba, Campus Cajazeiras, IFPB, Instituto Federal de Educação, Cajazeiras, Brazil (GRID:grid.26141.30) 
 Escola Politécnica da Universidade de Pernambuco, POLI-UPE, Recife, Brazil (GRID:grid.26141.30) (ISNI:0000 0000 9011 5442); Universidade Federal de Pernambuco, DEBM-UFPE, Departamento de Engenharia Biomédica, Recife, Brazil (GRID:grid.411227.3) (ISNI:0000 0001 0670 7996) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2536110145
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.