Abstract

Predicting protein–protein interaction and non-interaction are two important different aspects of multi-body structure predictions, which provide vital information about protein function. Some computational methods have recently been developed to complement experimental methods, but still cannot effectively detect real non-interacting protein pairs. We proposed a gene sequence-based method, named NVDT (Natural Vector combine with Dinucleotide and Triplet nucleotide), for the prediction of interaction and non-interaction. For protein–protein non-interactions (PPNIs), the proposed method obtained accuracies of 86.23% for Homo sapiens and 85.34% for Mus musculus, and it performed well on three types of non-interaction networks. For protein-protein interactions (PPIs), we obtained accuracies of 99.20, 94.94, 98.56, 95.41, and 94.83% for Saccharomyces cerevisiae, Drosophila melanogaster, Helicobacter pylori, Homo sapiens, and Mus musculus, respectively. Furthermore, NVDT outperformed established sequence-based methods and demonstrated high prediction results for cross-species interactions. NVDT is expected to be an effective approach for predicting PPIs and PPNIs.

Protein-protein non-interactions and interactions are distinguished and predicted by gene sequence using single nucleotide and contiguous nucleotides combined with machine learning models.

Details

Title
Protein–protein interaction and non-interaction predictions using gene sequence natural vector
Author
Zhao, Nan 1 ; Zhuo, Maji 1 ; Tian, Kun 1 ; Gong, Xinqi 2   VIAFID ORCID Logo 

 Renmin University of China, Institute for Mathematical Sciences, School of Mathematics, Beijing, China (GRID:grid.24539.39) (ISNI:0000 0004 0368 8103) 
 Renmin University of China, Institute for Mathematical Sciences, School of Mathematics, Beijing, China (GRID:grid.24539.39) (ISNI:0000 0004 0368 8103); Beijing Academy of Artificial Intelligence, Beijing, China (GRID:grid.511045.4); Tsinghua University, Beijing Advanced Innovation Center for Structural Biology, Beijing, China (GRID:grid.12527.33) (ISNI:0000 0001 0662 3178) 
Publication year
2022
Publication date
2022
Publisher
Nature Publishing Group
e-ISSN
23993642
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2683501037
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.