Abstract

Drug-target interaction (DTI) prediction plays a crucial role in drug repositioning and virtual drug screening. Most DTI prediction methods cast the problem as a binary classification task to predict if interactions exist or as a regression task to predict continuous values that indicate a drug's ability to bind to a specific target. The regression-based methods provide insight beyond the binary relationship. However, most of these methods require the three-dimensional (3D) structural information of targets which are still not generally available to the targets. Despite this bottleneck, only a few methods address the drug-target binding affinity (DTBA) problem from a non-structure-based approach to avoid the 3D structure limitations. Here we propose Affinity2Vec, as a novel regression-based method that formulates the entire task as a graph-based problem. To develop this method, we constructed a weighted heterogeneous graph that integrates data from several sources, including drug-drug similarity, target-target similarity, and drug-target binding affinities. Affinity2Vec further combines several computational techniques from feature representation learning, graph mining, and machine learning to generate or extract features, build the model, and predict the binding affinity between the drug and the target with no 3D structural data. We conducted extensive experiments to evaluate and demonstrate the robustness and efficiency of the proposed method on benchmark datasets used in state-of-the-art non-structured-based drug-target binding affinity studies. Affinity2Vec showed superior and competitive results compared to the state-of-the-art methods based on several evaluation metrics, including mean squared error, rm2, concordance index, and area under the precision-recall curve.

Details

Title
Affinity2Vec: drug-target binding affinity prediction through representation learning, graph mining, and machine learning
Author
Thafar, Maha A 1 ; Alshahrani Mona 2 ; Somayah, Albaradei 3 ; Gojobori Takashi 4 ; Magbubah, Essack 4 ; Gao, Xin 4 

 King Abdullah University of Science and Technology (KAUST), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Computational Bioscience Research Center (CBRC), Thuwal, Saudi Arabia (GRID:grid.45672.32) (ISNI:0000 0001 1926 5090); Taif University, College of Computers and Information Technology, Taif, Saudi Arabia (GRID:grid.412895.3) (ISNI:0000 0004 0419 5255) 
 National Center for Artificial Intelligence (NCAI), Saudi Data and Artificial Intelligence Authority (SDAIA), Riyadh, Saudi Arabia (GRID:grid.412895.3) 
 King Abdullah University of Science and Technology (KAUST), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Computational Bioscience Research Center (CBRC), Thuwal, Saudi Arabia (GRID:grid.45672.32) (ISNI:0000 0001 1926 5090); King Abdulaziz University, Faculty of Computing and Information Technology, Jeddah, Saudi Arabia (GRID:grid.412125.1) (ISNI:0000 0001 0619 1117) 
 King Abdullah University of Science and Technology (KAUST), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Computational Bioscience Research Center (CBRC), Thuwal, Saudi Arabia (GRID:grid.45672.32) (ISNI:0000 0001 1926 5090) 
Publication year
2022
Publication date
2022
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2640669876
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.