Abstract

We address the problem of generating novel molecules with desired interaction properties as a multi-objective optimization problem. Interaction binding models are learned from binding data using graph convolution networks (GCNs). Since the experimentally obtained property scores are recognised as having potentially gross errors, we adopted a robust loss for the model. Combinations of these terms, including drug likeness and synthetic accessibility, are then optimized using reinforcement learning based on a graph convolution policy approach. Some of the molecules generated, while legitimate chemically, can have excellent drug-likeness scores but appear unusual. We provide an example based on the binding potency of small molecules to dopamine transporters. We extend our method successfully to use a multi-objective reward function, in this case for generating novel molecules that bind with dopamine transporters but not with those for norepinephrine. Our method should be generally applicable to the generation in silico of molecules with desirable properties.

Details

Title
DeepGraphMolGen, a multi-objective, computational strategy for generating molecules with desirable properties: a graph convolution and reinforcement learning approach
Author
Khemchandani Yash 1 ; O’Hagan Stephen 2 ; Samanta Soumitra 3 ; Swainston, Neil 3 ; Roberts, Timothy J 3 ; Bollegala Danushka 4 ; Kell, Douglas B 5   VIAFID ORCID Logo 

 Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Department of Biochemistry and Systems Biology, Liverpool, UK (GRID:grid.10025.36) (ISNI:0000 0004 1936 8470); Indian Institute of Technology Bombay, Mumbai, India (GRID:grid.417971.d) (ISNI:0000 0001 2198 7527) 
 Dept of Chemistry, Manchester Institute of Biotechnology, The University of Manchester, Manchester, UK (GRID:grid.5379.8) (ISNI:0000000121662407) 
 Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Department of Biochemistry and Systems Biology, Liverpool, UK (GRID:grid.10025.36) (ISNI:0000 0004 1936 8470) 
 Dept of Computer Science, University of Liverpool, Liverpool, UK (GRID:grid.10025.36) (ISNI:0000 0004 1936 8470) 
 Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Department of Biochemistry and Systems Biology, Liverpool, UK (GRID:grid.10025.36) (ISNI:0000 0004 1936 8470); The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark (GRID:grid.5170.3) (ISNI:0000 0001 2181 8870) 
Publication year
2020
Publication date
Dec 2020
Publisher
Springer Nature B.V.
e-ISSN
1758-2946
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2440213516
Copyright
© The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.