Abstract

Retrosynthesis, the strategy of devising laboratory pathways by working backwards from the target compound, is crucial yet challenging. Enhancing retrosynthetic efficiency requires overcoming the vast complexity of chemical space, the limited known interconversions between molecules, and the challenges posed by limited experimental datasets. This study introduces generative machine learning methods for retrosynthetic planning. The approach features three innovations: generating reaction templates instead of reactants or synthons to create novel chemical transformations, allowing user selection of specific bonds to change for human-influenced synthesis, and employing a conditional kernel-elastic autoencoder (CKAE) to measure the similarity between generated and known reactions for chemical viability insights. These features form a coherent retrosynthetic framework, validated experimentally by designing a 3-step synthetic pathway for a challenging small molecule, demonstrating a significant improvement over previous 5-9 step approaches. This work highlights the utility and robustness of generative machine learning in addressing complex challenges in chemical synthesis.

Enhancing retrosynthetic efficiency requires overcoming the vast complexity of chemical space, the limited known interconversions between molecules, and the challenges posed by limited experimental datasets. Here, the authors introduce generative machine learning methods for retrosynthetic planning that generate reaction templates.

Details

Title
Site-specific template generative approach for retrosynthetic planning
Author
Shee, Yu 1   VIAFID ORCID Logo  ; Li, Haote 1 ; Zhang, Pengpeng 1 ; Nikolic, Andrea M. 1   VIAFID ORCID Logo  ; Lu, Wenxin 1   VIAFID ORCID Logo  ; Kelly, H. Ray 2   VIAFID ORCID Logo  ; Manee, Vidhyadhar 2 ; Sreekumar, Sanil 2 ; Buono, Frederic G. 2 ; Song, Jinhua J. 2 ; Newhouse, Timothy R. 1   VIAFID ORCID Logo  ; Batista, Victor S. 1   VIAFID ORCID Logo 

 Yale University, Department of Chemistry, New Haven, USA (GRID:grid.47100.32) (ISNI:0000 0004 1936 8710) 
 Boehringer Ingelheim Pharmaceuticals Inc, Chemical Development, Ridgefield, USA (GRID:grid.418412.a) (ISNI:0000 0001 1312 9717) 
Pages
7818
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3102223637
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.