Abstract

Predicting the blood–brain barrier (BBB) permeability of small-molecule compounds using a novel artificial intelligence platform is necessary for drug discovery. Machine learning and a large language model on artificial intelligence (AI) tools improve the accuracy and shorten the time for new drug development. The primary goal of this research is to develop artificial intelligence (AI) computing models and novel deep learning architectures capable of predicting whether molecules can permeate the human blood–brain barrier (BBB). The in silico (computational) and in vitro (experimental) results were validated by the Natural Products Research Laboratories (NPRL) at China Medical University Hospital (CMUH). The transformer-based MegaMolBART was used as the simplified molecular input line entry system (SMILES) encoder with an XGBoost classifier as an in silico method to check if a molecule could cross through the BBB. We used Morgan or Circular fingerprints to apply the Morgan algorithm to a set of atomic invariants as a baseline encoder also with an XGBoost classifier to compare the results. BBB permeability was assessed in vitro using three-dimensional (3D) human BBB spheroids (human brain microvascular endothelial cells, brain vascular pericytes, and astrocytes). Using multiple BBB databases, the results of the final in silico transformer and XGBoost model achieved an area under the receiver operating characteristic curve of 0.88 on the held-out test dataset. Temozolomide (TMZ) and 21 randomly selected BBB permeable compounds (Pred scores = 1, indicating BBB-permeable) from the NPRL penetrated human BBB spheroid cells. No evidence suggests that ferulic acid or five BBB-impermeable compounds (Pred scores < 1.29423E−05, which designate compounds that pass through the human BBB) can pass through the spheroid cells of the BBB. Our validation of in vitro experiments indicated that the in silico prediction of small-molecule permeation in the BBB model is accurate. Transformer-based models like MegaMolBART, leveraging the SMILES representations of molecules, show great promise for applications in new drug discovery. These models have the potential to accelerate the development of novel targeted treatments for disorders of the central nervous system.

Details

Title
Predicting blood–brain barrier permeability of molecules with a large language model and machine learning
Author
Huang, Eddie T. C. 1 ; Yang, Jai-Sing 2 ; Liao, Ken Y. K. 1 ; Tseng, Warren C. W. 1 ; Lee, C. K. 1 ; Gill, Michelle 1 ; Compas, Colin 1 ; See, Simon 1 ; Tsai, Fuu-Jen 3 

 NVIDIA AI Technology Center, NVIDIA Corporation, Santa Clara, USA (GRID:grid.451133.1) (ISNI:0000 0004 0458 4453) 
 China Medical University Hospital, China Medical University, Department of Medical Research, Taichung, Taiwan (GRID:grid.451133.1) 
 China Medical University, China Medical University Children’s Hospital, School of Chinese Medicine, College of Chinese Medicine, Taichung, Taiwan (GRID:grid.412449.e) (ISNI:0000 0000 9678 1884); China Medical University Children’s Hospital, Taichung, Taiwan (GRID:grid.254145.3) (ISNI:0000 0001 0083 6092) 
Pages
15844
Publication year
2024
Publication date
2024
Publisher
Nature Publishing Group
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3077589832
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.