Abstract

Aptamers are single-stranded nucleic acid ligands that bind to target molecules with high affinity and specificity. They are typically discovered by searching large libraries for sequences with desirable binding properties. These libraries, however, are practically constrained to a fraction of the theoretical sequence space. Machine learning provides an opportunity to intelligently navigate this space to identify high-performing aptamers. Here, we propose an approach that employs particle display (PD) to partition a library of aptamers by affinity, and uses such data to train machine learning models to predict affinity in silico. Our model predicted high-affinity DNA aptamers from experimental candidates at a rate 11-fold higher than random perturbation and generated novel, high-affinity aptamers at a greater rate than observed by PD alone. Our approach also facilitated the design of truncated aptamers 70% shorter and with higher binding affinity (1.5 nM) than the best experimental candidate. This work demonstrates how combining machine learning and physical approaches can be used to expedite the discovery of better diagnostic and therapeutic agents.

Current aptamer discovery approaches are unable to probe the complete space of possible sequences. Here, the authors use machine learning to facilitate the development of DNA aptamers with improved binding affinities, and truncate them without significantly compromising binding affinity.

Details

Title
Machine learning guided aptamer refinement and discovery
Author
Bashir, Ali 1 ; Yang, Qin 2   VIAFID ORCID Logo  ; Wang, Jinpeng 2 ; Hoyer, Stephan 1   VIAFID ORCID Logo  ; Chou Wenchuan 2 ; McLean, Cory 1 ; Davis, Geoff 1 ; Gong Qiang 2 ; Armstrong, Zan 1   VIAFID ORCID Logo  ; Jang Junghoon 2 ; Kang, Hui 2 ; Pawlosky Annalisa 1 ; Scott, Alexander 2 ; Dahl, George E 1   VIAFID ORCID Logo  ; Berndl Marc 1   VIAFID ORCID Logo  ; Dimon, Michelle 1   VIAFID ORCID Logo  ; Scott, Ferguson B 2   VIAFID ORCID Logo 

 Google Research, Mountain View, USA (GRID:grid.420451.6) 
 Aptitude Medical Systems Inc., Santa Barbara, USA (GRID:grid.428023.f) 
Publication year
2021
Publication date
2021
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2516597384
Copyright
© The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.