Abstract

A major challenge in the application of the CRISPR-Cas13d system is to accurately predict its guide-dependent on-target and off-target effect. Here, we perform CRISPR-Cas13d proliferation screens and design a deep learning model, named DeepCas13, to predict the on-target activity from guide sequences and secondary structures. DeepCas13 outperforms existing methods to predict the efficiency of guides targeting both protein-coding and non-coding RNAs. Guides targeting non-essential genes display off-target viability effects, which are closely related to their on-target efficiencies. Choosing proper negative control guides during normalization mitigates the associated false positives in proliferation screens. We apply DeepCas13 to the guides targeting lncRNAs, and identify lncRNAs that affect cell viability and proliferation in multiple cell lines. The higher prediction accuracy of DeepCas13 over existing methods is extensively confirmed via a secondary CRISPR-Cas13d screen and quantitative RT-PCR experiments. DeepCas13 is freely accessible via http://deepcas13.weililab.org.

Application of CRISPR-Cas13d is limited by the inability to predict on- and off-targets. Here the authors perform CRISPR-Cas13d proliferation screens followed by modeling of Cas13d on- and off-targets; they design a deep learning model, DeepCas13, to predict the on-target activity of a gRNA.

Details

Title
Modeling CRISPR-Cas13d on-target and off-target effects using machine learning approaches
Author
Cheng, Xiaolong 1   VIAFID ORCID Logo  ; Li, Zexu 2   VIAFID ORCID Logo  ; Shan, Ruocheng 3 ; Li, Zihan 2   VIAFID ORCID Logo  ; Wang, Shengnan 2 ; Zhao, Wenchang 2 ; Zhang, Han 2 ; Chao, Lumen 1 ; Peng, Jian 4   VIAFID ORCID Logo  ; Fei, Teng 2   VIAFID ORCID Logo  ; Li, Wei 1   VIAFID ORCID Logo 

 Children’s National Hospital, Center for Genetic Medicine Research, Washington, USA (GRID:grid.239560.b) (ISNI:0000 0004 0482 1586); George Washington University, Department of Genomics and Precision Medicine, Washington, USA (GRID:grid.253615.6) (ISNI:0000 0004 1936 9510) 
 Northeastern University, National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Key Laboratory of Bioresource Research and Development of Liaoning Province, College of Life and Health Sciences, Shenyang, China (GRID:grid.412252.2) (ISNI:0000 0004 0368 6968) 
 Children’s National Hospital, Center for Genetic Medicine Research, Washington, USA (GRID:grid.239560.b) (ISNI:0000 0004 0482 1586); George Washington University, Department of Computer Science, Washington, USA (GRID:grid.253615.6) (ISNI:0000 0004 1936 9510) 
 University of Illinois at Urbana-Champaign, Department of Computer Science, Urbana, USA (GRID:grid.35403.31) (ISNI:0000 0004 1936 9991) 
Pages
752
Publication year
2023
Publication date
2023
Publisher
Nature Publishing Group
e-ISSN
20411723
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2775138351
Copyright
© The Author(s) 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.