It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
NOABSTRACT
The precise prediction of off-target effects in CRISPR-Cas9 genome editing is critical for ensuring the safety and efficacy of this powerful tool. This study leverages machine learning techniques to predict off-target cleavage sites and investigate the underlying mechanisms that affect cleavage efficiencies. By integrating data from Tsai et al. and Kleinsteiver et al., who employed the GUIDE-seq method, we aim to enhance our understanding of the factors influencing CRISPR-Cas9 activity.
Our research analyzed datasets from Tsai et al. and Kleinsteiver et al., standardizing cleavage efficiencies to align with Tsai et al.’s comprehensive dataset. We identified a range of sequence features, including PAM sequence types, nucleotide composition, GC content, chromatin structure, CpG islands, and gene expression levels. Various machine learning models, including Artificial Neural Networks, Support Vector Machines, Naïve Bayes, k-Nearest Neighbors, Logistic Regression, and Extra Trees Classifiers, were developed and evaluated. The Extra Trees Classifier, particularly with class weighting, exhibited robust performance, achieving high accuracy, precision, recall, and F1 scores. SHAP analysis provided insights into feature importance, highlighting the significant factors contributing to model predictions.
The application of machine learning to predict CRISPR-Cas9 off-target effects demonstrates significant potential in enhancing the precision of genome editing. Our findings underscore the importance of considering a diverse range of sequence and genomic features to improve prediction models. The insights gained from this study can inform the development of safer and more effective CRISPR-based applications in medicine, agriculture, and biotechnology. Future work will focus on further refining these models and exploring their applicability across different genomic contexts.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
1 School of Biotechnology, Gautam Buddha University, Greater Noida, Uttar Pradesh, 201312, India
2 School of Information and Communication Technology, Gautam Buddha University, Greater Noida, Uttar Pradesh, 201312, India




