Full text

Turn on search term navigation

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Copy number variation (CNV) is a primary source of structural variation in the human genome, leading to several disorders. Therefore, analyzing neonatal CNVs is crucial for managing CNV-related chromosomal disabilities. However, genomic waves can hinder accurate CNV analysis. To mitigate the influences of the waves, we adopted a machine learning approach and developed a new method that uses a modified log R ratio instead of the commonly used log R ratio. Validation results using samples with known CNVs demonstrated the superior performance of our method. We analyzed a total of 16,046 Korean newborn samples using the new method and identified CNVs related to 39 genetic disorders were identified in 342 cases. The most frequently detected CNV-related disorder was Joubert syndrome 4. The accuracy of our method was further confirmed by analyzing a subset of the detected results using NGS and comparing them with our results. The utilization of a genome-wide single nucleotide polymorphism array with wave offset was shown to be a powerful method for identifying CNVs in neonatal cases. The accurate screening and the ability to identify various disease susceptibilities offered by our new method could facilitate the identification of CNV-associated chromosomal disease etiologies.

Details

Title
Improving CNV Detection Performance in Microarray Data Using a Machine Learning-Based Approach
Author
Goh, Chul Jun 1 ; Hyuk-Jung Kwon 2   VIAFID ORCID Logo  ; Kim, Yoonhee 1 ; Jung, Seunghee 1 ; Park, Jiwoo 1 ; Isaac Kise Lee 3 ; Bo-Ram, Park 1 ; Myeong-Ji, Kim 1 ; Min-Jeong, Kim 4 ; Lee, Min-Seob 5 

 Eone-Diagnomics Genome Center, Inc., 143, Gaetbeol-ro, Yeonsu-gu, Incheon 21999, Republic of Korea; [email protected] (C.J.G.); [email protected] (H.-J.K.); [email protected] (Y.K.); [email protected] (S.J.); [email protected] (J.P.); [email protected] (I.K.L.); [email protected] (B.-R.P.); [email protected] (M.-J.K.) 
 Eone-Diagnomics Genome Center, Inc., 143, Gaetbeol-ro, Yeonsu-gu, Incheon 21999, Republic of Korea; [email protected] (C.J.G.); [email protected] (H.-J.K.); [email protected] (Y.K.); [email protected] (S.J.); [email protected] (J.P.); [email protected] (I.K.L.); [email protected] (B.-R.P.); [email protected] (M.-J.K.); Department of Computer Science and Engineering, Incheon National University (INU), Incheon 22012, Republic of Korea 
 Eone-Diagnomics Genome Center, Inc., 143, Gaetbeol-ro, Yeonsu-gu, Incheon 21999, Republic of Korea; [email protected] (C.J.G.); [email protected] (H.-J.K.); [email protected] (Y.K.); [email protected] (S.J.); [email protected] (J.P.); [email protected] (I.K.L.); [email protected] (B.-R.P.); [email protected] (M.-J.K.); Department of Computer Science and Engineering, Incheon National University (INU), Incheon 22012, Republic of Korea; NGENI Foundation, San Diego, CA 92127, USA 
 Diagnomics, Inc., 5795 Kearny Villa Rd., San Diego, CA 92123, USA; [email protected] 
 Eone-Diagnomics Genome Center, Inc., 143, Gaetbeol-ro, Yeonsu-gu, Incheon 21999, Republic of Korea; [email protected] (C.J.G.); [email protected] (H.-J.K.); [email protected] (Y.K.); [email protected] (S.J.); [email protected] (J.P.); [email protected] (I.K.L.); [email protected] (B.-R.P.); [email protected] (M.-J.K.); Diagnomics, Inc., 5795 Kearny Villa Rd., San Diego, CA 92123, USA; [email protected] 
First page
84
Publication year
2024
Publication date
2024
Publisher
MDPI AG
e-ISSN
20754418
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2912510751
Copyright
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.