Full text

Turn on search term navigation

Copyright © 2022 Haleema Attique et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/

Abstract

DNA copy number variation (CNV) is the type of DNA variation which is associated with various human diseases. CNV ranges in size from 1 kilobase to several megabases on a chromosome. Most of the computational research for cancer classification is traditional machine learning based, which relies on handcrafted extraction and selection of features. To the best of our knowledge, the deep learning-based research also uses the step of feature extraction and selection. To understand the difference between multiple human cancers, we developed three end-to-end deep learning models, i.e., DNN (fully connected), CNN (convolution neural network), and RNN (recurrent neural network), to classify six cancer types using the CNV data of 24,174 genes. The strength of an end-to-end deep learning model lies in representation learning (automatic feature extraction). The purpose of proposing more than one model is to find which architecture among them performs better for CNV data. Our best model achieved 92% accuracy with an ROC of 0.99, and we compared the performances of our proposed models with state-of-the-art techniques. Our models have outperformed the state-of-the-art techniques in terms of accuracy, precision, and ROC. In the future, we aim to work on other types of cancers as well.

Details

Title
Multiclass Cancer Prediction Based on Copy Number Variation Using Deep Learning
Author
Attique, Haleema 1 ; Shah, Sajid 2   VIAFID ORCID Logo  ; Jabeen, Saima 3 ; Fiaz Gul Khan 1 ; Khan, Ahmad 1 ; ELAffendi, Mohammed 4   VIAFID ORCID Logo 

 Department of Computer Science, COMSATS University Islamabad, Abbottabad Campus, Islamabad, Pakistan 
 Department of Computer Science, COMSATS University Islamabad, Abbottabad Campus, Islamabad, Pakistan; EIAS Data Science Lab, College of Computer and Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia 
 Department of IT and Computer Science, Pak-Austria Facchochschule: Institute of Applied Sciences and Technology, Mang, Haripur, KPK, Pakistan 
 EIAS Data Science Lab, College of Computer and Information Sciences, Prince Sultan University, Riyadh, Saudi Arabia 
Editor
Mohamed Abdelaziz
Publication year
2022
Publication date
2022
Publisher
John Wiley & Sons, Inc.
ISSN
16875265
e-ISSN
16875273
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2678217422
Copyright
Copyright © 2022 Haleema Attique et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/