Abstract

ABSTRACT

We propose RgCop, a novel regularized copula based method for gene selection from large single cell RNA-seq data. RgCop utilizes copula correlation (Ccor), a robust equitable dependence measure that captures multivariate dependency among a set of genes in single cell expression data. We raise an objective function by adding a l1 regularization term with Ccor to penalizes the redundant co-efficient of features/genes, resulting non-redundant effective features/genes set. Results show a significant improvement in the clustering/classification performance of real life scRNA-seq data over the other state-of-the-art. RgCop performs extremely well in capturing dependence among the features of noisy data due to the scale invariant property of copula, thereby improving the stability of the method. Moreover, the differentially expressed (DE) genes identified from the clusters of scRNA-seq data are found to provide an accurate annotation of cells. Finally, the features/genes obtained from RgCop can able to annotate the unknown cells with high accuracy.

The corresponding software is available in: https://github.com/Snehalikalall/RgCop

Competing Interest Statement

The authors have declared no competing interest.

Details

Title
RgCop-A regularized copula based method for gene selection in single cell rna-seq data
Author
Lall, Snehalika; Ray, Sumanta; Bandyopadhyay, Sanghamitra
University/institution
Cold Spring Harbor Laboratory Press
Section
New Results
Publication year
2020
Publication date
Dec 24, 2020
Publisher
Cold Spring Harbor Laboratory Press
ISSN
2692-8205
Source type
Working Paper
Language of publication
English
ProQuest document ID
2506025697
Copyright
© 2020. This article is published under http://creativecommons.org/licenses/by/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.