Abstract

The human genome contains more than 200,000 gene isoforms. However, different isoforms can be highly similar, and with an average length of 1.5kb remain difficult to study with short read sequencing. To systematically evaluate the ability to study the transcriptome at a resolution of individual isoforms we profiled 5 human cell lines with short read cDNA sequencing and Nanopore long read direct RNA, amplification-free direct cDNA, PCR-cDNA sequencing. The long read protocols showed a high level of consistency, with amplification-free RNA and cDNA sequencing being most similar. While short and long reads generated comparable gene expression estimates, they differed substantially for individual isoforms. We find that increased read length improves read-to-transcript assignment, identifies interactions between alternative promoters and splicing, enables the discovery of novel transcripts from repetitive regions, facilitates the quantification of full-length fusion isoforms and enables the simultaneous profiling of m6A RNA modifications when RNA is sequenced directly. Our study demonstrates the advantage of long read RNA sequencing and provides a comprehensive resource that will enable the development and benchmarking of computational methods for profiling complex transcriptional events at isoform-level resolution.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

* https://github.com/GoekeLab/sg-nex-data

Details

Title
A systematic benchmark of Nanopore long read RNA sequencing for transcript level analysis in human cell lines
Author
Chen, Ying; Davidson, Nadia; Wan, Yuk Kei; Patel, Harshil; Yao, Fei; Hwee Meng Low; Hendra, Christopher; Watten, Laura; Sim, Andre; Sawyer, Chelsea; Iakovleva, Viktoriia; Lee, Puay Leng; Xin, Lixia; Hui En Vanessa Ng; Loo, Jia Min; Ong, Xuewen; Hui Qi Amanda Ng; Wang, Jiaxu; Wei Qian Casslynn Koh; Suk Yeah Polly Poon; Stanojevic, Dominik; Hoang-Dai, Tran; Kok Hao Edwin Lim; Toh, Shen Yon; Ewels, Philip; Huck-Hui Ng; N Gopalakrishna Iyer; Thiery, Alexandre; Chng, Wee Joo; Chen, Leilei; Dasgupta, Ramanuj; Sikic, Mile; Yun-Shen, Chan; Boon Ooi Patrick Tan; Wan, Yue; Wai Leong Tam; Yu, Qiang; Khor, Chiea Chuen; Wuestefeld, Torsten; Pratanwanich, Ploy N; Love, Michael I; Wee Siong Sho Goh; Ng, Sarah; Oshlack, Alicia; Goeke, Jonathan
University/institution
Cold Spring Harbor Laboratory Press
Section
New Results
Publication year
2021
Publication date
Apr 22, 2021
Publisher
Cold Spring Harbor Laboratory Press
ISSN
2692-8205
Source type
Working Paper
Language of publication
English
ProQuest document ID
2516611705
Copyright
© 2021. This article is published under http://creativecommons.org/licenses/by-nd/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.