It appears you don't have support to open PDFs in this web browser. To view this file, Open with your PDF reader
Abstract
ABSTRACT
Alternative splicing is widely acknowledged to be a crucial regulator of gene expression and is a key contributor to both normal developmental processes and disease states. While cost-effective and accurate for quantification, short-read RNA-seq lacks the ability to resolve full-length transcript isoforms despite increasingly sophisticated computational methods. Long-read sequencing platforms such as Pacific Biosciences (PacBio) and Oxford Nanopore (ONT) bypass the transcript reconstruction challenges of short reads. Here we introduce TALON, the ENCODE4 pipeline for platform-independent analysis of long-read transcriptomes. We apply TALON to the GM12878 cell line and show that while both PacBio and ONT technologies perform well at full-transcript discovery and quantification, each displayed distinct technical artifacts. We further apply TALON to mouse hippocampus and cortex transcriptomes and find that 422 genes found in these regions have more reads associated with novel isoforms than with annotated ones. We demonstrate that TALON is a capable of tracking both known and novel transcript models as well as their expression levels across datasets for both simple studies and in larger projects. These properties will enable TALON users to move beyond the limitations of short-read data to perform isoform discovery and quantification in a uniform manner on existing and future long-read platforms.
Footnotes
* Dana Wyman: dwyman{at}uci.edu, Gabriela Balderrama-Gutierrez: gbalderr{at}uci.edu, Fairlie Reese: freese{at}uci.edu, Shan Jiang: jiangs2{at}uci.edu, Sorena Rahmanian: sorenar{at}uci.edu, Stefania Forner: sforner{at}uci.edu, Dina Matheos: dina.matheos{at}gmail.com, Weihua Zeng: zengw{at}uci.edu, Brian Williams: bawilli{at}caltech.edu, Diane Trout: diane{at}caltech.edu, Whitney England: wengland{at}uci.edu, Shu-Hui Chu: shuhuic{at}uci.edu, Robert C. Spitale: rspitale{at}uci.edu, Andrea J. Tenner: atenner{at}uci.edu, Barbara J. Wold: woldb{at}caltech.edu, Ali Mortazavi: ali.mortazavi{at}uci.edu
* The manuscript now uses Sequel 2 and updated Nanopore direct RNA data as well as SIRV reference transcripts. All analysis have been updated to use version 5 of TALON.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer