Content area

Abstract

High-throughput sequencing of mRNA (RNA-seq) has become the standard method for measuring and comparing the levels of gene expression in a wide variety of species and conditions. RNA-seq experiments generate very large, complex data sets that demand fast, accurate and flexible software to reduce the raw read data to comprehensible results. HISAT (hierarchical indexing for spliced alignment of transcripts), StringTie and Ballgown are free, open-source software tools for comprehensive analysis of RNA-seq experiments. Together, they allow scientists to align reads to a genome, assemble transcripts including novel splice variants, compute the abundance of these transcripts in each sample and compare experiments to identify differentially expressed genes and transcripts. This protocol describes all the steps necessary to process a large set of raw sequencing reads and create lists of gene transcripts, expression levels, and differentially expressed genes and transcripts. The protocol's execution time depends on the computing resources, but it typically takes under 45 min of computer time. HISAT, StringTie and Ballgown are available from http://ccb.jhu.edu/software.shtml.

Details

Title
Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown
Author
Pertea, Mihaela; Kim, Daehwan; Pertea, Geo M; Leek, Jeffrey T; Salzberg, Steven L
Pages
1650-1667
Publication year
2016
Publication date
Sep 2016
Publisher
Nature Publishing Group
ISSN
17542189
e-ISSN
17502799
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1815346347
Copyright
Copyright Nature Publishing Group Sep 2016