Abstract

A major contributor to the scientific reproducibility crisis has been that the results from homogeneous, single-center studies do not generalize to heterogeneous, real world populations. Multi-cohort gene expression analysis has helped to increase reproducibility by aggregating data from diverse populations into a single analysis. To make the multi-cohort analysis process more feasible, we have assembled an analysis pipeline which implements rigorously studied meta-analysis best practices. We have compiled and made publicly available the results of our own multi-cohort gene expression analysis of 103 diseases, spanning 615 studies and 36,915 samples, through a novel and interactive web application. As a result, we have made both the process of and the results from multi-cohort gene expression analysis more approachable for non-technical users.

Details

Title
Empowering Multi-Cohort Gene Expression Analysis to Increase Reproducibility
Author
Haynes, Winston A; Vallania, Francesco; Liu, Charles; Bongen, Erika; Tomczak, Aurelie; Marta Andres-Terr��; Lofgren, Shane; Tam, Andrew; Deisseroth, Cole A; Li, Matthew D; Sweeney, Timothy E; Khatri, Purvesh
University/institution
Cold Spring Harbor Laboratory Press
Section
New Results
Publication year
2016
Publication date
Aug 25, 2016
Publisher
Cold Spring Harbor Laboratory Press
ISSN
2692-8205
Source type
Working Paper
Language of publication
English
ProQuest document ID
2070551234
Copyright
�� 2016. This article is published under http://creativecommons.org/licenses/by/4.0/ (���the License���). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.