Full text

Turn on search term navigation

© 2017 Zych et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

In high-throughput molecular profiling studies, genotype labels can be wrongly assigned at various experimental steps; the resulting mislabeled samples seriously reduce the power to detect the genetic basis of phenotypic variation. We have developed an approach to detect potential mislabeling, recover the “ideal” genotype and identify “best-matched” labels for mislabeled samples. On average, we identified 4% of samples as mislabeled in eight published datasets, highlighting the necessity of applying a “data cleaning” step before standard data analysis.

Details

Title
reGenotyper: Detecting mislabeled samples in genetic data
Author
Zych, Konrad; Snoek, Basten L; Elvin, Mark; Rodriguez, Miriam; K Joeri Van der Velde; Arends, Danny; Harm-Jan Westra; Swertz, Morris A; Poulin, Gino; Kammenga, Jan E; Breitling, Rainer; Jansen, Ritsert C; Yang, Li
First page
e0171324
Section
Research Article
Publication year
2017
Publication date
Feb 2017
Publisher
Public Library of Science
e-ISSN
19326203
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1867965814
Copyright
© 2017 Zych et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.