Cargando…
Removing batch effects for prediction problems with frozen surrogate variable analysis
Batch effects are responsible for the failure of promising genomic prognostic signatures, major ambiguities in published genomic results, and retractions of widely-publicized findings. Batch effect corrections have been developed to remove these artifacts, but they are designed to be used in populat...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4179553/ https://www.ncbi.nlm.nih.gov/pubmed/25332844 http://dx.doi.org/10.7717/peerj.561 |
_version_ | 1782337109388951552 |
---|---|
author | Parker, Hilary S. Corrada Bravo, Héctor Leek, Jeffrey T. |
author_facet | Parker, Hilary S. Corrada Bravo, Héctor Leek, Jeffrey T. |
author_sort | Parker, Hilary S. |
collection | PubMed |
description | Batch effects are responsible for the failure of promising genomic prognostic signatures, major ambiguities in published genomic results, and retractions of widely-publicized findings. Batch effect corrections have been developed to remove these artifacts, but they are designed to be used in population studies. But genomic technologies are beginning to be used in clinical applications where samples are analyzed one at a time for diagnostic, prognostic, and predictive applications. There are currently no batch correction methods that have been developed specifically for prediction. In this paper, we propose an new method called frozen surrogate variable analysis (fSVA) that borrows strength from a training set for individual sample batch correction. We show that fSVA improves prediction accuracy in simulations and in public genomic studies. fSVA is available as part of the sva Bioconductor package. |
format | Online Article Text |
id | pubmed-4179553 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-41795532014-10-20 Removing batch effects for prediction problems with frozen surrogate variable analysis Parker, Hilary S. Corrada Bravo, Héctor Leek, Jeffrey T. PeerJ Bioinformatics Batch effects are responsible for the failure of promising genomic prognostic signatures, major ambiguities in published genomic results, and retractions of widely-publicized findings. Batch effect corrections have been developed to remove these artifacts, but they are designed to be used in population studies. But genomic technologies are beginning to be used in clinical applications where samples are analyzed one at a time for diagnostic, prognostic, and predictive applications. There are currently no batch correction methods that have been developed specifically for prediction. In this paper, we propose an new method called frozen surrogate variable analysis (fSVA) that borrows strength from a training set for individual sample batch correction. We show that fSVA improves prediction accuracy in simulations and in public genomic studies. fSVA is available as part of the sva Bioconductor package. PeerJ Inc. 2014-09-23 /pmc/articles/PMC4179553/ /pubmed/25332844 http://dx.doi.org/10.7717/peerj.561 Text en © 2014 Parker et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Parker, Hilary S. Corrada Bravo, Héctor Leek, Jeffrey T. Removing batch effects for prediction problems with frozen surrogate variable analysis |
title | Removing batch effects for prediction problems with frozen surrogate variable analysis |
title_full | Removing batch effects for prediction problems with frozen surrogate variable analysis |
title_fullStr | Removing batch effects for prediction problems with frozen surrogate variable analysis |
title_full_unstemmed | Removing batch effects for prediction problems with frozen surrogate variable analysis |
title_short | Removing batch effects for prediction problems with frozen surrogate variable analysis |
title_sort | removing batch effects for prediction problems with frozen surrogate variable analysis |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4179553/ https://www.ncbi.nlm.nih.gov/pubmed/25332844 http://dx.doi.org/10.7717/peerj.561 |
work_keys_str_mv | AT parkerhilarys removingbatcheffectsforpredictionproblemswithfrozensurrogatevariableanalysis AT corradabravohector removingbatcheffectsforpredictionproblemswithfrozensurrogatevariableanalysis AT leekjeffreyt removingbatcheffectsforpredictionproblemswithfrozensurrogatevariableanalysis |