Cargando…

Robust Regression Analysis of Copy Number Variation Data based on a Univariate Score

MOTIVATION: The discovery that copy number variants (CNVs) are widespread in the human genome has motivated development of numerous algorithms that attempt to detect CNVs from intensity data. However, all approaches are plagued by high false discovery rates. Further, because CNVs are characterized b...

Descripción completa

Detalles Bibliográficos
Autores principales: Satten, Glen A., Allen, Andrew S., Ikeda, Morna, Mulle, Jennifer G., Warren, Stephen T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3917847/
https://www.ncbi.nlm.nih.gov/pubmed/24516529
http://dx.doi.org/10.1371/journal.pone.0086272
_version_ 1782302885737922560
author Satten, Glen A.
Allen, Andrew S.
Ikeda, Morna
Mulle, Jennifer G.
Warren, Stephen T.
author_facet Satten, Glen A.
Allen, Andrew S.
Ikeda, Morna
Mulle, Jennifer G.
Warren, Stephen T.
author_sort Satten, Glen A.
collection PubMed
description MOTIVATION: The discovery that copy number variants (CNVs) are widespread in the human genome has motivated development of numerous algorithms that attempt to detect CNVs from intensity data. However, all approaches are plagued by high false discovery rates. Further, because CNVs are characterized by two dimensions (length and intensity) it is unclear how to order called CNVs to prioritize experimental validation. RESULTS: We developed a univariate score that correlates with the likelihood that a CNV is true. This score can be used to order CNV calls in such a way that calls having larger scores are more likely to overlap a true CNV. We developed cnv.beast, a computationally efficient algorithm for calling CNVs that uses robust backward elimination regression to keep CNV calls with scores that exceed a user-defined threshold. Using an independent dataset that was measured using a different platform, we validated our score and showed that our approach performed better than six other currently-available methods. AVAILABILITY: cnv.beast is available at http://www.duke.edu/~asallen/Software.html.
format Online
Article
Text
id pubmed-3917847
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39178472014-02-10 Robust Regression Analysis of Copy Number Variation Data based on a Univariate Score Satten, Glen A. Allen, Andrew S. Ikeda, Morna Mulle, Jennifer G. Warren, Stephen T. PLoS One Research Article MOTIVATION: The discovery that copy number variants (CNVs) are widespread in the human genome has motivated development of numerous algorithms that attempt to detect CNVs from intensity data. However, all approaches are plagued by high false discovery rates. Further, because CNVs are characterized by two dimensions (length and intensity) it is unclear how to order called CNVs to prioritize experimental validation. RESULTS: We developed a univariate score that correlates with the likelihood that a CNV is true. This score can be used to order CNV calls in such a way that calls having larger scores are more likely to overlap a true CNV. We developed cnv.beast, a computationally efficient algorithm for calling CNVs that uses robust backward elimination regression to keep CNV calls with scores that exceed a user-defined threshold. Using an independent dataset that was measured using a different platform, we validated our score and showed that our approach performed better than six other currently-available methods. AVAILABILITY: cnv.beast is available at http://www.duke.edu/~asallen/Software.html. Public Library of Science 2014-02-07 /pmc/articles/PMC3917847/ /pubmed/24516529 http://dx.doi.org/10.1371/journal.pone.0086272 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Satten, Glen A.
Allen, Andrew S.
Ikeda, Morna
Mulle, Jennifer G.
Warren, Stephen T.
Robust Regression Analysis of Copy Number Variation Data based on a Univariate Score
title Robust Regression Analysis of Copy Number Variation Data based on a Univariate Score
title_full Robust Regression Analysis of Copy Number Variation Data based on a Univariate Score
title_fullStr Robust Regression Analysis of Copy Number Variation Data based on a Univariate Score
title_full_unstemmed Robust Regression Analysis of Copy Number Variation Data based on a Univariate Score
title_short Robust Regression Analysis of Copy Number Variation Data based on a Univariate Score
title_sort robust regression analysis of copy number variation data based on a univariate score
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3917847/
https://www.ncbi.nlm.nih.gov/pubmed/24516529
http://dx.doi.org/10.1371/journal.pone.0086272
work_keys_str_mv AT sattenglena robustregressionanalysisofcopynumbervariationdatabasedonaunivariatescore
AT allenandrews robustregressionanalysisofcopynumbervariationdatabasedonaunivariatescore
AT ikedamorna robustregressionanalysisofcopynumbervariationdatabasedonaunivariatescore
AT mullejenniferg robustregressionanalysisofcopynumbervariationdatabasedonaunivariatescore
AT warrenstephent robustregressionanalysisofcopynumbervariationdatabasedonaunivariatescore