Cargando…

Performance evaluation of DNA copy number segmentation methods

A number of bioinformatic or biostatistical methods are available for analyzing DNA copy number profiles measured from microarray or sequencing technologies. In the absence of rich enough gold standard data sets, the performance of these methods is generally assessed using unrealistic simulation stu...

Descripción completa

Detalles Bibliográficos
Autores principales: Pierre-Jean, Morgane, Rigaill, Guillem, Neuvial, Pierre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4501247/
https://www.ncbi.nlm.nih.gov/pubmed/25202135
http://dx.doi.org/10.1093/bib/bbu026
_version_ 1782381038768488448
author Pierre-Jean, Morgane
Rigaill, Guillem
Neuvial, Pierre
author_facet Pierre-Jean, Morgane
Rigaill, Guillem
Neuvial, Pierre
author_sort Pierre-Jean, Morgane
collection PubMed
description A number of bioinformatic or biostatistical methods are available for analyzing DNA copy number profiles measured from microarray or sequencing technologies. In the absence of rich enough gold standard data sets, the performance of these methods is generally assessed using unrealistic simulation studies, or based on small real data analyses. To make an objective and reproducible performance assessment, we have designed and implemented a framework to generate realistic DNA copy number profiles of cancer samples with known truth. These profiles are generated by resampling publicly available SNP microarray data from genomic regions with known copy-number state. The original data have been extracted from dilutions series of tumor cell lines with matched blood samples at several concentrations. Therefore, the signal-to-noise ratio of the generated profiles can be controlled through the (known) percentage of tumor cells in the sample. This article describes this framework and its application to a comparison study between methods for segmenting DNA copy number profiles from SNP microarrays. This study indicates that no single method is uniformly better than all others. It also helps identifying pros and cons of the compared methods as a function of biologically informative parameters, such as the fraction of tumor cells in the sample and the proportion of heterozygous markers. This comparison study may be reproduced using the open source and cross-platform R package jointseg, which implements the proposed data generation and evaluation framework: http://r-forge.r-project.org/R/?group_id=1562.
format Online
Article
Text
id pubmed-4501247
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-45012472015-07-16 Performance evaluation of DNA copy number segmentation methods Pierre-Jean, Morgane Rigaill, Guillem Neuvial, Pierre Brief Bioinform Papers A number of bioinformatic or biostatistical methods are available for analyzing DNA copy number profiles measured from microarray or sequencing technologies. In the absence of rich enough gold standard data sets, the performance of these methods is generally assessed using unrealistic simulation studies, or based on small real data analyses. To make an objective and reproducible performance assessment, we have designed and implemented a framework to generate realistic DNA copy number profiles of cancer samples with known truth. These profiles are generated by resampling publicly available SNP microarray data from genomic regions with known copy-number state. The original data have been extracted from dilutions series of tumor cell lines with matched blood samples at several concentrations. Therefore, the signal-to-noise ratio of the generated profiles can be controlled through the (known) percentage of tumor cells in the sample. This article describes this framework and its application to a comparison study between methods for segmenting DNA copy number profiles from SNP microarrays. This study indicates that no single method is uniformly better than all others. It also helps identifying pros and cons of the compared methods as a function of biologically informative parameters, such as the fraction of tumor cells in the sample and the proportion of heterozygous markers. This comparison study may be reproduced using the open source and cross-platform R package jointseg, which implements the proposed data generation and evaluation framework: http://r-forge.r-project.org/R/?group_id=1562. Oxford University Press 2015-07 2014-09-08 /pmc/articles/PMC4501247/ /pubmed/25202135 http://dx.doi.org/10.1093/bib/bbu026 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Papers
Pierre-Jean, Morgane
Rigaill, Guillem
Neuvial, Pierre
Performance evaluation of DNA copy number segmentation methods
title Performance evaluation of DNA copy number segmentation methods
title_full Performance evaluation of DNA copy number segmentation methods
title_fullStr Performance evaluation of DNA copy number segmentation methods
title_full_unstemmed Performance evaluation of DNA copy number segmentation methods
title_short Performance evaluation of DNA copy number segmentation methods
title_sort performance evaluation of dna copy number segmentation methods
topic Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4501247/
https://www.ncbi.nlm.nih.gov/pubmed/25202135
http://dx.doi.org/10.1093/bib/bbu026
work_keys_str_mv AT pierrejeanmorgane performanceevaluationofdnacopynumbersegmentationmethods
AT rigaillguillem performanceevaluationofdnacopynumbersegmentationmethods
AT neuvialpierre performanceevaluationofdnacopynumbersegmentationmethods