Cargando…
Performance evaluation of DNA copy number segmentation methods
A number of bioinformatic or biostatistical methods are available for analyzing DNA copy number profiles measured from microarray or sequencing technologies. In the absence of rich enough gold standard data sets, the performance of these methods is generally assessed using unrealistic simulation stu...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4501247/ https://www.ncbi.nlm.nih.gov/pubmed/25202135 http://dx.doi.org/10.1093/bib/bbu026 |
_version_ | 1782381038768488448 |
---|---|
author | Pierre-Jean, Morgane Rigaill, Guillem Neuvial, Pierre |
author_facet | Pierre-Jean, Morgane Rigaill, Guillem Neuvial, Pierre |
author_sort | Pierre-Jean, Morgane |
collection | PubMed |
description | A number of bioinformatic or biostatistical methods are available for analyzing DNA copy number profiles measured from microarray or sequencing technologies. In the absence of rich enough gold standard data sets, the performance of these methods is generally assessed using unrealistic simulation studies, or based on small real data analyses. To make an objective and reproducible performance assessment, we have designed and implemented a framework to generate realistic DNA copy number profiles of cancer samples with known truth. These profiles are generated by resampling publicly available SNP microarray data from genomic regions with known copy-number state. The original data have been extracted from dilutions series of tumor cell lines with matched blood samples at several concentrations. Therefore, the signal-to-noise ratio of the generated profiles can be controlled through the (known) percentage of tumor cells in the sample. This article describes this framework and its application to a comparison study between methods for segmenting DNA copy number profiles from SNP microarrays. This study indicates that no single method is uniformly better than all others. It also helps identifying pros and cons of the compared methods as a function of biologically informative parameters, such as the fraction of tumor cells in the sample and the proportion of heterozygous markers. This comparison study may be reproduced using the open source and cross-platform R package jointseg, which implements the proposed data generation and evaluation framework: http://r-forge.r-project.org/R/?group_id=1562. |
format | Online Article Text |
id | pubmed-4501247 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-45012472015-07-16 Performance evaluation of DNA copy number segmentation methods Pierre-Jean, Morgane Rigaill, Guillem Neuvial, Pierre Brief Bioinform Papers A number of bioinformatic or biostatistical methods are available for analyzing DNA copy number profiles measured from microarray or sequencing technologies. In the absence of rich enough gold standard data sets, the performance of these methods is generally assessed using unrealistic simulation studies, or based on small real data analyses. To make an objective and reproducible performance assessment, we have designed and implemented a framework to generate realistic DNA copy number profiles of cancer samples with known truth. These profiles are generated by resampling publicly available SNP microarray data from genomic regions with known copy-number state. The original data have been extracted from dilutions series of tumor cell lines with matched blood samples at several concentrations. Therefore, the signal-to-noise ratio of the generated profiles can be controlled through the (known) percentage of tumor cells in the sample. This article describes this framework and its application to a comparison study between methods for segmenting DNA copy number profiles from SNP microarrays. This study indicates that no single method is uniformly better than all others. It also helps identifying pros and cons of the compared methods as a function of biologically informative parameters, such as the fraction of tumor cells in the sample and the proportion of heterozygous markers. This comparison study may be reproduced using the open source and cross-platform R package jointseg, which implements the proposed data generation and evaluation framework: http://r-forge.r-project.org/R/?group_id=1562. Oxford University Press 2015-07 2014-09-08 /pmc/articles/PMC4501247/ /pubmed/25202135 http://dx.doi.org/10.1093/bib/bbu026 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Papers Pierre-Jean, Morgane Rigaill, Guillem Neuvial, Pierre Performance evaluation of DNA copy number segmentation methods |
title | Performance evaluation of DNA copy number segmentation methods |
title_full | Performance evaluation of DNA copy number segmentation methods |
title_fullStr | Performance evaluation of DNA copy number segmentation methods |
title_full_unstemmed | Performance evaluation of DNA copy number segmentation methods |
title_short | Performance evaluation of DNA copy number segmentation methods |
title_sort | performance evaluation of dna copy number segmentation methods |
topic | Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4501247/ https://www.ncbi.nlm.nih.gov/pubmed/25202135 http://dx.doi.org/10.1093/bib/bbu026 |
work_keys_str_mv | AT pierrejeanmorgane performanceevaluationofdnacopynumbersegmentationmethods AT rigaillguillem performanceevaluationofdnacopynumbersegmentationmethods AT neuvialpierre performanceevaluationofdnacopynumbersegmentationmethods |