Cargando…
A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6
Motivation: High-resolution copy-number (CN) analysis has in recent years gained much attention, not only for the purpose of identifying CN aberrations associated with a certain phenotype, but also for identifying CN polymorphisms. In order for such studies to be successful and cost effective, the s...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2734319/ https://www.ncbi.nlm.nih.gov/pubmed/19535535 http://dx.doi.org/10.1093/bioinformatics/btp371 |
_version_ | 1782171145154330624 |
---|---|
author | Bengtsson, Henrik Wirapati, Pratyaksha Speed, Terence P. |
author_facet | Bengtsson, Henrik Wirapati, Pratyaksha Speed, Terence P. |
author_sort | Bengtsson, Henrik |
collection | PubMed |
description | Motivation: High-resolution copy-number (CN) analysis has in recent years gained much attention, not only for the purpose of identifying CN aberrations associated with a certain phenotype, but also for identifying CN polymorphisms. In order for such studies to be successful and cost effective, the statistical methods have to be optimized. We propose a single-array preprocessing method for estimating full-resolution total CNs. It is applicable to all Affymetrix genotyping arrays, including the recent ones that also contain non-polymorphic probes. A reference signal is only needed at the last step when calculating relative CNs. Results: As with our method for earlier generations of arrays, this one controls for allelic crosstalk, probe affinities and PCR fragment-length effects. Additionally, it also corrects for probe sequence effects and co-hybridization of fragments digested by multiple enzymes that takes place on the latest chips. We compare our method with Affymetrix's CN5 method and the dChip method by assessing how well they differentiate between various CN states at the full resolution and various amounts of smoothing. Although CRMA v2 is a single-array method, we observe that it performs as well as or better than alternative methods that use data from all arrays for their preprocessing. This shows that it is possible to do online analysis in large-scale projects where additional arrays are introduced over time. Availability: A bounded-memory implementation that can process any number of arrays is available in the open source R package aroma.affymetrix. Contact: hb@stat.berkeley.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-2734319 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-27343192009-08-31 A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6 Bengtsson, Henrik Wirapati, Pratyaksha Speed, Terence P. Bioinformatics Original Papers Motivation: High-resolution copy-number (CN) analysis has in recent years gained much attention, not only for the purpose of identifying CN aberrations associated with a certain phenotype, but also for identifying CN polymorphisms. In order for such studies to be successful and cost effective, the statistical methods have to be optimized. We propose a single-array preprocessing method for estimating full-resolution total CNs. It is applicable to all Affymetrix genotyping arrays, including the recent ones that also contain non-polymorphic probes. A reference signal is only needed at the last step when calculating relative CNs. Results: As with our method for earlier generations of arrays, this one controls for allelic crosstalk, probe affinities and PCR fragment-length effects. Additionally, it also corrects for probe sequence effects and co-hybridization of fragments digested by multiple enzymes that takes place on the latest chips. We compare our method with Affymetrix's CN5 method and the dChip method by assessing how well they differentiate between various CN states at the full resolution and various amounts of smoothing. Although CRMA v2 is a single-array method, we observe that it performs as well as or better than alternative methods that use data from all arrays for their preprocessing. This shows that it is possible to do online analysis in large-scale projects where additional arrays are introduced over time. Availability: A bounded-memory implementation that can process any number of arrays is available in the open source R package aroma.affymetrix. Contact: hb@stat.berkeley.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2009-09-01 2009-06-17 /pmc/articles/PMC2734319/ /pubmed/19535535 http://dx.doi.org/10.1093/bioinformatics/btp371 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Bengtsson, Henrik Wirapati, Pratyaksha Speed, Terence P. A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6 |
title | A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6 |
title_full | A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6 |
title_fullStr | A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6 |
title_full_unstemmed | A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6 |
title_short | A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6 |
title_sort | single-array preprocessing method for estimating full-resolution raw copy numbers from all affymetrix genotyping arrays including genomewidesnp 5 & 6 |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2734319/ https://www.ncbi.nlm.nih.gov/pubmed/19535535 http://dx.doi.org/10.1093/bioinformatics/btp371 |
work_keys_str_mv | AT bengtssonhenrik asinglearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56 AT wirapatipratyaksha asinglearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56 AT speedterencep asinglearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56 AT bengtssonhenrik singlearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56 AT wirapatipratyaksha singlearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56 AT speedterencep singlearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56 |