Cargando…

A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6

Motivation: High-resolution copy-number (CN) analysis has in recent years gained much attention, not only for the purpose of identifying CN aberrations associated with a certain phenotype, but also for identifying CN polymorphisms. In order for such studies to be successful and cost effective, the s...

Descripción completa

Detalles Bibliográficos
Autores principales: Bengtsson, Henrik, Wirapati, Pratyaksha, Speed, Terence P.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2734319/
https://www.ncbi.nlm.nih.gov/pubmed/19535535
http://dx.doi.org/10.1093/bioinformatics/btp371
_version_ 1782171145154330624
author Bengtsson, Henrik
Wirapati, Pratyaksha
Speed, Terence P.
author_facet Bengtsson, Henrik
Wirapati, Pratyaksha
Speed, Terence P.
author_sort Bengtsson, Henrik
collection PubMed
description Motivation: High-resolution copy-number (CN) analysis has in recent years gained much attention, not only for the purpose of identifying CN aberrations associated with a certain phenotype, but also for identifying CN polymorphisms. In order for such studies to be successful and cost effective, the statistical methods have to be optimized. We propose a single-array preprocessing method for estimating full-resolution total CNs. It is applicable to all Affymetrix genotyping arrays, including the recent ones that also contain non-polymorphic probes. A reference signal is only needed at the last step when calculating relative CNs. Results: As with our method for earlier generations of arrays, this one controls for allelic crosstalk, probe affinities and PCR fragment-length effects. Additionally, it also corrects for probe sequence effects and co-hybridization of fragments digested by multiple enzymes that takes place on the latest chips. We compare our method with Affymetrix's CN5 method and the dChip method by assessing how well they differentiate between various CN states at the full resolution and various amounts of smoothing. Although CRMA v2 is a single-array method, we observe that it performs as well as or better than alternative methods that use data from all arrays for their preprocessing. This shows that it is possible to do online analysis in large-scale projects where additional arrays are introduced over time. Availability: A bounded-memory implementation that can process any number of arrays is available in the open source R package aroma.affymetrix. Contact: hb@stat.berkeley.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2734319
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27343192009-08-31 A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6 Bengtsson, Henrik Wirapati, Pratyaksha Speed, Terence P. Bioinformatics Original Papers Motivation: High-resolution copy-number (CN) analysis has in recent years gained much attention, not only for the purpose of identifying CN aberrations associated with a certain phenotype, but also for identifying CN polymorphisms. In order for such studies to be successful and cost effective, the statistical methods have to be optimized. We propose a single-array preprocessing method for estimating full-resolution total CNs. It is applicable to all Affymetrix genotyping arrays, including the recent ones that also contain non-polymorphic probes. A reference signal is only needed at the last step when calculating relative CNs. Results: As with our method for earlier generations of arrays, this one controls for allelic crosstalk, probe affinities and PCR fragment-length effects. Additionally, it also corrects for probe sequence effects and co-hybridization of fragments digested by multiple enzymes that takes place on the latest chips. We compare our method with Affymetrix's CN5 method and the dChip method by assessing how well they differentiate between various CN states at the full resolution and various amounts of smoothing. Although CRMA v2 is a single-array method, we observe that it performs as well as or better than alternative methods that use data from all arrays for their preprocessing. This shows that it is possible to do online analysis in large-scale projects where additional arrays are introduced over time. Availability: A bounded-memory implementation that can process any number of arrays is available in the open source R package aroma.affymetrix. Contact: hb@stat.berkeley.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2009-09-01 2009-06-17 /pmc/articles/PMC2734319/ /pubmed/19535535 http://dx.doi.org/10.1093/bioinformatics/btp371 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Bengtsson, Henrik
Wirapati, Pratyaksha
Speed, Terence P.
A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6
title A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6
title_full A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6
title_fullStr A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6
title_full_unstemmed A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6
title_short A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6
title_sort single-array preprocessing method for estimating full-resolution raw copy numbers from all affymetrix genotyping arrays including genomewidesnp 5 & 6
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2734319/
https://www.ncbi.nlm.nih.gov/pubmed/19535535
http://dx.doi.org/10.1093/bioinformatics/btp371
work_keys_str_mv AT bengtssonhenrik asinglearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56
AT wirapatipratyaksha asinglearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56
AT speedterencep asinglearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56
AT bengtssonhenrik singlearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56
AT wirapatipratyaksha singlearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56
AT speedterencep singlearraypreprocessingmethodforestimatingfullresolutionrawcopynumbersfromallaffymetrixgenotypingarraysincludinggenomewidesnp56