Cargando…

Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data

BACKGROUND: Genomic deletions and duplications are important in the pathogenesis of diseases, such as cancer and mental retardation, and have recently been shown to occur frequently in unaffected individuals as polymorphisms. Affymetrix GeneChip whole genome sampling analysis (WGSA) combined with 10...

Descripción completa

Detalles Bibliográficos
Autores principales: Baross, Ágnes, Delaney, Allen D, Li, H Irene, Nayar, Tarun, Flibotte, Stephane, Qian, Hong, Chan, Susanna Y, Asano, Jennifer, Ally, Adrian, Cao, Manqiu, Birch, Patricia, Brown-John, Mabel, Fernandes, Nicole, Go, Anne, Kennedy, Giulia, Langlois, Sylvie, Eydoux, Patrice, Friedman, JM, Marra, Marco A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2148068/
https://www.ncbi.nlm.nih.gov/pubmed/17910767
http://dx.doi.org/10.1186/1471-2105-8-368
_version_ 1782144501161132032
author Baross, Ágnes
Delaney, Allen D
Li, H Irene
Nayar, Tarun
Flibotte, Stephane
Qian, Hong
Chan, Susanna Y
Asano, Jennifer
Ally, Adrian
Cao, Manqiu
Birch, Patricia
Brown-John, Mabel
Fernandes, Nicole
Go, Anne
Kennedy, Giulia
Langlois, Sylvie
Eydoux, Patrice
Friedman, JM
Marra, Marco A
author_facet Baross, Ágnes
Delaney, Allen D
Li, H Irene
Nayar, Tarun
Flibotte, Stephane
Qian, Hong
Chan, Susanna Y
Asano, Jennifer
Ally, Adrian
Cao, Manqiu
Birch, Patricia
Brown-John, Mabel
Fernandes, Nicole
Go, Anne
Kennedy, Giulia
Langlois, Sylvie
Eydoux, Patrice
Friedman, JM
Marra, Marco A
author_sort Baross, Ágnes
collection PubMed
description BACKGROUND: Genomic deletions and duplications are important in the pathogenesis of diseases, such as cancer and mental retardation, and have recently been shown to occur frequently in unaffected individuals as polymorphisms. Affymetrix GeneChip whole genome sampling analysis (WGSA) combined with 100 K single nucleotide polymorphism (SNP) genotyping arrays is one of several microarray-based approaches that are now being used to detect such structural genomic changes. The popularity of this technology and its associated open source data format have resulted in the development of an increasing number of software packages for the analysis of copy number changes using these SNP arrays. RESULTS: We evaluated four publicly available software packages for high throughput copy number analysis using synthetic and empirical 100 K SNP array data sets, the latter obtained from 107 mental retardation (MR) patients and their unaffected parents and siblings. We evaluated the software with regards to overall suitability for high-throughput 100 K SNP array data analysis, as well as effectiveness of normalization, scaling with various reference sets and feature extraction, as well as true and false positive rates of genomic copy number variant (CNV) detection. CONCLUSION: We observed considerable variation among the numbers and types of candidate CNVs detected by different analysis approaches, and found that multiple programs were needed to find all real aberrations in our test set. The frequency of false positive deletions was substantial, but could be greatly reduced by using the SNP genotype information to confirm loss of heterozygosity.
format Text
id pubmed-2148068
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-21480682007-12-20 Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data Baross, Ágnes Delaney, Allen D Li, H Irene Nayar, Tarun Flibotte, Stephane Qian, Hong Chan, Susanna Y Asano, Jennifer Ally, Adrian Cao, Manqiu Birch, Patricia Brown-John, Mabel Fernandes, Nicole Go, Anne Kennedy, Giulia Langlois, Sylvie Eydoux, Patrice Friedman, JM Marra, Marco A BMC Bioinformatics Research Article BACKGROUND: Genomic deletions and duplications are important in the pathogenesis of diseases, such as cancer and mental retardation, and have recently been shown to occur frequently in unaffected individuals as polymorphisms. Affymetrix GeneChip whole genome sampling analysis (WGSA) combined with 100 K single nucleotide polymorphism (SNP) genotyping arrays is one of several microarray-based approaches that are now being used to detect such structural genomic changes. The popularity of this technology and its associated open source data format have resulted in the development of an increasing number of software packages for the analysis of copy number changes using these SNP arrays. RESULTS: We evaluated four publicly available software packages for high throughput copy number analysis using synthetic and empirical 100 K SNP array data sets, the latter obtained from 107 mental retardation (MR) patients and their unaffected parents and siblings. We evaluated the software with regards to overall suitability for high-throughput 100 K SNP array data analysis, as well as effectiveness of normalization, scaling with various reference sets and feature extraction, as well as true and false positive rates of genomic copy number variant (CNV) detection. CONCLUSION: We observed considerable variation among the numbers and types of candidate CNVs detected by different analysis approaches, and found that multiple programs were needed to find all real aberrations in our test set. The frequency of false positive deletions was substantial, but could be greatly reduced by using the SNP genotype information to confirm loss of heterozygosity. BioMed Central 2007-10-02 /pmc/articles/PMC2148068/ /pubmed/17910767 http://dx.doi.org/10.1186/1471-2105-8-368 Text en Copyright © 2007 Baross et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Baross, Ágnes
Delaney, Allen D
Li, H Irene
Nayar, Tarun
Flibotte, Stephane
Qian, Hong
Chan, Susanna Y
Asano, Jennifer
Ally, Adrian
Cao, Manqiu
Birch, Patricia
Brown-John, Mabel
Fernandes, Nicole
Go, Anne
Kennedy, Giulia
Langlois, Sylvie
Eydoux, Patrice
Friedman, JM
Marra, Marco A
Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data
title Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data
title_full Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data
title_fullStr Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data
title_full_unstemmed Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data
title_short Assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data
title_sort assessment of algorithms for high throughput detection of genomic copy number variation in oligonucleotide microarray data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2148068/
https://www.ncbi.nlm.nih.gov/pubmed/17910767
http://dx.doi.org/10.1186/1471-2105-8-368
work_keys_str_mv AT barossagnes assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT delaneyallend assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT lihirene assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT nayartarun assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT flibottestephane assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT qianhong assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT chansusannay assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT asanojennifer assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT allyadrian assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT caomanqiu assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT birchpatricia assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT brownjohnmabel assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT fernandesnicole assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT goanne assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT kennedygiulia assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT langloissylvie assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT eydouxpatrice assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT friedmanjm assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata
AT marramarcoa assessmentofalgorithmsforhighthroughputdetectionofgenomiccopynumbervariationinoligonucleotidemicroarraydata