Cargando…

Assessing affymetrix GeneChip microarray quality

BACKGROUND: Microarray technology has become a widely used tool in the biological sciences. Over the past decade, the number of users has grown exponentially, and with the number of applications and secondary data analyses rapidly increasing, we expect this rate to continue. Various initiatives such...

Descripción completa

Detalles Bibliográficos
Autores principales: McCall, Matthew N, Murakami, Peter N, Lukk, Margus, Huber, Wolfgang, Irizarry, Rafael A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3097162/
https://www.ncbi.nlm.nih.gov/pubmed/21548974
http://dx.doi.org/10.1186/1471-2105-12-137
_version_ 1782203792722231296
author McCall, Matthew N
Murakami, Peter N
Lukk, Margus
Huber, Wolfgang
Irizarry, Rafael A
author_facet McCall, Matthew N
Murakami, Peter N
Lukk, Margus
Huber, Wolfgang
Irizarry, Rafael A
author_sort McCall, Matthew N
collection PubMed
description BACKGROUND: Microarray technology has become a widely used tool in the biological sciences. Over the past decade, the number of users has grown exponentially, and with the number of applications and secondary data analyses rapidly increasing, we expect this rate to continue. Various initiatives such as the External RNA Control Consortium (ERCC) and the MicroArray Quality Control (MAQC) project have explored ways to provide standards for the technology. For microarrays to become generally accepted as a reliable technology, statistical methods for assessing quality will be an indispensable component; however, there remains a lack of consensus in both defining and measuring microarray quality. RESULTS: We begin by providing a precise definition of microarray quality and reviewing existing Affymetrix GeneChip quality metrics in light of this definition. We show that the best-performing metrics require multiple arrays to be assessed simultaneously. While such multi-array quality metrics are adequate for bench science, as microarrays begin to be used in clinical settings, single-array quality metrics will be indispensable. To this end, we define a single-array version of one of the best multi-array quality metrics and show that this metric performs as well as the best multi-array metrics. We then use this new quality metric to assess the quality of microarry data available via the Gene Expression Omnibus (GEO) using more than 22,000 Affymetrix HGU133a and HGU133plus2 arrays from 809 studies. CONCLUSIONS: We find that approximately 10 percent of these publicly available arrays are of poor quality. Moreover, the quality of microarray measurements varies greatly from hybridization to hybridization, study to study, and lab to lab, with some experiments producing unusable data. Many of the concepts described here are applicable to other high-throughput technologies.
format Text
id pubmed-3097162
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30971622011-05-19 Assessing affymetrix GeneChip microarray quality McCall, Matthew N Murakami, Peter N Lukk, Margus Huber, Wolfgang Irizarry, Rafael A BMC Bioinformatics Methodology Article BACKGROUND: Microarray technology has become a widely used tool in the biological sciences. Over the past decade, the number of users has grown exponentially, and with the number of applications and secondary data analyses rapidly increasing, we expect this rate to continue. Various initiatives such as the External RNA Control Consortium (ERCC) and the MicroArray Quality Control (MAQC) project have explored ways to provide standards for the technology. For microarrays to become generally accepted as a reliable technology, statistical methods for assessing quality will be an indispensable component; however, there remains a lack of consensus in both defining and measuring microarray quality. RESULTS: We begin by providing a precise definition of microarray quality and reviewing existing Affymetrix GeneChip quality metrics in light of this definition. We show that the best-performing metrics require multiple arrays to be assessed simultaneously. While such multi-array quality metrics are adequate for bench science, as microarrays begin to be used in clinical settings, single-array quality metrics will be indispensable. To this end, we define a single-array version of one of the best multi-array quality metrics and show that this metric performs as well as the best multi-array metrics. We then use this new quality metric to assess the quality of microarry data available via the Gene Expression Omnibus (GEO) using more than 22,000 Affymetrix HGU133a and HGU133plus2 arrays from 809 studies. CONCLUSIONS: We find that approximately 10 percent of these publicly available arrays are of poor quality. Moreover, the quality of microarray measurements varies greatly from hybridization to hybridization, study to study, and lab to lab, with some experiments producing unusable data. Many of the concepts described here are applicable to other high-throughput technologies. BioMed Central 2011-05-07 /pmc/articles/PMC3097162/ /pubmed/21548974 http://dx.doi.org/10.1186/1471-2105-12-137 Text en Copyright ©2011 McCall et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
McCall, Matthew N
Murakami, Peter N
Lukk, Margus
Huber, Wolfgang
Irizarry, Rafael A
Assessing affymetrix GeneChip microarray quality
title Assessing affymetrix GeneChip microarray quality
title_full Assessing affymetrix GeneChip microarray quality
title_fullStr Assessing affymetrix GeneChip microarray quality
title_full_unstemmed Assessing affymetrix GeneChip microarray quality
title_short Assessing affymetrix GeneChip microarray quality
title_sort assessing affymetrix genechip microarray quality
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3097162/
https://www.ncbi.nlm.nih.gov/pubmed/21548974
http://dx.doi.org/10.1186/1471-2105-12-137
work_keys_str_mv AT mccallmatthewn assessingaffymetrixgenechipmicroarrayquality
AT murakamipetern assessingaffymetrixgenechipmicroarrayquality
AT lukkmargus assessingaffymetrixgenechipmicroarrayquality
AT huberwolfgang assessingaffymetrixgenechipmicroarrayquality
AT irizarryrafaela assessingaffymetrixgenechipmicroarrayquality