Cargando…

Assessment of the relationship between pre-chip and post-chip quality measures for Affymetrix GeneChip expression data

BACKGROUND: Gene expression microarray experiments are expensive to conduct and guidelines for acceptable quality control at intermediate steps before and after the samples are hybridised to chips are vague. We conducted an experiment hybridising RNA from human brain to 117 U133A Affymetrix GeneChip...

Descripción completa

Detalles Bibliográficos
Autores principales: Jones, Lesley, Goldstein, Darlene R, Hughes, Gareth, Strand, Andrew D, Collin, Francois, Dunnett, Stephen B, Kooperberg, Charles, Aragaki, Aaron, Olson, James M, Augood, Sarah J, Faull, Richard LM, Luthi-Carter, Ruth, Moskvina, Valentina, Hodges, Angela K
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1524996/
https://www.ncbi.nlm.nih.gov/pubmed/16623940
http://dx.doi.org/10.1186/1471-2105-7-211
_version_ 1782128875202936832
author Jones, Lesley
Goldstein, Darlene R
Hughes, Gareth
Strand, Andrew D
Collin, Francois
Dunnett, Stephen B
Kooperberg, Charles
Aragaki, Aaron
Olson, James M
Augood, Sarah J
Faull, Richard LM
Luthi-Carter, Ruth
Moskvina, Valentina
Hodges, Angela K
author_facet Jones, Lesley
Goldstein, Darlene R
Hughes, Gareth
Strand, Andrew D
Collin, Francois
Dunnett, Stephen B
Kooperberg, Charles
Aragaki, Aaron
Olson, James M
Augood, Sarah J
Faull, Richard LM
Luthi-Carter, Ruth
Moskvina, Valentina
Hodges, Angela K
author_sort Jones, Lesley
collection PubMed
description BACKGROUND: Gene expression microarray experiments are expensive to conduct and guidelines for acceptable quality control at intermediate steps before and after the samples are hybridised to chips are vague. We conducted an experiment hybridising RNA from human brain to 117 U133A Affymetrix GeneChips and used these data to explore the relationship between 4 pre-chip variables and 22 post-chip outcomes and quality control measures. RESULTS: We found that the pre-chip variables were significantly correlated with each other but that this correlation was strongest between measures of RNA quality and cRNA yield. Post-mortem interval was negatively correlated with these variables. Four principal components, reflecting array outliers, array adjustment, hybridisation noise and RNA integrity, explain about 75% of the total post-chip measure variability. Two significant canonical correlations existed between the pre-chip and post-chip variables, derived from MAS 5.0, dChip and the Bioconductor packages affy and affyPLM. The strongest (CANCOR 0.838, p < 0.0001) correlated RNA integrity and yield with post chip quality control (QC) measures indexing 3'/5' RNA ratios, bias or scaling of the chip and scaling of the variability of the signal across the chip. Post-mortem interval was relatively unimportant. We also found that the RNA integrity number (RIN) could be moderately well predicted by post-chip measures B_ACTIN35, GAPDH35 and SF. CONCLUSION: We have found that the post-chip variables having the strongest association with quantities measurable before hybridisation are those reflecting RNA integrity. Other aspects of quality, such as noise measures (reflecting the execution of the assay) or measures reflecting data quality (outlier status and array adjustment variables) are not well predicted by the variables we were able to determine ahead of time. There could be other variables measurable pre-hybridisation which may be better associated with expression data quality measures. Uncovering such connections could create savings on costly microarray experiments by eliminating poor samples before hybridisation.
format Text
id pubmed-1524996
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15249962006-08-01 Assessment of the relationship between pre-chip and post-chip quality measures for Affymetrix GeneChip expression data Jones, Lesley Goldstein, Darlene R Hughes, Gareth Strand, Andrew D Collin, Francois Dunnett, Stephen B Kooperberg, Charles Aragaki, Aaron Olson, James M Augood, Sarah J Faull, Richard LM Luthi-Carter, Ruth Moskvina, Valentina Hodges, Angela K BMC Bioinformatics Methodology Article BACKGROUND: Gene expression microarray experiments are expensive to conduct and guidelines for acceptable quality control at intermediate steps before and after the samples are hybridised to chips are vague. We conducted an experiment hybridising RNA from human brain to 117 U133A Affymetrix GeneChips and used these data to explore the relationship between 4 pre-chip variables and 22 post-chip outcomes and quality control measures. RESULTS: We found that the pre-chip variables were significantly correlated with each other but that this correlation was strongest between measures of RNA quality and cRNA yield. Post-mortem interval was negatively correlated with these variables. Four principal components, reflecting array outliers, array adjustment, hybridisation noise and RNA integrity, explain about 75% of the total post-chip measure variability. Two significant canonical correlations existed between the pre-chip and post-chip variables, derived from MAS 5.0, dChip and the Bioconductor packages affy and affyPLM. The strongest (CANCOR 0.838, p < 0.0001) correlated RNA integrity and yield with post chip quality control (QC) measures indexing 3'/5' RNA ratios, bias or scaling of the chip and scaling of the variability of the signal across the chip. Post-mortem interval was relatively unimportant. We also found that the RNA integrity number (RIN) could be moderately well predicted by post-chip measures B_ACTIN35, GAPDH35 and SF. CONCLUSION: We have found that the post-chip variables having the strongest association with quantities measurable before hybridisation are those reflecting RNA integrity. Other aspects of quality, such as noise measures (reflecting the execution of the assay) or measures reflecting data quality (outlier status and array adjustment variables) are not well predicted by the variables we were able to determine ahead of time. There could be other variables measurable pre-hybridisation which may be better associated with expression data quality measures. Uncovering such connections could create savings on costly microarray experiments by eliminating poor samples before hybridisation. BioMed Central 2006-04-19 /pmc/articles/PMC1524996/ /pubmed/16623940 http://dx.doi.org/10.1186/1471-2105-7-211 Text en Copyright © 2006 Jones et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Jones, Lesley
Goldstein, Darlene R
Hughes, Gareth
Strand, Andrew D
Collin, Francois
Dunnett, Stephen B
Kooperberg, Charles
Aragaki, Aaron
Olson, James M
Augood, Sarah J
Faull, Richard LM
Luthi-Carter, Ruth
Moskvina, Valentina
Hodges, Angela K
Assessment of the relationship between pre-chip and post-chip quality measures for Affymetrix GeneChip expression data
title Assessment of the relationship between pre-chip and post-chip quality measures for Affymetrix GeneChip expression data
title_full Assessment of the relationship between pre-chip and post-chip quality measures for Affymetrix GeneChip expression data
title_fullStr Assessment of the relationship between pre-chip and post-chip quality measures for Affymetrix GeneChip expression data
title_full_unstemmed Assessment of the relationship between pre-chip and post-chip quality measures for Affymetrix GeneChip expression data
title_short Assessment of the relationship between pre-chip and post-chip quality measures for Affymetrix GeneChip expression data
title_sort assessment of the relationship between pre-chip and post-chip quality measures for affymetrix genechip expression data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1524996/
https://www.ncbi.nlm.nih.gov/pubmed/16623940
http://dx.doi.org/10.1186/1471-2105-7-211
work_keys_str_mv AT joneslesley assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT goldsteindarlener assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT hughesgareth assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT strandandrewd assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT collinfrancois assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT dunnettstephenb assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT kooperbergcharles assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT aragakiaaron assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT olsonjamesm assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT augoodsarahj assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT faullrichardlm assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT luthicarterruth assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT moskvinavalentina assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata
AT hodgesangelak assessmentoftherelationshipbetweenprechipandpostchipqualitymeasuresforaffymetrixgenechipexpressiondata