Cargando…

Empirical validation of the S-Score algorithm in the analysis of gene expression data

BACKGROUND: Current methods of analyzing Affymetrix GeneChip(® )microarray data require the estimation of probe set expression summaries, followed by application of statistical tests to determine which genes are differentially expressed. The S-Score algorithm described by Zhang and colleagues is an...

Descripción completa

Detalles Bibliográficos
Autores principales: Kennedy, Richard E, Archer, Kellie J, Miles, Michael F
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1550434/
https://www.ncbi.nlm.nih.gov/pubmed/16545131
http://dx.doi.org/10.1186/1471-2105-7-154
_version_ 1782129230714241024
author Kennedy, Richard E
Archer, Kellie J
Miles, Michael F
author_facet Kennedy, Richard E
Archer, Kellie J
Miles, Michael F
author_sort Kennedy, Richard E
collection PubMed
description BACKGROUND: Current methods of analyzing Affymetrix GeneChip(® )microarray data require the estimation of probe set expression summaries, followed by application of statistical tests to determine which genes are differentially expressed. The S-Score algorithm described by Zhang and colleagues is an alternative method that allows tests of hypotheses directly from probe level data. It is based on an error model in which the detected signal is proportional to the probe pair signal for highly expressed genes, but approaches a background level (rather than 0) for genes with low levels of expression. This model is used to calculate relative change in probe pair intensities that converts probe signals into multiple measurements with equalized errors, which are summed over a probe set to form the S-Score. Assuming no expression differences between chips, the S-Score follows a standard normal distribution, allowing direct tests of hypotheses to be made. Using spike-in and dilution datasets, we validated the S-Score method against comparisons of gene expression utilizing the more recently developed methods RMA, dChip, and MAS5. RESULTS: The S-score showed excellent sensitivity and specificity in detecting low-level gene expression changes. Rank ordering of S-Score values more accurately reflected known fold-change values compared to other algorithms. CONCLUSION: The S-score method, utilizing probe level data directly, offers significant advantages over comparisons using only probe set expression summaries.
format Text
id pubmed-1550434
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15504342006-08-19 Empirical validation of the S-Score algorithm in the analysis of gene expression data Kennedy, Richard E Archer, Kellie J Miles, Michael F BMC Bioinformatics Methodology Article BACKGROUND: Current methods of analyzing Affymetrix GeneChip(® )microarray data require the estimation of probe set expression summaries, followed by application of statistical tests to determine which genes are differentially expressed. The S-Score algorithm described by Zhang and colleagues is an alternative method that allows tests of hypotheses directly from probe level data. It is based on an error model in which the detected signal is proportional to the probe pair signal for highly expressed genes, but approaches a background level (rather than 0) for genes with low levels of expression. This model is used to calculate relative change in probe pair intensities that converts probe signals into multiple measurements with equalized errors, which are summed over a probe set to form the S-Score. Assuming no expression differences between chips, the S-Score follows a standard normal distribution, allowing direct tests of hypotheses to be made. Using spike-in and dilution datasets, we validated the S-Score method against comparisons of gene expression utilizing the more recently developed methods RMA, dChip, and MAS5. RESULTS: The S-score showed excellent sensitivity and specificity in detecting low-level gene expression changes. Rank ordering of S-Score values more accurately reflected known fold-change values compared to other algorithms. CONCLUSION: The S-score method, utilizing probe level data directly, offers significant advantages over comparisons using only probe set expression summaries. BioMed Central 2006-03-17 /pmc/articles/PMC1550434/ /pubmed/16545131 http://dx.doi.org/10.1186/1471-2105-7-154 Text en Copyright © 2006 Kennedy et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Kennedy, Richard E
Archer, Kellie J
Miles, Michael F
Empirical validation of the S-Score algorithm in the analysis of gene expression data
title Empirical validation of the S-Score algorithm in the analysis of gene expression data
title_full Empirical validation of the S-Score algorithm in the analysis of gene expression data
title_fullStr Empirical validation of the S-Score algorithm in the analysis of gene expression data
title_full_unstemmed Empirical validation of the S-Score algorithm in the analysis of gene expression data
title_short Empirical validation of the S-Score algorithm in the analysis of gene expression data
title_sort empirical validation of the s-score algorithm in the analysis of gene expression data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1550434/
https://www.ncbi.nlm.nih.gov/pubmed/16545131
http://dx.doi.org/10.1186/1471-2105-7-154
work_keys_str_mv AT kennedyricharde empiricalvalidationofthesscorealgorithmintheanalysisofgeneexpressiondata
AT archerkelliej empiricalvalidationofthesscorealgorithmintheanalysisofgeneexpressiondata
AT milesmichaelf empiricalvalidationofthesscorealgorithmintheanalysisofgeneexpressiondata