Cargando…

A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability

BACKGROUND: Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for...

Descripción completa

Detalles Bibliográficos
Autores principales: Sontrop, Herman MJ, Moerland, Perry D, van den Ham, René, Reinders, Marcel JT, Verhaegh, Wim FJ
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2789744/
https://www.ncbi.nlm.nih.gov/pubmed/19941644
http://dx.doi.org/10.1186/1471-2105-10-389
_version_ 1782175069481467904
author Sontrop, Herman MJ
Moerland, Perry D
van den Ham, René
Reinders, Marcel JT
Verhaegh, Wim FJ
author_facet Sontrop, Herman MJ
Moerland, Perry D
van den Ham, René
Reinders, Marcel JT
Verhaegh, Wim FJ
author_sort Sontrop, Herman MJ
collection PubMed
description BACKGROUND: Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. RESULTS: We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical. CONCLUSION: Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments.
format Text
id pubmed-2789744
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27897442009-12-08 A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability Sontrop, Herman MJ Moerland, Perry D van den Ham, René Reinders, Marcel JT Verhaegh, Wim FJ BMC Bioinformatics Research article BACKGROUND: Large discrepancies in signature composition and outcome concordance have been observed between different microarray breast cancer expression profiling studies. This is often ascribed to differences in array platform as well as biological variability. We conjecture that other reasons for the observed discrepancies are the measurement error associated with each feature and the choice of preprocessing method. Microarray data are known to be subject to technical variation and the confidence intervals around individual point estimates of expression levels can be wide. Furthermore, the estimated expression values also vary depending on the selected preprocessing scheme. In microarray breast cancer classification studies, however, these two forms of feature variability are almost always ignored and hence their exact role is unclear. RESULTS: We have performed a comprehensive sensitivity analysis of microarray breast cancer classification under the two types of feature variability mentioned above. We used data from six state of the art preprocessing methods, using a compendium consisting of eight diferent datasets, involving 1131 hybridizations, containing data from both one and two-color array technology. For a wide range of classifiers, we performed a joint study on performance, concordance and stability. In the stability analysis we explicitly tested classifiers for their noise tolerance by using perturbed expression profiles that are based on uncertainty information directly related to the preprocessing methods. Our results indicate that signature composition is strongly influenced by feature variability, even if the array platform and the stratification of patient samples are identical. In addition, we show that there is often a high level of discordance between individual class assignments for signatures constructed on data coming from different preprocessing schemes, even if the actual signature composition is identical. CONCLUSION: Feature variability can have a strong impact on breast cancer signature composition, as well as the classification of individual patient samples. We therefore strongly recommend that feature variability is considered in analyzing data from microarray breast cancer expression profiling experiments. BioMed Central 2009-11-26 /pmc/articles/PMC2789744/ /pubmed/19941644 http://dx.doi.org/10.1186/1471-2105-10-389 Text en Copyright ©2009 Sontrop et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Sontrop, Herman MJ
Moerland, Perry D
van den Ham, René
Reinders, Marcel JT
Verhaegh, Wim FJ
A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability
title A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability
title_full A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability
title_fullStr A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability
title_full_unstemmed A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability
title_short A comprehensive sensitivity analysis of microarray breast cancer classification under feature variability
title_sort comprehensive sensitivity analysis of microarray breast cancer classification under feature variability
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2789744/
https://www.ncbi.nlm.nih.gov/pubmed/19941644
http://dx.doi.org/10.1186/1471-2105-10-389
work_keys_str_mv AT sontrophermanmj acomprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability
AT moerlandperryd acomprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability
AT vandenhamrene acomprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability
AT reindersmarceljt acomprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability
AT verhaeghwimfj acomprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability
AT sontrophermanmj comprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability
AT moerlandperryd comprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability
AT vandenhamrene comprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability
AT reindersmarceljt comprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability
AT verhaeghwimfj comprehensivesensitivityanalysisofmicroarraybreastcancerclassificationunderfeaturevariability