Cargando…

Integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome

BACKGROUND: Gene expression profiles based on microarray data have been suggested by many studies as potential molecular prognostic indexes of breast cancer. However, due to the confounding effect of clinical background, independent studies often obtained inconsistent results. The current study inve...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zhe, Chen, Dechang, Fenstermacher, David A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central|1 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2064937/
https://www.ncbi.nlm.nih.gov/pubmed/17883867
http://dx.doi.org/10.1186/1471-2164-8-331
_version_ 1782137642797760512
author Zhang, Zhe
Chen, Dechang
Fenstermacher, David A
author_facet Zhang, Zhe
Chen, Dechang
Fenstermacher, David A
author_sort Zhang, Zhe
collection PubMed
description BACKGROUND: Gene expression profiles based on microarray data have been suggested by many studies as potential molecular prognostic indexes of breast cancer. However, due to the confounding effect of clinical background, independent studies often obtained inconsistent results. The current study investigated the possibility to improve the quality and generality of expression profiles by integrated analysis of multiple datasets. Profiles of recurrence outcome were derived from two independent datasets and validated by a third dataset. RESULTS: The clinical background of patients significantly influenced the content and performance of expression profiles when the training samples were unbalanced. The integrated profiling of two independent datasets lead to higher classification accuracy (71.11% vs. 70.59%) and larger ROC curve area (0.789 vs. 0.767) of the testing samples. Cell cycle, especially M phase mitosis, was significantly overrepresented by the 60-gene profile obtained from integrated analysis (p < 0.0001). This profiles significantly differentiated poor and good prognosis in a third patient cohort (p = 0.003). Simulation procedures demonstrated that the change of profile specificity had more instant influence on the performance of expression profiles than the change of profile sensitivity. CONCLUSION: The current study confirmed that the gene expression profile generated by integrated analysis of multiple datasets achieved better prediction of breast cancer recurrence. However, the content and performance of profiles was confounded by clinical background of training patients. In future studies, prognostic profile applicable to the general population should be derived from more diversified and balanced patient cohorts in larger scale.
format Text
id pubmed-2064937
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central|1
record_format MEDLINE/PubMed
spelling pubmed-20649372007-11-07 Integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome Zhang, Zhe Chen, Dechang Fenstermacher, David A BMC Genomics Research Article BACKGROUND: Gene expression profiles based on microarray data have been suggested by many studies as potential molecular prognostic indexes of breast cancer. However, due to the confounding effect of clinical background, independent studies often obtained inconsistent results. The current study investigated the possibility to improve the quality and generality of expression profiles by integrated analysis of multiple datasets. Profiles of recurrence outcome were derived from two independent datasets and validated by a third dataset. RESULTS: The clinical background of patients significantly influenced the content and performance of expression profiles when the training samples were unbalanced. The integrated profiling of two independent datasets lead to higher classification accuracy (71.11% vs. 70.59%) and larger ROC curve area (0.789 vs. 0.767) of the testing samples. Cell cycle, especially M phase mitosis, was significantly overrepresented by the 60-gene profile obtained from integrated analysis (p < 0.0001). This profiles significantly differentiated poor and good prognosis in a third patient cohort (p = 0.003). Simulation procedures demonstrated that the change of profile specificity had more instant influence on the performance of expression profiles than the change of profile sensitivity. CONCLUSION: The current study confirmed that the gene expression profile generated by integrated analysis of multiple datasets achieved better prediction of breast cancer recurrence. However, the content and performance of profiles was confounded by clinical background of training patients. In future studies, prognostic profile applicable to the general population should be derived from more diversified and balanced patient cohorts in larger scale. BioMed Central|1 2007-09-20 /pmc/articles/PMC2064937/ /pubmed/17883867 http://dx.doi.org/10.1186/1471-2164-8-331 Text en Copyright © 2007 Zhang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Zhe
Chen, Dechang
Fenstermacher, David A
Integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome
title Integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome
title_full Integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome
title_fullStr Integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome
title_full_unstemmed Integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome
title_short Integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome
title_sort integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2064937/
https://www.ncbi.nlm.nih.gov/pubmed/17883867
http://dx.doi.org/10.1186/1471-2164-8-331
work_keys_str_mv AT zhangzhe integratedanalysisofindependentgeneexpressionmicroarraydatasetsimprovesthepredictabilityofbreastcanceroutcome
AT chendechang integratedanalysisofindependentgeneexpressionmicroarraydatasetsimprovesthepredictabilityofbreastcanceroutcome
AT fenstermacherdavida integratedanalysisofindependentgeneexpressionmicroarraydatasetsimprovesthepredictabilityofbreastcanceroutcome