Cargando…

Prediction of early breast cancer patient survival using ensembles of hypoxia signatures

BACKGROUND: Biomarkers are a key component of precision medicine. However, full clinical integration of biomarkers has been met with challenges, partly attributed to analytical difficulties. It has been shown that biomarker reproducibility is susceptible to data preprocessing approaches. Here, we sy...

Descripción completa

Detalles Bibliográficos
Autores principales: Gong, Inna Y., Fox, Natalie S., Huang, Vincent, Boutros, Paul C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6138385/
https://www.ncbi.nlm.nih.gov/pubmed/30216362
http://dx.doi.org/10.1371/journal.pone.0204123
_version_ 1783355336167522304
author Gong, Inna Y.
Fox, Natalie S.
Huang, Vincent
Boutros, Paul C.
author_facet Gong, Inna Y.
Fox, Natalie S.
Huang, Vincent
Boutros, Paul C.
author_sort Gong, Inna Y.
collection PubMed
description BACKGROUND: Biomarkers are a key component of precision medicine. However, full clinical integration of biomarkers has been met with challenges, partly attributed to analytical difficulties. It has been shown that biomarker reproducibility is susceptible to data preprocessing approaches. Here, we systematically evaluated machine-learning ensembles of preprocessing methods as a general strategy to improve biomarker performance for prediction of survival from early breast cancer. RESULTS: We risk stratified breast cancer patients into either low-risk or high-risk groups based on four published hypoxia signatures (Buffa, Winter, Hu, and Sorensen), using 24 different preprocessing approaches for microarray normalization. The 24 binary risk profiles determined for each hypoxia signature were combined using a random forest to evaluate the efficacy of a preprocessing ensemble classifier. We demonstrate that the best way of merging preprocessing methods varies from signature to signature, and that there is likely no ‘best’ preprocessing pipeline that is universal across datasets, highlighting the need to evaluate ensembles of preprocessing algorithms. Further, we developed novel signatures for each preprocessing method and the risk classifications from each were incorporated in a meta-random forest model. Interestingly, the classification of these biomarkers and its ensemble show striking consistency, demonstrating that similar intrinsic biological information are being faithfully represented. As such, these classification patterns further confirm that there is a subset of patients whose prognosis is consistently challenging to predict. CONCLUSIONS: Performance of different prognostic signatures varies with pre-processing method. A simple classifier by unanimous voting of classifications is a reliable way of improving on single preprocessing methods. Future signatures will likely require integration of intrinsic and extrinsic clinico-pathological variables to better predict disease-related outcomes.
format Online
Article
Text
id pubmed-6138385
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-61383852018-09-27 Prediction of early breast cancer patient survival using ensembles of hypoxia signatures Gong, Inna Y. Fox, Natalie S. Huang, Vincent Boutros, Paul C. PLoS One Research Article BACKGROUND: Biomarkers are a key component of precision medicine. However, full clinical integration of biomarkers has been met with challenges, partly attributed to analytical difficulties. It has been shown that biomarker reproducibility is susceptible to data preprocessing approaches. Here, we systematically evaluated machine-learning ensembles of preprocessing methods as a general strategy to improve biomarker performance for prediction of survival from early breast cancer. RESULTS: We risk stratified breast cancer patients into either low-risk or high-risk groups based on four published hypoxia signatures (Buffa, Winter, Hu, and Sorensen), using 24 different preprocessing approaches for microarray normalization. The 24 binary risk profiles determined for each hypoxia signature were combined using a random forest to evaluate the efficacy of a preprocessing ensemble classifier. We demonstrate that the best way of merging preprocessing methods varies from signature to signature, and that there is likely no ‘best’ preprocessing pipeline that is universal across datasets, highlighting the need to evaluate ensembles of preprocessing algorithms. Further, we developed novel signatures for each preprocessing method and the risk classifications from each were incorporated in a meta-random forest model. Interestingly, the classification of these biomarkers and its ensemble show striking consistency, demonstrating that similar intrinsic biological information are being faithfully represented. As such, these classification patterns further confirm that there is a subset of patients whose prognosis is consistently challenging to predict. CONCLUSIONS: Performance of different prognostic signatures varies with pre-processing method. A simple classifier by unanimous voting of classifications is a reliable way of improving on single preprocessing methods. Future signatures will likely require integration of intrinsic and extrinsic clinico-pathological variables to better predict disease-related outcomes. Public Library of Science 2018-09-14 /pmc/articles/PMC6138385/ /pubmed/30216362 http://dx.doi.org/10.1371/journal.pone.0204123 Text en © 2018 Gong et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gong, Inna Y.
Fox, Natalie S.
Huang, Vincent
Boutros, Paul C.
Prediction of early breast cancer patient survival using ensembles of hypoxia signatures
title Prediction of early breast cancer patient survival using ensembles of hypoxia signatures
title_full Prediction of early breast cancer patient survival using ensembles of hypoxia signatures
title_fullStr Prediction of early breast cancer patient survival using ensembles of hypoxia signatures
title_full_unstemmed Prediction of early breast cancer patient survival using ensembles of hypoxia signatures
title_short Prediction of early breast cancer patient survival using ensembles of hypoxia signatures
title_sort prediction of early breast cancer patient survival using ensembles of hypoxia signatures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6138385/
https://www.ncbi.nlm.nih.gov/pubmed/30216362
http://dx.doi.org/10.1371/journal.pone.0204123
work_keys_str_mv AT gonginnay predictionofearlybreastcancerpatientsurvivalusingensemblesofhypoxiasignatures
AT foxnatalies predictionofearlybreastcancerpatientsurvivalusingensemblesofhypoxiasignatures
AT huangvincent predictionofearlybreastcancerpatientsurvivalusingensemblesofhypoxiasignatures
AT boutrospaulc predictionofearlybreastcancerpatientsurvivalusingensemblesofhypoxiasignatures