Cargando…

Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes?

The fact that there is very little if any overlap between the genes of different prognostic signatures for early-discovery breast cancer is well documented. The reasons for this apparent discrepancy have been explained by the limits of simple machine-learning identification and ranking techniques, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Drier, Yotam, Domany, Eytan
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3056769/
https://www.ncbi.nlm.nih.gov/pubmed/21423753
http://dx.doi.org/10.1371/journal.pone.0017795
_version_ 1782200238938783744
author Drier, Yotam
Domany, Eytan
author_facet Drier, Yotam
Domany, Eytan
author_sort Drier, Yotam
collection PubMed
description The fact that there is very little if any overlap between the genes of different prognostic signatures for early-discovery breast cancer is well documented. The reasons for this apparent discrepancy have been explained by the limits of simple machine-learning identification and ranking techniques, and the biological relevance and meaning of the prognostic gene lists was questioned. Subsequently, proponents of the prognostic gene lists claimed that different lists do capture similar underlying biological processes and pathways. The present study places under scrutiny the validity of this claim, for two important gene lists that are at the focus of current large-scale validation efforts. We performed careful enrichment analysis, controlling the effects of multiple testing in a manner which takes into account the nested dependent structure of gene ontologies. In contradiction to several previous publications, we find that the only biological process or pathway for which statistically significant concordance can be claimed is cell proliferation, a process whose relevance and prognostic value was well known long before gene expression profiling. We found that the claims reported by others, of wider concordance between the biological processes captured by the two prognostic signatures studied, were found either to be lacking statistical rigor or were in fact based on addressing some other question.
format Text
id pubmed-3056769
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30567692011-03-18 Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes? Drier, Yotam Domany, Eytan PLoS One Research Article The fact that there is very little if any overlap between the genes of different prognostic signatures for early-discovery breast cancer is well documented. The reasons for this apparent discrepancy have been explained by the limits of simple machine-learning identification and ranking techniques, and the biological relevance and meaning of the prognostic gene lists was questioned. Subsequently, proponents of the prognostic gene lists claimed that different lists do capture similar underlying biological processes and pathways. The present study places under scrutiny the validity of this claim, for two important gene lists that are at the focus of current large-scale validation efforts. We performed careful enrichment analysis, controlling the effects of multiple testing in a manner which takes into account the nested dependent structure of gene ontologies. In contradiction to several previous publications, we find that the only biological process or pathway for which statistically significant concordance can be claimed is cell proliferation, a process whose relevance and prognostic value was well known long before gene expression profiling. We found that the claims reported by others, of wider concordance between the biological processes captured by the two prognostic signatures studied, were found either to be lacking statistical rigor or were in fact based on addressing some other question. Public Library of Science 2011-03-14 /pmc/articles/PMC3056769/ /pubmed/21423753 http://dx.doi.org/10.1371/journal.pone.0017795 Text en Drier, Domany. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Drier, Yotam
Domany, Eytan
Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes?
title Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes?
title_full Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes?
title_fullStr Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes?
title_full_unstemmed Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes?
title_short Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes?
title_sort do two machine-learning based prognostic signatures for breast cancer capture the same biological processes?
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3056769/
https://www.ncbi.nlm.nih.gov/pubmed/21423753
http://dx.doi.org/10.1371/journal.pone.0017795
work_keys_str_mv AT drieryotam dotwomachinelearningbasedprognosticsignaturesforbreastcancercapturethesamebiologicalprocesses
AT domanyeytan dotwomachinelearningbasedprognosticsignaturesforbreastcancercapturethesamebiologicalprocesses