Cargando…
Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes?
The fact that there is very little if any overlap between the genes of different prognostic signatures for early-discovery breast cancer is well documented. The reasons for this apparent discrepancy have been explained by the limits of simple machine-learning identification and ranking techniques, a...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3056769/ https://www.ncbi.nlm.nih.gov/pubmed/21423753 http://dx.doi.org/10.1371/journal.pone.0017795 |
_version_ | 1782200238938783744 |
---|---|
author | Drier, Yotam Domany, Eytan |
author_facet | Drier, Yotam Domany, Eytan |
author_sort | Drier, Yotam |
collection | PubMed |
description | The fact that there is very little if any overlap between the genes of different prognostic signatures for early-discovery breast cancer is well documented. The reasons for this apparent discrepancy have been explained by the limits of simple machine-learning identification and ranking techniques, and the biological relevance and meaning of the prognostic gene lists was questioned. Subsequently, proponents of the prognostic gene lists claimed that different lists do capture similar underlying biological processes and pathways. The present study places under scrutiny the validity of this claim, for two important gene lists that are at the focus of current large-scale validation efforts. We performed careful enrichment analysis, controlling the effects of multiple testing in a manner which takes into account the nested dependent structure of gene ontologies. In contradiction to several previous publications, we find that the only biological process or pathway for which statistically significant concordance can be claimed is cell proliferation, a process whose relevance and prognostic value was well known long before gene expression profiling. We found that the claims reported by others, of wider concordance between the biological processes captured by the two prognostic signatures studied, were found either to be lacking statistical rigor or were in fact based on addressing some other question. |
format | Text |
id | pubmed-3056769 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-30567692011-03-18 Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes? Drier, Yotam Domany, Eytan PLoS One Research Article The fact that there is very little if any overlap between the genes of different prognostic signatures for early-discovery breast cancer is well documented. The reasons for this apparent discrepancy have been explained by the limits of simple machine-learning identification and ranking techniques, and the biological relevance and meaning of the prognostic gene lists was questioned. Subsequently, proponents of the prognostic gene lists claimed that different lists do capture similar underlying biological processes and pathways. The present study places under scrutiny the validity of this claim, for two important gene lists that are at the focus of current large-scale validation efforts. We performed careful enrichment analysis, controlling the effects of multiple testing in a manner which takes into account the nested dependent structure of gene ontologies. In contradiction to several previous publications, we find that the only biological process or pathway for which statistically significant concordance can be claimed is cell proliferation, a process whose relevance and prognostic value was well known long before gene expression profiling. We found that the claims reported by others, of wider concordance between the biological processes captured by the two prognostic signatures studied, were found either to be lacking statistical rigor or were in fact based on addressing some other question. Public Library of Science 2011-03-14 /pmc/articles/PMC3056769/ /pubmed/21423753 http://dx.doi.org/10.1371/journal.pone.0017795 Text en Drier, Domany. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Drier, Yotam Domany, Eytan Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer Capture the Same Biological Processes? |
title | Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer
Capture the Same Biological Processes? |
title_full | Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer
Capture the Same Biological Processes? |
title_fullStr | Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer
Capture the Same Biological Processes? |
title_full_unstemmed | Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer
Capture the Same Biological Processes? |
title_short | Do Two Machine-Learning Based Prognostic Signatures for Breast Cancer
Capture the Same Biological Processes? |
title_sort | do two machine-learning based prognostic signatures for breast cancer
capture the same biological processes? |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3056769/ https://www.ncbi.nlm.nih.gov/pubmed/21423753 http://dx.doi.org/10.1371/journal.pone.0017795 |
work_keys_str_mv | AT drieryotam dotwomachinelearningbasedprognosticsignaturesforbreastcancercapturethesamebiologicalprocesses AT domanyeytan dotwomachinelearningbasedprognosticsignaturesforbreastcancercapturethesamebiologicalprocesses |