Cargando…
Evaluation and integration of cancer gene classifiers: identification and ranking of plausible drivers
The number of mutated genes in cancer cells is far larger than the number of mutations that drive cancer. The difficulty this creates for identifying relevant alterations has stimulated the development of various computational approaches to distinguishing drivers from bystanders. We develop and appl...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4650817/ https://www.ncbi.nlm.nih.gov/pubmed/25961669 http://dx.doi.org/10.1038/srep10204 |
_version_ | 1782401562935558144 |
---|---|
author | Liu, Yang Tian, Feng Hu, Zhenjun DeLisi, Charles |
author_facet | Liu, Yang Tian, Feng Hu, Zhenjun DeLisi, Charles |
author_sort | Liu, Yang |
collection | PubMed |
description | The number of mutated genes in cancer cells is far larger than the number of mutations that drive cancer. The difficulty this creates for identifying relevant alterations has stimulated the development of various computational approaches to distinguishing drivers from bystanders. We develop and apply an ensemble classifier (EC) machine learning method, which integrates 10 classifiers that are publically available, and apply it to breast and ovarian cancer. In particular we find the following: (1) Using both standard and non-standard metrics, EC almost always outperforms single method classifiers, often by wide margins. (2) Of the 50 highest ranked genes for breast (ovarian) cancer, 34 (30) are associated with other cancers in either the OMIM, CGC or NCG database (P < 10(−22)). (3) Another 10, for both breast and ovarian cancer, have been identified by GWAS studies. (4) Several of the remaining genes--including a protein kinase that regulates the Fra-1 transcription factor which is overexpressed in ER negative breast cancer cells; and Fyn, which is overexpressed in pancreatic and prostate cancer, among others--are biologically plausible. Biological implications are briefly discussed. Source codes and detailed results are available at http://www.visantnet.org/misi/driver_integration.zip. |
format | Online Article Text |
id | pubmed-4650817 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-46508172015-11-24 Evaluation and integration of cancer gene classifiers: identification and ranking of plausible drivers Liu, Yang Tian, Feng Hu, Zhenjun DeLisi, Charles Sci Rep Article The number of mutated genes in cancer cells is far larger than the number of mutations that drive cancer. The difficulty this creates for identifying relevant alterations has stimulated the development of various computational approaches to distinguishing drivers from bystanders. We develop and apply an ensemble classifier (EC) machine learning method, which integrates 10 classifiers that are publically available, and apply it to breast and ovarian cancer. In particular we find the following: (1) Using both standard and non-standard metrics, EC almost always outperforms single method classifiers, often by wide margins. (2) Of the 50 highest ranked genes for breast (ovarian) cancer, 34 (30) are associated with other cancers in either the OMIM, CGC or NCG database (P < 10(−22)). (3) Another 10, for both breast and ovarian cancer, have been identified by GWAS studies. (4) Several of the remaining genes--including a protein kinase that regulates the Fra-1 transcription factor which is overexpressed in ER negative breast cancer cells; and Fyn, which is overexpressed in pancreatic and prostate cancer, among others--are biologically plausible. Biological implications are briefly discussed. Source codes and detailed results are available at http://www.visantnet.org/misi/driver_integration.zip. Nature Publishing Group 2015-05-11 /pmc/articles/PMC4650817/ /pubmed/25961669 http://dx.doi.org/10.1038/srep10204 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Liu, Yang Tian, Feng Hu, Zhenjun DeLisi, Charles Evaluation and integration of cancer gene classifiers: identification and ranking of plausible drivers |
title | Evaluation and integration of cancer gene classifiers: identification and ranking of plausible drivers |
title_full | Evaluation and integration of cancer gene classifiers: identification and ranking of plausible drivers |
title_fullStr | Evaluation and integration of cancer gene classifiers: identification and ranking of plausible drivers |
title_full_unstemmed | Evaluation and integration of cancer gene classifiers: identification and ranking of plausible drivers |
title_short | Evaluation and integration of cancer gene classifiers: identification and ranking of plausible drivers |
title_sort | evaluation and integration of cancer gene classifiers: identification and ranking of plausible drivers |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4650817/ https://www.ncbi.nlm.nih.gov/pubmed/25961669 http://dx.doi.org/10.1038/srep10204 |
work_keys_str_mv | AT liuyang evaluationandintegrationofcancergeneclassifiersidentificationandrankingofplausibledrivers AT tianfeng evaluationandintegrationofcancergeneclassifiersidentificationandrankingofplausibledrivers AT huzhenjun evaluationandintegrationofcancergeneclassifiersidentificationandrankingofplausibledrivers AT delisicharles evaluationandintegrationofcancergeneclassifiersidentificationandrankingofplausibledrivers |