Cargando…
Prediction of cancer driver genes through network-based moment propagation of mutation scores
MOTIVATION: Gaining a comprehensive understanding of the genetics underlying cancer development and progression is a central goal of biomedical research. Its accomplishment promises key mechanistic, diagnostic and therapeutic insights. One major step in this direction is the identification of genes...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355253/ https://www.ncbi.nlm.nih.gov/pubmed/32657361 http://dx.doi.org/10.1093/bioinformatics/btaa452 |
_version_ | 1783558237613719552 |
---|---|
author | Gumpinger, Anja C Lage, Kasper Horn, Heiko Borgwardt, Karsten |
author_facet | Gumpinger, Anja C Lage, Kasper Horn, Heiko Borgwardt, Karsten |
author_sort | Gumpinger, Anja C |
collection | PubMed |
description | MOTIVATION: Gaining a comprehensive understanding of the genetics underlying cancer development and progression is a central goal of biomedical research. Its accomplishment promises key mechanistic, diagnostic and therapeutic insights. One major step in this direction is the identification of genes that drive the emergence of tumors upon mutation. Recent advances in the field of computational biology have shown the potential of combining genetic summary statistics that represent the mutational burden in genes with biological networks, such as protein–protein interaction networks, to identify cancer driver genes. Those approaches superimpose the summary statistics on the nodes in the network, followed by an unsupervised propagation of the node scores through the network. However, this unsupervised setting does not leverage any knowledge on well-established cancer genes, a potentially valuable resource to improve the identification of novel cancer drivers. RESULTS: We develop a novel node embedding that enables classification of cancer driver genes in a supervised setting. The embedding combines a representation of the mutation score distribution in a node’s local neighborhood with network propagation. We leverage the knowledge of well-established cancer driver genes to define a positive class, resulting in a partially labeled dataset, and develop a cross-validation scheme to enable supervised prediction. The proposed node embedding followed by a supervised classification improves the predictive performance compared with baseline methods and yields a set of promising genes that constitute candidates for further biological validation. AVAILABILITY AND IMPLEMENTATION: Code available at https://github.com/BorgwardtLab/MoProEmbeddings. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-7355253 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-73552532020-07-16 Prediction of cancer driver genes through network-based moment propagation of mutation scores Gumpinger, Anja C Lage, Kasper Horn, Heiko Borgwardt, Karsten Bioinformatics Systems Biology and Networks MOTIVATION: Gaining a comprehensive understanding of the genetics underlying cancer development and progression is a central goal of biomedical research. Its accomplishment promises key mechanistic, diagnostic and therapeutic insights. One major step in this direction is the identification of genes that drive the emergence of tumors upon mutation. Recent advances in the field of computational biology have shown the potential of combining genetic summary statistics that represent the mutational burden in genes with biological networks, such as protein–protein interaction networks, to identify cancer driver genes. Those approaches superimpose the summary statistics on the nodes in the network, followed by an unsupervised propagation of the node scores through the network. However, this unsupervised setting does not leverage any knowledge on well-established cancer genes, a potentially valuable resource to improve the identification of novel cancer drivers. RESULTS: We develop a novel node embedding that enables classification of cancer driver genes in a supervised setting. The embedding combines a representation of the mutation score distribution in a node’s local neighborhood with network propagation. We leverage the knowledge of well-established cancer driver genes to define a positive class, resulting in a partially labeled dataset, and develop a cross-validation scheme to enable supervised prediction. The proposed node embedding followed by a supervised classification improves the predictive performance compared with baseline methods and yields a set of promising genes that constitute candidates for further biological validation. AVAILABILITY AND IMPLEMENTATION: Code available at https://github.com/BorgwardtLab/MoProEmbeddings. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355253/ /pubmed/32657361 http://dx.doi.org/10.1093/bioinformatics/btaa452 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Systems Biology and Networks Gumpinger, Anja C Lage, Kasper Horn, Heiko Borgwardt, Karsten Prediction of cancer driver genes through network-based moment propagation of mutation scores |
title | Prediction of cancer driver genes through network-based moment propagation of mutation scores |
title_full | Prediction of cancer driver genes through network-based moment propagation of mutation scores |
title_fullStr | Prediction of cancer driver genes through network-based moment propagation of mutation scores |
title_full_unstemmed | Prediction of cancer driver genes through network-based moment propagation of mutation scores |
title_short | Prediction of cancer driver genes through network-based moment propagation of mutation scores |
title_sort | prediction of cancer driver genes through network-based moment propagation of mutation scores |
topic | Systems Biology and Networks |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355253/ https://www.ncbi.nlm.nih.gov/pubmed/32657361 http://dx.doi.org/10.1093/bioinformatics/btaa452 |
work_keys_str_mv | AT gumpingeranjac predictionofcancerdrivergenesthroughnetworkbasedmomentpropagationofmutationscores AT lagekasper predictionofcancerdrivergenesthroughnetworkbasedmomentpropagationofmutationscores AT hornheiko predictionofcancerdrivergenesthroughnetworkbasedmomentpropagationofmutationscores AT borgwardtkarsten predictionofcancerdrivergenesthroughnetworkbasedmomentpropagationofmutationscores |