Cargando…

Prediction of cancer driver genes through network-based moment propagation of mutation scores

MOTIVATION: Gaining a comprehensive understanding of the genetics underlying cancer development and progression is a central goal of biomedical research. Its accomplishment promises key mechanistic, diagnostic and therapeutic insights. One major step in this direction is the identification of genes...

Descripción completa

Detalles Bibliográficos
Autores principales: Gumpinger, Anja C, Lage, Kasper, Horn, Heiko, Borgwardt, Karsten
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355253/
https://www.ncbi.nlm.nih.gov/pubmed/32657361
http://dx.doi.org/10.1093/bioinformatics/btaa452
_version_ 1783558237613719552
author Gumpinger, Anja C
Lage, Kasper
Horn, Heiko
Borgwardt, Karsten
author_facet Gumpinger, Anja C
Lage, Kasper
Horn, Heiko
Borgwardt, Karsten
author_sort Gumpinger, Anja C
collection PubMed
description MOTIVATION: Gaining a comprehensive understanding of the genetics underlying cancer development and progression is a central goal of biomedical research. Its accomplishment promises key mechanistic, diagnostic and therapeutic insights. One major step in this direction is the identification of genes that drive the emergence of tumors upon mutation. Recent advances in the field of computational biology have shown the potential of combining genetic summary statistics that represent the mutational burden in genes with biological networks, such as protein–protein interaction networks, to identify cancer driver genes. Those approaches superimpose the summary statistics on the nodes in the network, followed by an unsupervised propagation of the node scores through the network. However, this unsupervised setting does not leverage any knowledge on well-established cancer genes, a potentially valuable resource to improve the identification of novel cancer drivers. RESULTS: We develop a novel node embedding that enables classification of cancer driver genes in a supervised setting. The embedding combines a representation of the mutation score distribution in a node’s local neighborhood with network propagation. We leverage the knowledge of well-established cancer driver genes to define a positive class, resulting in a partially labeled dataset, and develop a cross-validation scheme to enable supervised prediction. The proposed node embedding followed by a supervised classification improves the predictive performance compared with baseline methods and yields a set of promising genes that constitute candidates for further biological validation. AVAILABILITY AND IMPLEMENTATION: Code available at https://github.com/BorgwardtLab/MoProEmbeddings. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7355253
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73552532020-07-16 Prediction of cancer driver genes through network-based moment propagation of mutation scores Gumpinger, Anja C Lage, Kasper Horn, Heiko Borgwardt, Karsten Bioinformatics Systems Biology and Networks MOTIVATION: Gaining a comprehensive understanding of the genetics underlying cancer development and progression is a central goal of biomedical research. Its accomplishment promises key mechanistic, diagnostic and therapeutic insights. One major step in this direction is the identification of genes that drive the emergence of tumors upon mutation. Recent advances in the field of computational biology have shown the potential of combining genetic summary statistics that represent the mutational burden in genes with biological networks, such as protein–protein interaction networks, to identify cancer driver genes. Those approaches superimpose the summary statistics on the nodes in the network, followed by an unsupervised propagation of the node scores through the network. However, this unsupervised setting does not leverage any knowledge on well-established cancer genes, a potentially valuable resource to improve the identification of novel cancer drivers. RESULTS: We develop a novel node embedding that enables classification of cancer driver genes in a supervised setting. The embedding combines a representation of the mutation score distribution in a node’s local neighborhood with network propagation. We leverage the knowledge of well-established cancer driver genes to define a positive class, resulting in a partially labeled dataset, and develop a cross-validation scheme to enable supervised prediction. The proposed node embedding followed by a supervised classification improves the predictive performance compared with baseline methods and yields a set of promising genes that constitute candidates for further biological validation. AVAILABILITY AND IMPLEMENTATION: Code available at https://github.com/BorgwardtLab/MoProEmbeddings. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355253/ /pubmed/32657361 http://dx.doi.org/10.1093/bioinformatics/btaa452 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Systems Biology and Networks
Gumpinger, Anja C
Lage, Kasper
Horn, Heiko
Borgwardt, Karsten
Prediction of cancer driver genes through network-based moment propagation of mutation scores
title Prediction of cancer driver genes through network-based moment propagation of mutation scores
title_full Prediction of cancer driver genes through network-based moment propagation of mutation scores
title_fullStr Prediction of cancer driver genes through network-based moment propagation of mutation scores
title_full_unstemmed Prediction of cancer driver genes through network-based moment propagation of mutation scores
title_short Prediction of cancer driver genes through network-based moment propagation of mutation scores
title_sort prediction of cancer driver genes through network-based moment propagation of mutation scores
topic Systems Biology and Networks
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355253/
https://www.ncbi.nlm.nih.gov/pubmed/32657361
http://dx.doi.org/10.1093/bioinformatics/btaa452
work_keys_str_mv AT gumpingeranjac predictionofcancerdrivergenesthroughnetworkbasedmomentpropagationofmutationscores
AT lagekasper predictionofcancerdrivergenesthroughnetworkbasedmomentpropagationofmutationscores
AT hornheiko predictionofcancerdrivergenesthroughnetworkbasedmomentpropagationofmutationscores
AT borgwardtkarsten predictionofcancerdrivergenesthroughnetworkbasedmomentpropagationofmutationscores