Cargando…
Network embedding framework for driver gene discovery by combining functional and structural information
Comprehensive analysis of multiple data sets can identify potential driver genes for various cancers. In recent years, driver gene discovery based on massive mutation data and gene interaction networks has attracted increasing attention, but there is still a need to explore combining functional and...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10386255/ https://www.ncbi.nlm.nih.gov/pubmed/37516822 http://dx.doi.org/10.1186/s12864-023-09515-x |
_version_ | 1785081618648530944 |
---|---|
author | Chu, Xin Guan, Boxin Dai, Lingyun Liu, Jin-xing Li, Feng Shang, Junliang |
author_facet | Chu, Xin Guan, Boxin Dai, Lingyun Liu, Jin-xing Li, Feng Shang, Junliang |
author_sort | Chu, Xin |
collection | PubMed |
description | Comprehensive analysis of multiple data sets can identify potential driver genes for various cancers. In recent years, driver gene discovery based on massive mutation data and gene interaction networks has attracted increasing attention, but there is still a need to explore combining functional and structural information of genes in protein interaction networks to identify driver genes. Therefore, we propose a network embedding framework combining functional and structural information to identify driver genes. Firstly, we combine the mutation data and gene interaction networks to construct mutation integration network using network propagation algorithm. Secondly, the struc2vec model is used for extracting gene features from the mutation integration network, which contains both gene's functional and structural information. Finally, machine learning algorithms are utilized to identify the driver genes. Compared with the previous four excellent methods, our method can find gene pairs that are distant from each other through structural similarities and has better performance in identifying driver genes for 12 cancers in the cancer genome atlas. At the same time, we also conduct a comparative analysis of three gene interaction networks, three gene standard sets, and five machine learning algorithms. Our framework provides a new perspective for feature selection to identify novel driver genes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09515-x. |
format | Online Article Text |
id | pubmed-10386255 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-103862552023-07-30 Network embedding framework for driver gene discovery by combining functional and structural information Chu, Xin Guan, Boxin Dai, Lingyun Liu, Jin-xing Li, Feng Shang, Junliang BMC Genomics Research Comprehensive analysis of multiple data sets can identify potential driver genes for various cancers. In recent years, driver gene discovery based on massive mutation data and gene interaction networks has attracted increasing attention, but there is still a need to explore combining functional and structural information of genes in protein interaction networks to identify driver genes. Therefore, we propose a network embedding framework combining functional and structural information to identify driver genes. Firstly, we combine the mutation data and gene interaction networks to construct mutation integration network using network propagation algorithm. Secondly, the struc2vec model is used for extracting gene features from the mutation integration network, which contains both gene's functional and structural information. Finally, machine learning algorithms are utilized to identify the driver genes. Compared with the previous four excellent methods, our method can find gene pairs that are distant from each other through structural similarities and has better performance in identifying driver genes for 12 cancers in the cancer genome atlas. At the same time, we also conduct a comparative analysis of three gene interaction networks, three gene standard sets, and five machine learning algorithms. Our framework provides a new perspective for feature selection to identify novel driver genes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09515-x. BioMed Central 2023-07-29 /pmc/articles/PMC10386255/ /pubmed/37516822 http://dx.doi.org/10.1186/s12864-023-09515-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Chu, Xin Guan, Boxin Dai, Lingyun Liu, Jin-xing Li, Feng Shang, Junliang Network embedding framework for driver gene discovery by combining functional and structural information |
title | Network embedding framework for driver gene discovery by combining functional and structural information |
title_full | Network embedding framework for driver gene discovery by combining functional and structural information |
title_fullStr | Network embedding framework for driver gene discovery by combining functional and structural information |
title_full_unstemmed | Network embedding framework for driver gene discovery by combining functional and structural information |
title_short | Network embedding framework for driver gene discovery by combining functional and structural information |
title_sort | network embedding framework for driver gene discovery by combining functional and structural information |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10386255/ https://www.ncbi.nlm.nih.gov/pubmed/37516822 http://dx.doi.org/10.1186/s12864-023-09515-x |
work_keys_str_mv | AT chuxin networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation AT guanboxin networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation AT dailingyun networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation AT liujinxing networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation AT lifeng networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation AT shangjunliang networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation |