Cargando…

Network embedding framework for driver gene discovery by combining functional and structural information

Comprehensive analysis of multiple data sets can identify potential driver genes for various cancers. In recent years, driver gene discovery based on massive mutation data and gene interaction networks has attracted increasing attention, but there is still a need to explore combining functional and...

Descripción completa

Detalles Bibliográficos
Autores principales: Chu, Xin, Guan, Boxin, Dai, Lingyun, Liu, Jin-xing, Li, Feng, Shang, Junliang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10386255/
https://www.ncbi.nlm.nih.gov/pubmed/37516822
http://dx.doi.org/10.1186/s12864-023-09515-x
_version_ 1785081618648530944
author Chu, Xin
Guan, Boxin
Dai, Lingyun
Liu, Jin-xing
Li, Feng
Shang, Junliang
author_facet Chu, Xin
Guan, Boxin
Dai, Lingyun
Liu, Jin-xing
Li, Feng
Shang, Junliang
author_sort Chu, Xin
collection PubMed
description Comprehensive analysis of multiple data sets can identify potential driver genes for various cancers. In recent years, driver gene discovery based on massive mutation data and gene interaction networks has attracted increasing attention, but there is still a need to explore combining functional and structural information of genes in protein interaction networks to identify driver genes. Therefore, we propose a network embedding framework combining functional and structural information to identify driver genes. Firstly, we combine the mutation data and gene interaction networks to construct mutation integration network using network propagation algorithm. Secondly, the struc2vec model is used for extracting gene features from the mutation integration network, which contains both gene's functional and structural information. Finally, machine learning algorithms are utilized to identify the driver genes. Compared with the previous four excellent methods, our method can find gene pairs that are distant from each other through structural similarities and has better performance in identifying driver genes for 12 cancers in the cancer genome atlas. At the same time, we also conduct a comparative analysis of three gene interaction networks, three gene standard sets, and five machine learning algorithms. Our framework provides a new perspective for feature selection to identify novel driver genes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09515-x.
format Online
Article
Text
id pubmed-10386255
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103862552023-07-30 Network embedding framework for driver gene discovery by combining functional and structural information Chu, Xin Guan, Boxin Dai, Lingyun Liu, Jin-xing Li, Feng Shang, Junliang BMC Genomics Research Comprehensive analysis of multiple data sets can identify potential driver genes for various cancers. In recent years, driver gene discovery based on massive mutation data and gene interaction networks has attracted increasing attention, but there is still a need to explore combining functional and structural information of genes in protein interaction networks to identify driver genes. Therefore, we propose a network embedding framework combining functional and structural information to identify driver genes. Firstly, we combine the mutation data and gene interaction networks to construct mutation integration network using network propagation algorithm. Secondly, the struc2vec model is used for extracting gene features from the mutation integration network, which contains both gene's functional and structural information. Finally, machine learning algorithms are utilized to identify the driver genes. Compared with the previous four excellent methods, our method can find gene pairs that are distant from each other through structural similarities and has better performance in identifying driver genes for 12 cancers in the cancer genome atlas. At the same time, we also conduct a comparative analysis of three gene interaction networks, three gene standard sets, and five machine learning algorithms. Our framework provides a new perspective for feature selection to identify novel driver genes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09515-x. BioMed Central 2023-07-29 /pmc/articles/PMC10386255/ /pubmed/37516822 http://dx.doi.org/10.1186/s12864-023-09515-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Chu, Xin
Guan, Boxin
Dai, Lingyun
Liu, Jin-xing
Li, Feng
Shang, Junliang
Network embedding framework for driver gene discovery by combining functional and structural information
title Network embedding framework for driver gene discovery by combining functional and structural information
title_full Network embedding framework for driver gene discovery by combining functional and structural information
title_fullStr Network embedding framework for driver gene discovery by combining functional and structural information
title_full_unstemmed Network embedding framework for driver gene discovery by combining functional and structural information
title_short Network embedding framework for driver gene discovery by combining functional and structural information
title_sort network embedding framework for driver gene discovery by combining functional and structural information
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10386255/
https://www.ncbi.nlm.nih.gov/pubmed/37516822
http://dx.doi.org/10.1186/s12864-023-09515-x
work_keys_str_mv AT chuxin networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation
AT guanboxin networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation
AT dailingyun networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation
AT liujinxing networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation
AT lifeng networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation
AT shangjunliang networkembeddingframeworkfordrivergenediscoverybycombiningfunctionalandstructuralinformation