Cargando…

Identifying Cancer genes by combining two-rounds RWR based on multiple biological data

BACKGROUND: It’s a very urgent task to identify cancer genes that enables us to understand the mechanisms of biochemical processes at a biomolecular level and facilitates the development of bioinformatics. Although a large number of methods have been proposed to identify cancer genes at recent times...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Wenxiang, Lei (IEEE member), Xiujuan, Bian, Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6876101/
https://www.ncbi.nlm.nih.gov/pubmed/31760937
http://dx.doi.org/10.1186/s12859-019-3123-8
_version_ 1783473155656908800
author Zhang, Wenxiang
Lei (IEEE member), Xiujuan
Bian, Chen
author_facet Zhang, Wenxiang
Lei (IEEE member), Xiujuan
Bian, Chen
author_sort Zhang, Wenxiang
collection PubMed
description BACKGROUND: It’s a very urgent task to identify cancer genes that enables us to understand the mechanisms of biochemical processes at a biomolecular level and facilitates the development of bioinformatics. Although a large number of methods have been proposed to identify cancer genes at recent times, the biological data utilized by most of these methods is still quite less, which reflects an insufficient consideration of the relationship between genes and diseases from a variety of factors. RESULTS: In this paper, we propose a two-rounds random walk algorithm to identify cancer genes based on multiple biological data (TRWR-MB), including protein-protein interaction (PPI) network, pathway network, microRNA similarity network, lncRNA similarity network, cancer similarity network and protein complexes. In the first-round random walk, all cancer nodes, cancer-related genes, cancer-related microRNAs and cancer-related lncRNAs, being associated with all the cancer, are used as seed nodes, and then a random walker walks on a quadruple layer heterogeneous network constructed by multiple biological data. The first-round random walk aims to select the top score k of potential cancer genes. Then in the second-round random walk, genes, microRNAs and lncRNAs, being associated with a certain special cancer in corresponding cancer class, are regarded as seed nodes, and then the walker walks on a new quadruple layer heterogeneous network constructed by lncRNAs, microRNAs, cancer and selected potential cancer genes. After the above walks finish, we combine the results of two-rounds RWR as ranking score for experimental analysis. As a result, a higher value of area under the receiver operating characteristic curve (AUC) is obtained. Besides, cases studies for identifying new cancer genes are performed in corresponding section. CONCLUSION: In summary, TRWR-MB integrates multiple biological data to identify cancer genes by analyzing the relationship between genes and cancer from a variety of biological molecular perspective.
format Online
Article
Text
id pubmed-6876101
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68761012019-11-29 Identifying Cancer genes by combining two-rounds RWR based on multiple biological data Zhang, Wenxiang Lei (IEEE member), Xiujuan Bian, Chen BMC Bioinformatics Research BACKGROUND: It’s a very urgent task to identify cancer genes that enables us to understand the mechanisms of biochemical processes at a biomolecular level and facilitates the development of bioinformatics. Although a large number of methods have been proposed to identify cancer genes at recent times, the biological data utilized by most of these methods is still quite less, which reflects an insufficient consideration of the relationship between genes and diseases from a variety of factors. RESULTS: In this paper, we propose a two-rounds random walk algorithm to identify cancer genes based on multiple biological data (TRWR-MB), including protein-protein interaction (PPI) network, pathway network, microRNA similarity network, lncRNA similarity network, cancer similarity network and protein complexes. In the first-round random walk, all cancer nodes, cancer-related genes, cancer-related microRNAs and cancer-related lncRNAs, being associated with all the cancer, are used as seed nodes, and then a random walker walks on a quadruple layer heterogeneous network constructed by multiple biological data. The first-round random walk aims to select the top score k of potential cancer genes. Then in the second-round random walk, genes, microRNAs and lncRNAs, being associated with a certain special cancer in corresponding cancer class, are regarded as seed nodes, and then the walker walks on a new quadruple layer heterogeneous network constructed by lncRNAs, microRNAs, cancer and selected potential cancer genes. After the above walks finish, we combine the results of two-rounds RWR as ranking score for experimental analysis. As a result, a higher value of area under the receiver operating characteristic curve (AUC) is obtained. Besides, cases studies for identifying new cancer genes are performed in corresponding section. CONCLUSION: In summary, TRWR-MB integrates multiple biological data to identify cancer genes by analyzing the relationship between genes and cancer from a variety of biological molecular perspective. BioMed Central 2019-11-25 /pmc/articles/PMC6876101/ /pubmed/31760937 http://dx.doi.org/10.1186/s12859-019-3123-8 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhang, Wenxiang
Lei (IEEE member), Xiujuan
Bian, Chen
Identifying Cancer genes by combining two-rounds RWR based on multiple biological data
title Identifying Cancer genes by combining two-rounds RWR based on multiple biological data
title_full Identifying Cancer genes by combining two-rounds RWR based on multiple biological data
title_fullStr Identifying Cancer genes by combining two-rounds RWR based on multiple biological data
title_full_unstemmed Identifying Cancer genes by combining two-rounds RWR based on multiple biological data
title_short Identifying Cancer genes by combining two-rounds RWR based on multiple biological data
title_sort identifying cancer genes by combining two-rounds rwr based on multiple biological data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6876101/
https://www.ncbi.nlm.nih.gov/pubmed/31760937
http://dx.doi.org/10.1186/s12859-019-3123-8
work_keys_str_mv AT zhangwenxiang identifyingcancergenesbycombiningtworoundsrwrbasedonmultiplebiologicaldata
AT leiieeememberxiujuan identifyingcancergenesbycombiningtworoundsrwrbasedonmultiplebiologicaldata
AT bianchen identifyingcancergenesbycombiningtworoundsrwrbasedonmultiplebiologicaldata