Cargando…

Identification of Latent Oncogenes with a Network Embedding Method and Random Forest

Oncogene is a special type of genes, which can promote the tumor initiation. Good study on oncogenes is helpful for understanding the cause of cancers. Experimental techniques in early time are quite popular in detecting oncogenes. However, their defects become more and more evident in recent years,...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Ran, Hu, Bin, Chen, Lei, Zhou, Bo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7530476/
https://www.ncbi.nlm.nih.gov/pubmed/33029511
http://dx.doi.org/10.1155/2020/5160396
_version_ 1783589577532899328
author Zhao, Ran
Hu, Bin
Chen, Lei
Zhou, Bo
author_facet Zhao, Ran
Hu, Bin
Chen, Lei
Zhou, Bo
author_sort Zhao, Ran
collection PubMed
description Oncogene is a special type of genes, which can promote the tumor initiation. Good study on oncogenes is helpful for understanding the cause of cancers. Experimental techniques in early time are quite popular in detecting oncogenes. However, their defects become more and more evident in recent years, such as high cost and long time. The newly proposed computational methods provide an alternative way to study oncogenes, which can provide useful clues for further investigations on candidate genes. Considering the limitations of some previous computational methods, such as lack of learning procedures and terming genes as individual subjects, a novel computational method was proposed in this study. The method adopted the features derived from multiple protein networks, viewing proteins in a system level. A classic machine learning algorithm, random forest, was applied on these features to capture the essential characteristic of oncogenes, thereby building the prediction model. All genes except validated oncogenes were ranked with a measurement yielded by the prediction model. Top genes were quite different from potential oncogenes discovered by previous methods, and they can be confirmed to become novel oncogenes. It was indicated that the newly identified genes can be essential supplements for previous results.
format Online
Article
Text
id pubmed-7530476
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-75304762020-10-06 Identification of Latent Oncogenes with a Network Embedding Method and Random Forest Zhao, Ran Hu, Bin Chen, Lei Zhou, Bo Biomed Res Int Research Article Oncogene is a special type of genes, which can promote the tumor initiation. Good study on oncogenes is helpful for understanding the cause of cancers. Experimental techniques in early time are quite popular in detecting oncogenes. However, their defects become more and more evident in recent years, such as high cost and long time. The newly proposed computational methods provide an alternative way to study oncogenes, which can provide useful clues for further investigations on candidate genes. Considering the limitations of some previous computational methods, such as lack of learning procedures and terming genes as individual subjects, a novel computational method was proposed in this study. The method adopted the features derived from multiple protein networks, viewing proteins in a system level. A classic machine learning algorithm, random forest, was applied on these features to capture the essential characteristic of oncogenes, thereby building the prediction model. All genes except validated oncogenes were ranked with a measurement yielded by the prediction model. Top genes were quite different from potential oncogenes discovered by previous methods, and they can be confirmed to become novel oncogenes. It was indicated that the newly identified genes can be essential supplements for previous results. Hindawi 2020-09-23 /pmc/articles/PMC7530476/ /pubmed/33029511 http://dx.doi.org/10.1155/2020/5160396 Text en Copyright © 2020 Ran Zhao et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhao, Ran
Hu, Bin
Chen, Lei
Zhou, Bo
Identification of Latent Oncogenes with a Network Embedding Method and Random Forest
title Identification of Latent Oncogenes with a Network Embedding Method and Random Forest
title_full Identification of Latent Oncogenes with a Network Embedding Method and Random Forest
title_fullStr Identification of Latent Oncogenes with a Network Embedding Method and Random Forest
title_full_unstemmed Identification of Latent Oncogenes with a Network Embedding Method and Random Forest
title_short Identification of Latent Oncogenes with a Network Embedding Method and Random Forest
title_sort identification of latent oncogenes with a network embedding method and random forest
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7530476/
https://www.ncbi.nlm.nih.gov/pubmed/33029511
http://dx.doi.org/10.1155/2020/5160396
work_keys_str_mv AT zhaoran identificationoflatentoncogeneswithanetworkembeddingmethodandrandomforest
AT hubin identificationoflatentoncogeneswithanetworkembeddingmethodandrandomforest
AT chenlei identificationoflatentoncogeneswithanetworkembeddingmethodandrandomforest
AT zhoubo identificationoflatentoncogeneswithanetworkembeddingmethodandrandomforest