Cargando…
Identification of Latent Oncogenes with a Network Embedding Method and Random Forest
Oncogene is a special type of genes, which can promote the tumor initiation. Good study on oncogenes is helpful for understanding the cause of cancers. Experimental techniques in early time are quite popular in detecting oncogenes. However, their defects become more and more evident in recent years,...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7530476/ https://www.ncbi.nlm.nih.gov/pubmed/33029511 http://dx.doi.org/10.1155/2020/5160396 |
_version_ | 1783589577532899328 |
---|---|
author | Zhao, Ran Hu, Bin Chen, Lei Zhou, Bo |
author_facet | Zhao, Ran Hu, Bin Chen, Lei Zhou, Bo |
author_sort | Zhao, Ran |
collection | PubMed |
description | Oncogene is a special type of genes, which can promote the tumor initiation. Good study on oncogenes is helpful for understanding the cause of cancers. Experimental techniques in early time are quite popular in detecting oncogenes. However, their defects become more and more evident in recent years, such as high cost and long time. The newly proposed computational methods provide an alternative way to study oncogenes, which can provide useful clues for further investigations on candidate genes. Considering the limitations of some previous computational methods, such as lack of learning procedures and terming genes as individual subjects, a novel computational method was proposed in this study. The method adopted the features derived from multiple protein networks, viewing proteins in a system level. A classic machine learning algorithm, random forest, was applied on these features to capture the essential characteristic of oncogenes, thereby building the prediction model. All genes except validated oncogenes were ranked with a measurement yielded by the prediction model. Top genes were quite different from potential oncogenes discovered by previous methods, and they can be confirmed to become novel oncogenes. It was indicated that the newly identified genes can be essential supplements for previous results. |
format | Online Article Text |
id | pubmed-7530476 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-75304762020-10-06 Identification of Latent Oncogenes with a Network Embedding Method and Random Forest Zhao, Ran Hu, Bin Chen, Lei Zhou, Bo Biomed Res Int Research Article Oncogene is a special type of genes, which can promote the tumor initiation. Good study on oncogenes is helpful for understanding the cause of cancers. Experimental techniques in early time are quite popular in detecting oncogenes. However, their defects become more and more evident in recent years, such as high cost and long time. The newly proposed computational methods provide an alternative way to study oncogenes, which can provide useful clues for further investigations on candidate genes. Considering the limitations of some previous computational methods, such as lack of learning procedures and terming genes as individual subjects, a novel computational method was proposed in this study. The method adopted the features derived from multiple protein networks, viewing proteins in a system level. A classic machine learning algorithm, random forest, was applied on these features to capture the essential characteristic of oncogenes, thereby building the prediction model. All genes except validated oncogenes were ranked with a measurement yielded by the prediction model. Top genes were quite different from potential oncogenes discovered by previous methods, and they can be confirmed to become novel oncogenes. It was indicated that the newly identified genes can be essential supplements for previous results. Hindawi 2020-09-23 /pmc/articles/PMC7530476/ /pubmed/33029511 http://dx.doi.org/10.1155/2020/5160396 Text en Copyright © 2020 Ran Zhao et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhao, Ran Hu, Bin Chen, Lei Zhou, Bo Identification of Latent Oncogenes with a Network Embedding Method and Random Forest |
title | Identification of Latent Oncogenes with a Network Embedding Method and Random Forest |
title_full | Identification of Latent Oncogenes with a Network Embedding Method and Random Forest |
title_fullStr | Identification of Latent Oncogenes with a Network Embedding Method and Random Forest |
title_full_unstemmed | Identification of Latent Oncogenes with a Network Embedding Method and Random Forest |
title_short | Identification of Latent Oncogenes with a Network Embedding Method and Random Forest |
title_sort | identification of latent oncogenes with a network embedding method and random forest |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7530476/ https://www.ncbi.nlm.nih.gov/pubmed/33029511 http://dx.doi.org/10.1155/2020/5160396 |
work_keys_str_mv | AT zhaoran identificationoflatentoncogeneswithanetworkembeddingmethodandrandomforest AT hubin identificationoflatentoncogeneswithanetworkembeddingmethodandrandomforest AT chenlei identificationoflatentoncogeneswithanetworkembeddingmethodandrandomforest AT zhoubo identificationoflatentoncogeneswithanetworkembeddingmethodandrandomforest |