Cargando…

Large-Scale Gastric Cancer Susceptibility Gene Identification Based on Gradient Boosting Decision Tree

The early clinical symptoms of gastric cancer are not obvious, and metastasis may have occurred at the time of treatment. Poor prognosis is one of the important reasons for the high mortality of gastric cancer. Therefore, the identification of gastric cancer-related genes can be used as relevant mar...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Qing, Zhang, Ji, Bao, Banghe, Zhang, Fan, Zhou, Jie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8793069/
https://www.ncbi.nlm.nih.gov/pubmed/35096975
http://dx.doi.org/10.3389/fmolb.2021.815243
_version_ 1784640516226285568
author Chen, Qing
Zhang, Ji
Bao, Banghe
Zhang, Fan
Zhou, Jie
author_facet Chen, Qing
Zhang, Ji
Bao, Banghe
Zhang, Fan
Zhou, Jie
author_sort Chen, Qing
collection PubMed
description The early clinical symptoms of gastric cancer are not obvious, and metastasis may have occurred at the time of treatment. Poor prognosis is one of the important reasons for the high mortality of gastric cancer. Therefore, the identification of gastric cancer-related genes can be used as relevant markers for diagnosis and treatment to improve diagnosis precision and guide personalized treatment. In order to further reveal the pathogenesis of gastric cancer at the gene level, we proposed a method based on Gradient Boosting Decision Tree (GBDT) to identify the susceptible genes of gastric cancer through gene interaction network. Based on the known genes related to gastric cancer, we collected more genes which can interact with them and constructed a gene interaction network. Random Walk was used to extract network association of each gene and we used GBDT to identify the gastric cancer-related genes. To verify the AUC and AUPR of our algorithm, we implemented 10-fold cross-validation. GBDT achieved AUC as 0.89 and AUPR as 0.81. We selected four other methods to compare with GBDT and found GBDT performed best.
format Online
Article
Text
id pubmed-8793069
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-87930692022-01-28 Large-Scale Gastric Cancer Susceptibility Gene Identification Based on Gradient Boosting Decision Tree Chen, Qing Zhang, Ji Bao, Banghe Zhang, Fan Zhou, Jie Front Mol Biosci Molecular Biosciences The early clinical symptoms of gastric cancer are not obvious, and metastasis may have occurred at the time of treatment. Poor prognosis is one of the important reasons for the high mortality of gastric cancer. Therefore, the identification of gastric cancer-related genes can be used as relevant markers for diagnosis and treatment to improve diagnosis precision and guide personalized treatment. In order to further reveal the pathogenesis of gastric cancer at the gene level, we proposed a method based on Gradient Boosting Decision Tree (GBDT) to identify the susceptible genes of gastric cancer through gene interaction network. Based on the known genes related to gastric cancer, we collected more genes which can interact with them and constructed a gene interaction network. Random Walk was used to extract network association of each gene and we used GBDT to identify the gastric cancer-related genes. To verify the AUC and AUPR of our algorithm, we implemented 10-fold cross-validation. GBDT achieved AUC as 0.89 and AUPR as 0.81. We selected four other methods to compare with GBDT and found GBDT performed best. Frontiers Media S.A. 2022-01-13 /pmc/articles/PMC8793069/ /pubmed/35096975 http://dx.doi.org/10.3389/fmolb.2021.815243 Text en Copyright © 2022 Chen, Zhang, Bao, Zhang and Zhou. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Chen, Qing
Zhang, Ji
Bao, Banghe
Zhang, Fan
Zhou, Jie
Large-Scale Gastric Cancer Susceptibility Gene Identification Based on Gradient Boosting Decision Tree
title Large-Scale Gastric Cancer Susceptibility Gene Identification Based on Gradient Boosting Decision Tree
title_full Large-Scale Gastric Cancer Susceptibility Gene Identification Based on Gradient Boosting Decision Tree
title_fullStr Large-Scale Gastric Cancer Susceptibility Gene Identification Based on Gradient Boosting Decision Tree
title_full_unstemmed Large-Scale Gastric Cancer Susceptibility Gene Identification Based on Gradient Boosting Decision Tree
title_short Large-Scale Gastric Cancer Susceptibility Gene Identification Based on Gradient Boosting Decision Tree
title_sort large-scale gastric cancer susceptibility gene identification based on gradient boosting decision tree
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8793069/
https://www.ncbi.nlm.nih.gov/pubmed/35096975
http://dx.doi.org/10.3389/fmolb.2021.815243
work_keys_str_mv AT chenqing largescalegastriccancersusceptibilitygeneidentificationbasedongradientboostingdecisiontree
AT zhangji largescalegastriccancersusceptibilitygeneidentificationbasedongradientboostingdecisiontree
AT baobanghe largescalegastriccancersusceptibilitygeneidentificationbasedongradientboostingdecisiontree
AT zhangfan largescalegastriccancersusceptibilitygeneidentificationbasedongradientboostingdecisiontree
AT zhoujie largescalegastriccancersusceptibilitygeneidentificationbasedongradientboostingdecisiontree