Cargando…

ACP-GBDT: An improved anticancer peptide identification method with gradient boosting decision tree

Cancer is one of the most dangerous diseases in the world, killing millions of people every year. Drugs composed of anticancer peptides have been used to treat cancer with low side effects in recent years. Therefore, identifying anticancer peptides has become a focus of research. In this study, an i...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yanjuan, Ma, Di, Chen, Dong, Chen, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10090421/
https://www.ncbi.nlm.nih.gov/pubmed/37065496
http://dx.doi.org/10.3389/fgene.2023.1165765
_version_ 1785022959269707776
author Li, Yanjuan
Ma, Di
Chen, Dong
Chen, Yu
author_facet Li, Yanjuan
Ma, Di
Chen, Dong
Chen, Yu
author_sort Li, Yanjuan
collection PubMed
description Cancer is one of the most dangerous diseases in the world, killing millions of people every year. Drugs composed of anticancer peptides have been used to treat cancer with low side effects in recent years. Therefore, identifying anticancer peptides has become a focus of research. In this study, an improved anticancer peptide predictor named ACP-GBDT, based on gradient boosting decision tree (GBDT) and sequence information, is proposed. To encode the peptide sequences included in the anticancer peptide dataset, ACP-GBDT uses a merged-feature composed of AAIndex and SVMProt-188D. A GBDT is adopted to train the prediction model in ACP-GBDT. Independent testing and ten-fold cross-validation show that ACP-GBDT can effectively distinguish anticancer peptides from non-anticancer ones. The comparison results of the benchmark dataset show that ACP-GBDT is simpler and more effective than other existing anticancer peptide prediction methods.
format Online
Article
Text
id pubmed-10090421
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100904212023-04-13 ACP-GBDT: An improved anticancer peptide identification method with gradient boosting decision tree Li, Yanjuan Ma, Di Chen, Dong Chen, Yu Front Genet Genetics Cancer is one of the most dangerous diseases in the world, killing millions of people every year. Drugs composed of anticancer peptides have been used to treat cancer with low side effects in recent years. Therefore, identifying anticancer peptides has become a focus of research. In this study, an improved anticancer peptide predictor named ACP-GBDT, based on gradient boosting decision tree (GBDT) and sequence information, is proposed. To encode the peptide sequences included in the anticancer peptide dataset, ACP-GBDT uses a merged-feature composed of AAIndex and SVMProt-188D. A GBDT is adopted to train the prediction model in ACP-GBDT. Independent testing and ten-fold cross-validation show that ACP-GBDT can effectively distinguish anticancer peptides from non-anticancer ones. The comparison results of the benchmark dataset show that ACP-GBDT is simpler and more effective than other existing anticancer peptide prediction methods. Frontiers Media S.A. 2023-03-29 /pmc/articles/PMC10090421/ /pubmed/37065496 http://dx.doi.org/10.3389/fgene.2023.1165765 Text en Copyright © 2023 Li, Ma, Chen and Chen. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Li, Yanjuan
Ma, Di
Chen, Dong
Chen, Yu
ACP-GBDT: An improved anticancer peptide identification method with gradient boosting decision tree
title ACP-GBDT: An improved anticancer peptide identification method with gradient boosting decision tree
title_full ACP-GBDT: An improved anticancer peptide identification method with gradient boosting decision tree
title_fullStr ACP-GBDT: An improved anticancer peptide identification method with gradient boosting decision tree
title_full_unstemmed ACP-GBDT: An improved anticancer peptide identification method with gradient boosting decision tree
title_short ACP-GBDT: An improved anticancer peptide identification method with gradient boosting decision tree
title_sort acp-gbdt: an improved anticancer peptide identification method with gradient boosting decision tree
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10090421/
https://www.ncbi.nlm.nih.gov/pubmed/37065496
http://dx.doi.org/10.3389/fgene.2023.1165765
work_keys_str_mv AT liyanjuan acpgbdtanimprovedanticancerpeptideidentificationmethodwithgradientboostingdecisiontree
AT madi acpgbdtanimprovedanticancerpeptideidentificationmethodwithgradientboostingdecisiontree
AT chendong acpgbdtanimprovedanticancerpeptideidentificationmethodwithgradientboostingdecisiontree
AT chenyu acpgbdtanimprovedanticancerpeptideidentificationmethodwithgradientboostingdecisiontree