Cargando…

EnACP: An Ensemble Learning Model for Identification of Anticancer Peptides

As cancer remains one of the main threats of human life, developing efficient cancer treatments is urgent. Anticancer peptides, which could overcome the significant side effects and poor results of traditional cancer treatments, have become a new potential alternative these years. However, identifyi...

Descripción completa

Detalles Bibliográficos
Autores principales: Ge, Ruiquan, Feng, Guanwen, Jing, Xiaoyang, Zhang, Renfeng, Wang, Pu, Wu, Qing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7438906/
https://www.ncbi.nlm.nih.gov/pubmed/32903636
http://dx.doi.org/10.3389/fgene.2020.00760
_version_ 1783572887403233280
author Ge, Ruiquan
Feng, Guanwen
Jing, Xiaoyang
Zhang, Renfeng
Wang, Pu
Wu, Qing
author_facet Ge, Ruiquan
Feng, Guanwen
Jing, Xiaoyang
Zhang, Renfeng
Wang, Pu
Wu, Qing
author_sort Ge, Ruiquan
collection PubMed
description As cancer remains one of the main threats of human life, developing efficient cancer treatments is urgent. Anticancer peptides, which could overcome the significant side effects and poor results of traditional cancer treatments, have become a new potential alternative these years. However, identifying anticancer peptides by experimental methods is time consuming and resource consuming, it is of great significance to develop effective computational tools to quickly and accurately identify potential anticancer peptides from amino acid sequences. For most current computational methods, feature representation plays a key role in their final successes. This study proposes a novel fast and accurate approach to identify anticancer peptides using diversified feature representations and ensemble learning method. For the feature representations, the information is encoded from multidimensional feature spaces, including sequence composition, sequence-order, physicochemical properties, etc. In order to better model the potential relationships of peptides, multiple ensemble classifiers, LightGBMs, are applied to detect the different feature sets at first. Then the obtained multiple outputs are used as inputs of the support vector machine classifier, which effectively identifies anticancer peptides. Experimental results on cross validation and independent test sets demonstrate that our method can achieve better or comparable performances compared with other state-of-the-art methods.
format Online
Article
Text
id pubmed-7438906
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-74389062020-09-03 EnACP: An Ensemble Learning Model for Identification of Anticancer Peptides Ge, Ruiquan Feng, Guanwen Jing, Xiaoyang Zhang, Renfeng Wang, Pu Wu, Qing Front Genet Genetics As cancer remains one of the main threats of human life, developing efficient cancer treatments is urgent. Anticancer peptides, which could overcome the significant side effects and poor results of traditional cancer treatments, have become a new potential alternative these years. However, identifying anticancer peptides by experimental methods is time consuming and resource consuming, it is of great significance to develop effective computational tools to quickly and accurately identify potential anticancer peptides from amino acid sequences. For most current computational methods, feature representation plays a key role in their final successes. This study proposes a novel fast and accurate approach to identify anticancer peptides using diversified feature representations and ensemble learning method. For the feature representations, the information is encoded from multidimensional feature spaces, including sequence composition, sequence-order, physicochemical properties, etc. In order to better model the potential relationships of peptides, multiple ensemble classifiers, LightGBMs, are applied to detect the different feature sets at first. Then the obtained multiple outputs are used as inputs of the support vector machine classifier, which effectively identifies anticancer peptides. Experimental results on cross validation and independent test sets demonstrate that our method can achieve better or comparable performances compared with other state-of-the-art methods. Frontiers Media S.A. 2020-07-30 /pmc/articles/PMC7438906/ /pubmed/32903636 http://dx.doi.org/10.3389/fgene.2020.00760 Text en Copyright © 2020 Ge, Feng, Jing, Zhang, Wang and Wu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Ge, Ruiquan
Feng, Guanwen
Jing, Xiaoyang
Zhang, Renfeng
Wang, Pu
Wu, Qing
EnACP: An Ensemble Learning Model for Identification of Anticancer Peptides
title EnACP: An Ensemble Learning Model for Identification of Anticancer Peptides
title_full EnACP: An Ensemble Learning Model for Identification of Anticancer Peptides
title_fullStr EnACP: An Ensemble Learning Model for Identification of Anticancer Peptides
title_full_unstemmed EnACP: An Ensemble Learning Model for Identification of Anticancer Peptides
title_short EnACP: An Ensemble Learning Model for Identification of Anticancer Peptides
title_sort enacp: an ensemble learning model for identification of anticancer peptides
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7438906/
https://www.ncbi.nlm.nih.gov/pubmed/32903636
http://dx.doi.org/10.3389/fgene.2020.00760
work_keys_str_mv AT geruiquan enacpanensemblelearningmodelforidentificationofanticancerpeptides
AT fengguanwen enacpanensemblelearningmodelforidentificationofanticancerpeptides
AT jingxiaoyang enacpanensemblelearningmodelforidentificationofanticancerpeptides
AT zhangrenfeng enacpanensemblelearningmodelforidentificationofanticancerpeptides
AT wangpu enacpanensemblelearningmodelforidentificationofanticancerpeptides
AT wuqing enacpanensemblelearningmodelforidentificationofanticancerpeptides