Cargando…
Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties
Anticancer peptides (ACPs) are a kind of bioactive peptides which could be used as a novel type of anticancer drug that has several advantages over chemistry-based drug, including high specificity, strong tumor penetration capacity, and low toxicity to normal cells. As the number of experimentally v...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8245499/ https://www.ncbi.nlm.nih.gov/pubmed/34193950 http://dx.doi.org/10.1038/s41598-021-93124-9 |
_version_ | 1783716126003298304 |
---|---|
author | Huang, Kai-Yao Tseng, Yi-Jhan Kao, Hui-Ju Chen, Chia-Hung Yang, Hsiao-Hsiang Weng, Shun-Long |
author_facet | Huang, Kai-Yao Tseng, Yi-Jhan Kao, Hui-Ju Chen, Chia-Hung Yang, Hsiao-Hsiang Weng, Shun-Long |
author_sort | Huang, Kai-Yao |
collection | PubMed |
description | Anticancer peptides (ACPs) are a kind of bioactive peptides which could be used as a novel type of anticancer drug that has several advantages over chemistry-based drug, including high specificity, strong tumor penetration capacity, and low toxicity to normal cells. As the number of experimentally verified bioactive peptides has increased significantly, various of in silico approaches are imperative for investigating the characteristics of ACPs. However, the lack of methods for investigating the differences in physicochemical properties of ACPs. In this study, we compared the N- and C-terminal amino acid composition for each peptide, there are three major subtypes of ACPs that are defined based on the distribution of positively charged residues. For the first time, we were motivated to develop a two-step machine learning model for identification of the subtypes of ACPs, which classify the input data into the corresponding group before applying the classifier. Further, to improve the predictive power, the hybrid feature sets were considered for prediction. Evaluation by five-fold cross-validation showed that the two-step model trained with sequence-based features and physicochemical properties was most effective in discriminating between ACPs and non-ACPs. The two-step model trained with the hybrid features performed well, with a sensitivity of 86.75%, a specificity of 85.75%, an accuracy of 86.08%, and a Matthews Correlation Coefficient value of 0.703. Furthermore, the model also consistently provides the effective performance in independent testing set, with sensitivity of 77.6%, specificity of 94.74%, accuracy of 88.99% and the MCC value reached 0.75. Finally, the two-step model has been implemented as a web-based tool, namely iDACP, which is now freely available at http://mer.hc.mmh.org.tw/iDACP/. |
format | Online Article Text |
id | pubmed-8245499 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-82454992021-07-06 Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties Huang, Kai-Yao Tseng, Yi-Jhan Kao, Hui-Ju Chen, Chia-Hung Yang, Hsiao-Hsiang Weng, Shun-Long Sci Rep Article Anticancer peptides (ACPs) are a kind of bioactive peptides which could be used as a novel type of anticancer drug that has several advantages over chemistry-based drug, including high specificity, strong tumor penetration capacity, and low toxicity to normal cells. As the number of experimentally verified bioactive peptides has increased significantly, various of in silico approaches are imperative for investigating the characteristics of ACPs. However, the lack of methods for investigating the differences in physicochemical properties of ACPs. In this study, we compared the N- and C-terminal amino acid composition for each peptide, there are three major subtypes of ACPs that are defined based on the distribution of positively charged residues. For the first time, we were motivated to develop a two-step machine learning model for identification of the subtypes of ACPs, which classify the input data into the corresponding group before applying the classifier. Further, to improve the predictive power, the hybrid feature sets were considered for prediction. Evaluation by five-fold cross-validation showed that the two-step model trained with sequence-based features and physicochemical properties was most effective in discriminating between ACPs and non-ACPs. The two-step model trained with the hybrid features performed well, with a sensitivity of 86.75%, a specificity of 85.75%, an accuracy of 86.08%, and a Matthews Correlation Coefficient value of 0.703. Furthermore, the model also consistently provides the effective performance in independent testing set, with sensitivity of 77.6%, specificity of 94.74%, accuracy of 88.99% and the MCC value reached 0.75. Finally, the two-step model has been implemented as a web-based tool, namely iDACP, which is now freely available at http://mer.hc.mmh.org.tw/iDACP/. Nature Publishing Group UK 2021-06-30 /pmc/articles/PMC8245499/ /pubmed/34193950 http://dx.doi.org/10.1038/s41598-021-93124-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Huang, Kai-Yao Tseng, Yi-Jhan Kao, Hui-Ju Chen, Chia-Hung Yang, Hsiao-Hsiang Weng, Shun-Long Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties |
title | Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties |
title_full | Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties |
title_fullStr | Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties |
title_full_unstemmed | Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties |
title_short | Identification of subtypes of anticancer peptides based on sequential features and physicochemical properties |
title_sort | identification of subtypes of anticancer peptides based on sequential features and physicochemical properties |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8245499/ https://www.ncbi.nlm.nih.gov/pubmed/34193950 http://dx.doi.org/10.1038/s41598-021-93124-9 |
work_keys_str_mv | AT huangkaiyao identificationofsubtypesofanticancerpeptidesbasedonsequentialfeaturesandphysicochemicalproperties AT tsengyijhan identificationofsubtypesofanticancerpeptidesbasedonsequentialfeaturesandphysicochemicalproperties AT kaohuiju identificationofsubtypesofanticancerpeptidesbasedonsequentialfeaturesandphysicochemicalproperties AT chenchiahung identificationofsubtypesofanticancerpeptidesbasedonsequentialfeaturesandphysicochemicalproperties AT yanghsiaohsiang identificationofsubtypesofanticancerpeptidesbasedonsequentialfeaturesandphysicochemicalproperties AT wengshunlong identificationofsubtypesofanticancerpeptidesbasedonsequentialfeaturesandphysicochemicalproperties |