Cargando…

MLACP: machine-learning-based prediction of anticancer peptides

Cancer is the second leading cause of death globally, and use of therapeutic peptides to target and kill cancer cells has received considerable attention in recent years. Identification of anticancer peptides (ACPs) through wet-lab experimentation is expensive and often time consuming; therefore, de...

Descripción completa

Detalles Bibliográficos
Autores principales: Manavalan, Balachandran, Basith, Shaherin, Shin, Tae Hwan, Choi, Sun, Kim, Myeong Ok, Lee, Gwang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Impact Journals LLC 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5652333/
https://www.ncbi.nlm.nih.gov/pubmed/29100375
http://dx.doi.org/10.18632/oncotarget.20365
_version_ 1783273048613322752
author Manavalan, Balachandran
Basith, Shaherin
Shin, Tae Hwan
Choi, Sun
Kim, Myeong Ok
Lee, Gwang
author_facet Manavalan, Balachandran
Basith, Shaherin
Shin, Tae Hwan
Choi, Sun
Kim, Myeong Ok
Lee, Gwang
author_sort Manavalan, Balachandran
collection PubMed
description Cancer is the second leading cause of death globally, and use of therapeutic peptides to target and kill cancer cells has received considerable attention in recent years. Identification of anticancer peptides (ACPs) through wet-lab experimentation is expensive and often time consuming; therefore, development of an efficient computational method is essential to identify potential ACP candidates prior to in vitro experimentation. In this study, we developed support vector machine- and random forest-based machine-learning methods for the prediction of ACPs using the features calculated from the amino acid sequence, including amino acid composition, dipeptide composition, atomic composition, and physicochemical properties. We trained our methods using the Tyagi-B dataset and determined the machine parameters by 10-fold cross-validation. Furthermore, we evaluated the performance of our methods on two benchmarking datasets, with our results showing that the random forest-based method outperformed the existing methods with an average accuracy and Matthews correlation coefficient value of 88.7% and 0.78, respectively. To assist the scientific community, we also developed a publicly accessible web server at www.thegleelab.org/MLACP.html.
format Online
Article
Text
id pubmed-5652333
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Impact Journals LLC
record_format MEDLINE/PubMed
spelling pubmed-56523332017-11-02 MLACP: machine-learning-based prediction of anticancer peptides Manavalan, Balachandran Basith, Shaherin Shin, Tae Hwan Choi, Sun Kim, Myeong Ok Lee, Gwang Oncotarget Research Paper Cancer is the second leading cause of death globally, and use of therapeutic peptides to target and kill cancer cells has received considerable attention in recent years. Identification of anticancer peptides (ACPs) through wet-lab experimentation is expensive and often time consuming; therefore, development of an efficient computational method is essential to identify potential ACP candidates prior to in vitro experimentation. In this study, we developed support vector machine- and random forest-based machine-learning methods for the prediction of ACPs using the features calculated from the amino acid sequence, including amino acid composition, dipeptide composition, atomic composition, and physicochemical properties. We trained our methods using the Tyagi-B dataset and determined the machine parameters by 10-fold cross-validation. Furthermore, we evaluated the performance of our methods on two benchmarking datasets, with our results showing that the random forest-based method outperformed the existing methods with an average accuracy and Matthews correlation coefficient value of 88.7% and 0.78, respectively. To assist the scientific community, we also developed a publicly accessible web server at www.thegleelab.org/MLACP.html. Impact Journals LLC 2017-08-19 /pmc/articles/PMC5652333/ /pubmed/29100375 http://dx.doi.org/10.18632/oncotarget.20365 Text en Copyright: © 2017 Manavalan et al. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License 3.0 (http://creativecommons.org/licenses/by/3.0/) (CC BY 3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Paper
Manavalan, Balachandran
Basith, Shaherin
Shin, Tae Hwan
Choi, Sun
Kim, Myeong Ok
Lee, Gwang
MLACP: machine-learning-based prediction of anticancer peptides
title MLACP: machine-learning-based prediction of anticancer peptides
title_full MLACP: machine-learning-based prediction of anticancer peptides
title_fullStr MLACP: machine-learning-based prediction of anticancer peptides
title_full_unstemmed MLACP: machine-learning-based prediction of anticancer peptides
title_short MLACP: machine-learning-based prediction of anticancer peptides
title_sort mlacp: machine-learning-based prediction of anticancer peptides
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5652333/
https://www.ncbi.nlm.nih.gov/pubmed/29100375
http://dx.doi.org/10.18632/oncotarget.20365
work_keys_str_mv AT manavalanbalachandran mlacpmachinelearningbasedpredictionofanticancerpeptides
AT basithshaherin mlacpmachinelearningbasedpredictionofanticancerpeptides
AT shintaehwan mlacpmachinelearningbasedpredictionofanticancerpeptides
AT choisun mlacpmachinelearningbasedpredictionofanticancerpeptides
AT kimmyeongok mlacpmachinelearningbasedpredictionofanticancerpeptides
AT leegwang mlacpmachinelearningbasedpredictionofanticancerpeptides