Cargando…

Analysis and Prediction of the Critical Regions of Antimicrobial Peptides Based on Conditional Random Fields

Antimicrobial peptides (AMPs) are potent drug candidates against microbes such as bacteria, fungi, parasites, and viruses. The size of AMPs ranges from less than ten to hundreds of amino acids. Often only a few amino acids or the critical regions of antimicrobial proteins matter the functionality. A...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Kuan Y., Lin, Tung-pei, Shih, Ling-Yi, Wang, Chien-Kuo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4372350/
https://www.ncbi.nlm.nih.gov/pubmed/25803302
http://dx.doi.org/10.1371/journal.pone.0119490
_version_ 1782363166234116096
author Chang, Kuan Y.
Lin, Tung-pei
Shih, Ling-Yi
Wang, Chien-Kuo
author_facet Chang, Kuan Y.
Lin, Tung-pei
Shih, Ling-Yi
Wang, Chien-Kuo
author_sort Chang, Kuan Y.
collection PubMed
description Antimicrobial peptides (AMPs) are potent drug candidates against microbes such as bacteria, fungi, parasites, and viruses. The size of AMPs ranges from less than ten to hundreds of amino acids. Often only a few amino acids or the critical regions of antimicrobial proteins matter the functionality. Accurately predicting the AMP critical regions could benefit the experimental designs. However, no extensive analyses have been done specifically on the AMP critical regions and computational modeling on them is either non-existent or settled to other problems. With a focus on the AMP critical regions, we thus develop a computational model AMPcore by introducing a state-of-the-art machine learning method, conditional random fields. We generate a comprehensive dataset of 798 AMPs cores and a low similarity dataset of 510 representative AMP cores. AMPcore could reach a maximal accuracy of 90% and 0.79 Matthew’s correlation coefficient on the comprehensive dataset and a maximal accuracy of 83% and 0.66 MCC on the low similarity dataset. Our analyses of AMP cores follow what we know about AMPs: High in glycine and lysine, but low in aspartic acid, glutamic acid, and methionine; the abundance of α-helical structures; the dominance of positive net charges; the peculiarity of amphipathicity. Two amphipathic sequence motifs within the AMP cores, an amphipathic α-helix and an amphipathic π-helix, are revealed. In addition, a short sequence motif at the N-terminal boundary of AMP cores is reported for the first time: arginine at the P(-1) coupling with glycine at the P1 of AMP cores occurs the most, which might link to microbial cell adhesion.
format Online
Article
Text
id pubmed-4372350
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43723502015-04-04 Analysis and Prediction of the Critical Regions of Antimicrobial Peptides Based on Conditional Random Fields Chang, Kuan Y. Lin, Tung-pei Shih, Ling-Yi Wang, Chien-Kuo PLoS One Research Article Antimicrobial peptides (AMPs) are potent drug candidates against microbes such as bacteria, fungi, parasites, and viruses. The size of AMPs ranges from less than ten to hundreds of amino acids. Often only a few amino acids or the critical regions of antimicrobial proteins matter the functionality. Accurately predicting the AMP critical regions could benefit the experimental designs. However, no extensive analyses have been done specifically on the AMP critical regions and computational modeling on them is either non-existent or settled to other problems. With a focus on the AMP critical regions, we thus develop a computational model AMPcore by introducing a state-of-the-art machine learning method, conditional random fields. We generate a comprehensive dataset of 798 AMPs cores and a low similarity dataset of 510 representative AMP cores. AMPcore could reach a maximal accuracy of 90% and 0.79 Matthew’s correlation coefficient on the comprehensive dataset and a maximal accuracy of 83% and 0.66 MCC on the low similarity dataset. Our analyses of AMP cores follow what we know about AMPs: High in glycine and lysine, but low in aspartic acid, glutamic acid, and methionine; the abundance of α-helical structures; the dominance of positive net charges; the peculiarity of amphipathicity. Two amphipathic sequence motifs within the AMP cores, an amphipathic α-helix and an amphipathic π-helix, are revealed. In addition, a short sequence motif at the N-terminal boundary of AMP cores is reported for the first time: arginine at the P(-1) coupling with glycine at the P1 of AMP cores occurs the most, which might link to microbial cell adhesion. Public Library of Science 2015-03-24 /pmc/articles/PMC4372350/ /pubmed/25803302 http://dx.doi.org/10.1371/journal.pone.0119490 Text en © 2015 Chang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Chang, Kuan Y.
Lin, Tung-pei
Shih, Ling-Yi
Wang, Chien-Kuo
Analysis and Prediction of the Critical Regions of Antimicrobial Peptides Based on Conditional Random Fields
title Analysis and Prediction of the Critical Regions of Antimicrobial Peptides Based on Conditional Random Fields
title_full Analysis and Prediction of the Critical Regions of Antimicrobial Peptides Based on Conditional Random Fields
title_fullStr Analysis and Prediction of the Critical Regions of Antimicrobial Peptides Based on Conditional Random Fields
title_full_unstemmed Analysis and Prediction of the Critical Regions of Antimicrobial Peptides Based on Conditional Random Fields
title_short Analysis and Prediction of the Critical Regions of Antimicrobial Peptides Based on Conditional Random Fields
title_sort analysis and prediction of the critical regions of antimicrobial peptides based on conditional random fields
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4372350/
https://www.ncbi.nlm.nih.gov/pubmed/25803302
http://dx.doi.org/10.1371/journal.pone.0119490
work_keys_str_mv AT changkuany analysisandpredictionofthecriticalregionsofantimicrobialpeptidesbasedonconditionalrandomfields
AT lintungpei analysisandpredictionofthecriticalregionsofantimicrobialpeptidesbasedonconditionalrandomfields
AT shihlingyi analysisandpredictionofthecriticalregionsofantimicrobialpeptidesbasedonconditionalrandomfields
AT wangchienkuo analysisandpredictionofthecriticalregionsofantimicrobialpeptidesbasedonconditionalrandomfields