Cargando…

Assigning biological function using hidden signatures in cystine-stabilized peptide sequences

Cystine-stabilized peptides have great utility as they naturally block ion channels, inhibit acetylcholine receptors, or inactivate microbes. However, only a tiny fraction of these peptides has been characterized. Exploration for novel peptides most efficiently starts with the identification of cand...

Descripción completa

Detalles Bibliográficos
Autores principales: Islam, S. M. Ashiqul, Kearney, Christopher Michel, Baker, Erich J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5998126/
https://www.ncbi.nlm.nih.gov/pubmed/29899538
http://dx.doi.org/10.1038/s41598-018-27177-8
_version_ 1783331187395133440
author Islam, S. M. Ashiqul
Kearney, Christopher Michel
Baker, Erich J.
author_facet Islam, S. M. Ashiqul
Kearney, Christopher Michel
Baker, Erich J.
author_sort Islam, S. M. Ashiqul
collection PubMed
description Cystine-stabilized peptides have great utility as they naturally block ion channels, inhibit acetylcholine receptors, or inactivate microbes. However, only a tiny fraction of these peptides has been characterized. Exploration for novel peptides most efficiently starts with the identification of candidates from genome sequence data. Unfortunately, though cystine-stabilized peptides have shared structures, they have low DNA sequence similarity, restricting the utility of BLAST and even more powerful sequence alignment-based annotation algorithms, such as PSI-BLAST and HMMER. In contrast, a supervised machine learning approach may improve discovery and function assignment of these peptides. To this end, we employed our previously described m-NGSG algorithm, which utilizes hidden signatures embedded in peptide primary sequences that define and categorize structural or functional classes of peptides. From the generalized m-NGSG framework, we derived five specific models that categorize cystine-stabilized peptide sequences into specific functional classes. When compared with PSI-BLAST, HMMER and existing function-specific models, our novel approach (named CSPred) consistently demonstrates superior performance in discovery and function-assignment. We also report an interactive version of CSPred, available through download (https://bitbucket.org/sm_islam/cystine-stabilized-proteins/src) or web interface (watson.ecs.baylor.edu/cspred), for the discovery of cystine-stabilized peptides of specific function from genomic datasets and for genome annotation. We fully describe, in the Availability section following the Discussion, the quick and simple usage of the CsPred website to automatically deliver function assignments for batch submissions of peptide sequences.
format Online
Article
Text
id pubmed-5998126
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-59981262018-06-21 Assigning biological function using hidden signatures in cystine-stabilized peptide sequences Islam, S. M. Ashiqul Kearney, Christopher Michel Baker, Erich J. Sci Rep Article Cystine-stabilized peptides have great utility as they naturally block ion channels, inhibit acetylcholine receptors, or inactivate microbes. However, only a tiny fraction of these peptides has been characterized. Exploration for novel peptides most efficiently starts with the identification of candidates from genome sequence data. Unfortunately, though cystine-stabilized peptides have shared structures, they have low DNA sequence similarity, restricting the utility of BLAST and even more powerful sequence alignment-based annotation algorithms, such as PSI-BLAST and HMMER. In contrast, a supervised machine learning approach may improve discovery and function assignment of these peptides. To this end, we employed our previously described m-NGSG algorithm, which utilizes hidden signatures embedded in peptide primary sequences that define and categorize structural or functional classes of peptides. From the generalized m-NGSG framework, we derived five specific models that categorize cystine-stabilized peptide sequences into specific functional classes. When compared with PSI-BLAST, HMMER and existing function-specific models, our novel approach (named CSPred) consistently demonstrates superior performance in discovery and function-assignment. We also report an interactive version of CSPred, available through download (https://bitbucket.org/sm_islam/cystine-stabilized-proteins/src) or web interface (watson.ecs.baylor.edu/cspred), for the discovery of cystine-stabilized peptides of specific function from genomic datasets and for genome annotation. We fully describe, in the Availability section following the Discussion, the quick and simple usage of the CsPred website to automatically deliver function assignments for batch submissions of peptide sequences. Nature Publishing Group UK 2018-06-13 /pmc/articles/PMC5998126/ /pubmed/29899538 http://dx.doi.org/10.1038/s41598-018-27177-8 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Islam, S. M. Ashiqul
Kearney, Christopher Michel
Baker, Erich J.
Assigning biological function using hidden signatures in cystine-stabilized peptide sequences
title Assigning biological function using hidden signatures in cystine-stabilized peptide sequences
title_full Assigning biological function using hidden signatures in cystine-stabilized peptide sequences
title_fullStr Assigning biological function using hidden signatures in cystine-stabilized peptide sequences
title_full_unstemmed Assigning biological function using hidden signatures in cystine-stabilized peptide sequences
title_short Assigning biological function using hidden signatures in cystine-stabilized peptide sequences
title_sort assigning biological function using hidden signatures in cystine-stabilized peptide sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5998126/
https://www.ncbi.nlm.nih.gov/pubmed/29899538
http://dx.doi.org/10.1038/s41598-018-27177-8
work_keys_str_mv AT islamsmashiqul assigningbiologicalfunctionusinghiddensignaturesincystinestabilizedpeptidesequences
AT kearneychristophermichel assigningbiologicalfunctionusinghiddensignaturesincystinestabilizedpeptidesequences
AT bakererichj assigningbiologicalfunctionusinghiddensignaturesincystinestabilizedpeptidesequences