Cargando…

PiPred – a deep-learning method for prediction of π-helices in protein sequences

Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-h...

Descripción completa

Detalles Bibliográficos
Autores principales: Ludwiczak, Jan, Winski, Aleksander, da Silva Neto, Antonio Marinho, Szczepaniak, Krzysztof, Alva, Vikram, Dunin-Horkawicz, Stanislaw
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6499831/
https://www.ncbi.nlm.nih.gov/pubmed/31053765
http://dx.doi.org/10.1038/s41598-019-43189-4
_version_ 1783415837419372544
author Ludwiczak, Jan
Winski, Aleksander
da Silva Neto, Antonio Marinho
Szczepaniak, Krzysztof
Alva, Vikram
Dunin-Horkawicz, Stanislaw
author_facet Ludwiczak, Jan
Winski, Aleksander
da Silva Neto, Antonio Marinho
Szczepaniak, Krzysztof
Alva, Vikram
Dunin-Horkawicz, Stanislaw
author_sort Ludwiczak, Jan
collection PubMed
description Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-helices, the prediction of π-helices is a challenging task and none of the currently available secondary structure prediction methods tackle it. Here, we present PiPred, a neural network-based tool for predicting π-helices in protein sequences. By performing a rigorous benchmark we show that PiPred can detect π-helices with a per-residue precision of 48% and sensitivity of 46%. Interestingly, some of the α-helices mispredicted by PiPred as π-helices exhibit a geometry characteristic of π-helices. Also, despite being trained only with canonical π-helices, PiPred can identify 6-residue-long α/π-bulges. These observations suggest an even higher effective precision of the method and demonstrate that π-helices, α/π-bulges, and other helical deformations may impose similar constraints on sequences. PiPred is freely accessible at: https://toolkit.tuebingen.mpg.de/#/tools/quick2d. A standalone version is available for download at: https://github.com/labstructbioinf/PiPred, where we also provide the CB6133, CB513, CASP10, and CASP11 datasets, commonly used for training and validation of secondary structure prediction methods, with correctly annotated π-helices.
format Online
Article
Text
id pubmed-6499831
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-64998312019-05-17 PiPred – a deep-learning method for prediction of π-helices in protein sequences Ludwiczak, Jan Winski, Aleksander da Silva Neto, Antonio Marinho Szczepaniak, Krzysztof Alva, Vikram Dunin-Horkawicz, Stanislaw Sci Rep Article Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-helices, the prediction of π-helices is a challenging task and none of the currently available secondary structure prediction methods tackle it. Here, we present PiPred, a neural network-based tool for predicting π-helices in protein sequences. By performing a rigorous benchmark we show that PiPred can detect π-helices with a per-residue precision of 48% and sensitivity of 46%. Interestingly, some of the α-helices mispredicted by PiPred as π-helices exhibit a geometry characteristic of π-helices. Also, despite being trained only with canonical π-helices, PiPred can identify 6-residue-long α/π-bulges. These observations suggest an even higher effective precision of the method and demonstrate that π-helices, α/π-bulges, and other helical deformations may impose similar constraints on sequences. PiPred is freely accessible at: https://toolkit.tuebingen.mpg.de/#/tools/quick2d. A standalone version is available for download at: https://github.com/labstructbioinf/PiPred, where we also provide the CB6133, CB513, CASP10, and CASP11 datasets, commonly used for training and validation of secondary structure prediction methods, with correctly annotated π-helices. Nature Publishing Group UK 2019-05-03 /pmc/articles/PMC6499831/ /pubmed/31053765 http://dx.doi.org/10.1038/s41598-019-43189-4 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Ludwiczak, Jan
Winski, Aleksander
da Silva Neto, Antonio Marinho
Szczepaniak, Krzysztof
Alva, Vikram
Dunin-Horkawicz, Stanislaw
PiPred – a deep-learning method for prediction of π-helices in protein sequences
title PiPred – a deep-learning method for prediction of π-helices in protein sequences
title_full PiPred – a deep-learning method for prediction of π-helices in protein sequences
title_fullStr PiPred – a deep-learning method for prediction of π-helices in protein sequences
title_full_unstemmed PiPred – a deep-learning method for prediction of π-helices in protein sequences
title_short PiPred – a deep-learning method for prediction of π-helices in protein sequences
title_sort pipred – a deep-learning method for prediction of π-helices in protein sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6499831/
https://www.ncbi.nlm.nih.gov/pubmed/31053765
http://dx.doi.org/10.1038/s41598-019-43189-4
work_keys_str_mv AT ludwiczakjan pipredadeeplearningmethodforpredictionofphelicesinproteinsequences
AT winskialeksander pipredadeeplearningmethodforpredictionofphelicesinproteinsequences
AT dasilvanetoantoniomarinho pipredadeeplearningmethodforpredictionofphelicesinproteinsequences
AT szczepaniakkrzysztof pipredadeeplearningmethodforpredictionofphelicesinproteinsequences
AT alvavikram pipredadeeplearningmethodforpredictionofphelicesinproteinsequences
AT duninhorkawiczstanislaw pipredadeeplearningmethodforpredictionofphelicesinproteinsequences