Cargando…
PiPred – a deep-learning method for prediction of π-helices in protein sequences
Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-h...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6499831/ https://www.ncbi.nlm.nih.gov/pubmed/31053765 http://dx.doi.org/10.1038/s41598-019-43189-4 |
_version_ | 1783415837419372544 |
---|---|
author | Ludwiczak, Jan Winski, Aleksander da Silva Neto, Antonio Marinho Szczepaniak, Krzysztof Alva, Vikram Dunin-Horkawicz, Stanislaw |
author_facet | Ludwiczak, Jan Winski, Aleksander da Silva Neto, Antonio Marinho Szczepaniak, Krzysztof Alva, Vikram Dunin-Horkawicz, Stanislaw |
author_sort | Ludwiczak, Jan |
collection | PubMed |
description | Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-helices, the prediction of π-helices is a challenging task and none of the currently available secondary structure prediction methods tackle it. Here, we present PiPred, a neural network-based tool for predicting π-helices in protein sequences. By performing a rigorous benchmark we show that PiPred can detect π-helices with a per-residue precision of 48% and sensitivity of 46%. Interestingly, some of the α-helices mispredicted by PiPred as π-helices exhibit a geometry characteristic of π-helices. Also, despite being trained only with canonical π-helices, PiPred can identify 6-residue-long α/π-bulges. These observations suggest an even higher effective precision of the method and demonstrate that π-helices, α/π-bulges, and other helical deformations may impose similar constraints on sequences. PiPred is freely accessible at: https://toolkit.tuebingen.mpg.de/#/tools/quick2d. A standalone version is available for download at: https://github.com/labstructbioinf/PiPred, where we also provide the CB6133, CB513, CASP10, and CASP11 datasets, commonly used for training and validation of secondary structure prediction methods, with correctly annotated π-helices. |
format | Online Article Text |
id | pubmed-6499831 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-64998312019-05-17 PiPred – a deep-learning method for prediction of π-helices in protein sequences Ludwiczak, Jan Winski, Aleksander da Silva Neto, Antonio Marinho Szczepaniak, Krzysztof Alva, Vikram Dunin-Horkawicz, Stanislaw Sci Rep Article Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-helices, the prediction of π-helices is a challenging task and none of the currently available secondary structure prediction methods tackle it. Here, we present PiPred, a neural network-based tool for predicting π-helices in protein sequences. By performing a rigorous benchmark we show that PiPred can detect π-helices with a per-residue precision of 48% and sensitivity of 46%. Interestingly, some of the α-helices mispredicted by PiPred as π-helices exhibit a geometry characteristic of π-helices. Also, despite being trained only with canonical π-helices, PiPred can identify 6-residue-long α/π-bulges. These observations suggest an even higher effective precision of the method and demonstrate that π-helices, α/π-bulges, and other helical deformations may impose similar constraints on sequences. PiPred is freely accessible at: https://toolkit.tuebingen.mpg.de/#/tools/quick2d. A standalone version is available for download at: https://github.com/labstructbioinf/PiPred, where we also provide the CB6133, CB513, CASP10, and CASP11 datasets, commonly used for training and validation of secondary structure prediction methods, with correctly annotated π-helices. Nature Publishing Group UK 2019-05-03 /pmc/articles/PMC6499831/ /pubmed/31053765 http://dx.doi.org/10.1038/s41598-019-43189-4 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Ludwiczak, Jan Winski, Aleksander da Silva Neto, Antonio Marinho Szczepaniak, Krzysztof Alva, Vikram Dunin-Horkawicz, Stanislaw PiPred – a deep-learning method for prediction of π-helices in protein sequences |
title | PiPred – a deep-learning method for prediction of π-helices in protein sequences |
title_full | PiPred – a deep-learning method for prediction of π-helices in protein sequences |
title_fullStr | PiPred – a deep-learning method for prediction of π-helices in protein sequences |
title_full_unstemmed | PiPred – a deep-learning method for prediction of π-helices in protein sequences |
title_short | PiPred – a deep-learning method for prediction of π-helices in protein sequences |
title_sort | pipred – a deep-learning method for prediction of π-helices in protein sequences |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6499831/ https://www.ncbi.nlm.nih.gov/pubmed/31053765 http://dx.doi.org/10.1038/s41598-019-43189-4 |
work_keys_str_mv | AT ludwiczakjan pipredadeeplearningmethodforpredictionofphelicesinproteinsequences AT winskialeksander pipredadeeplearningmethodforpredictionofphelicesinproteinsequences AT dasilvanetoantoniomarinho pipredadeeplearningmethodforpredictionofphelicesinproteinsequences AT szczepaniakkrzysztof pipredadeeplearningmethodforpredictionofphelicesinproteinsequences AT alvavikram pipredadeeplearningmethodforpredictionofphelicesinproteinsequences AT duninhorkawiczstanislaw pipredadeeplearningmethodforpredictionofphelicesinproteinsequences |