Cargando…

PiPred – a deep-learning method for prediction of π-helices in protein sequences

Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-h...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ludwiczak, Jan, Winski, Aleksander, da Silva Neto, Antonio Marinho, Szczepaniak, Krzysztof, Alva, Vikram, Dunin-Horkawicz, Stanislaw
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2019
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6499831/ https://www.ncbi.nlm.nih.gov/pubmed/31053765 http://dx.doi.org/10.1038/s41598-019-43189-4

_version_	1783415837419372544
author	Ludwiczak, Jan Winski, Aleksander da Silva Neto, Antonio Marinho Szczepaniak, Krzysztof Alva, Vikram Dunin-Horkawicz, Stanislaw
author_facet	Ludwiczak, Jan Winski, Aleksander da Silva Neto, Antonio Marinho Szczepaniak, Krzysztof Alva, Vikram Dunin-Horkawicz, Stanislaw
author_sort	Ludwiczak, Jan
collection	PubMed
description	Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-helices, the prediction of π-helices is a challenging task and none of the currently available secondary structure prediction methods tackle it. Here, we present PiPred, a neural network-based tool for predicting π-helices in protein sequences. By performing a rigorous benchmark we show that PiPred can detect π-helices with a per-residue precision of 48% and sensitivity of 46%. Interestingly, some of the α-helices mispredicted by PiPred as π-helices exhibit a geometry characteristic of π-helices. Also, despite being trained only with canonical π-helices, PiPred can identify 6-residue-long α/π-bulges. These observations suggest an even higher effective precision of the method and demonstrate that π-helices, α/π-bulges, and other helical deformations may impose similar constraints on sequences. PiPred is freely accessible at: https://toolkit.tuebingen.mpg.de/#/tools/quick2d. A standalone version is available for download at: https://github.com/labstructbioinf/PiPred, where we also provide the CB6133, CB513, CASP10, and CASP11 datasets, commonly used for training and validation of secondary structure prediction methods, with correctly annotated π-helices.
format	Online Article Text
id	pubmed-6499831
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-64998312019-05-17 PiPred – a deep-learning method for prediction of π-helices in protein sequences Ludwiczak, Jan Winski, Aleksander da Silva Neto, Antonio Marinho Szczepaniak, Krzysztof Alva, Vikram Dunin-Horkawicz, Stanislaw Sci Rep Article Canonical π-helices are short, relatively unstable secondary structure elements found in proteins. They comprise seven or more residues and are present in 15% of all known protein structures, often in functionally important regions such as ligand- and ion-binding sites. Given their similarity to α-helices, the prediction of π-helices is a challenging task and none of the currently available secondary structure prediction methods tackle it. Here, we present PiPred, a neural network-based tool for predicting π-helices in protein sequences. By performing a rigorous benchmark we show that PiPred can detect π-helices with a per-residue precision of 48% and sensitivity of 46%. Interestingly, some of the α-helices mispredicted by PiPred as π-helices exhibit a geometry characteristic of π-helices. Also, despite being trained only with canonical π-helices, PiPred can identify 6-residue-long α/π-bulges. These observations suggest an even higher effective precision of the method and demonstrate that π-helices, α/π-bulges, and other helical deformations may impose similar constraints on sequences. PiPred is freely accessible at: https://toolkit.tuebingen.mpg.de/#/tools/quick2d. A standalone version is available for download at: https://github.com/labstructbioinf/PiPred, where we also provide the CB6133, CB513, CASP10, and CASP11 datasets, commonly used for training and validation of secondary structure prediction methods, with correctly annotated π-helices. Nature Publishing Group UK 2019-05-03 /pmc/articles/PMC6499831/ /pubmed/31053765 http://dx.doi.org/10.1038/s41598-019-43189-4 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Ludwiczak, Jan Winski, Aleksander da Silva Neto, Antonio Marinho Szczepaniak, Krzysztof Alva, Vikram Dunin-Horkawicz, Stanislaw PiPred – a deep-learning method for prediction of π-helices in protein sequences
title	PiPred – a deep-learning method for prediction of π-helices in protein sequences
title_full	PiPred – a deep-learning method for prediction of π-helices in protein sequences
title_fullStr	PiPred – a deep-learning method for prediction of π-helices in protein sequences
title_full_unstemmed	PiPred – a deep-learning method for prediction of π-helices in protein sequences
title_short	PiPred – a deep-learning method for prediction of π-helices in protein sequences
title_sort	pipred – a deep-learning method for prediction of π-helices in protein sequences
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6499831/ https://www.ncbi.nlm.nih.gov/pubmed/31053765 http://dx.doi.org/10.1038/s41598-019-43189-4
work_keys_str_mv	AT ludwiczakjan pipredadeeplearningmethodforpredictionofphelicesinproteinsequences AT winskialeksander pipredadeeplearningmethodforpredictionofphelicesinproteinsequences AT dasilvanetoantoniomarinho pipredadeeplearningmethodforpredictionofphelicesinproteinsequences AT szczepaniakkrzysztof pipredadeeplearningmethodforpredictionofphelicesinproteinsequences AT alvavikram pipredadeeplearningmethodforpredictionofphelicesinproteinsequences AT duninhorkawiczstanislaw pipredadeeplearningmethodforpredictionofphelicesinproteinsequences

PiPred – a deep-learning method for prediction of π-helices in protein sequences

Ejemplares similares