Cargando…

Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks

The assignment of secondary structure elements in protein conformations is necessary to interpret a protein model that has been established by computational methods. The process essentially involves labeling the amino acid residues with H (Helix), E (Strand), or C (Coil, also known as Loop). When pa...

Descripción completa

Detalles Bibliográficos
Autores principales: Saqib, Mohammad N., Kryś, Justyna D., Gront, Dominik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9220970/
https://www.ncbi.nlm.nih.gov/pubmed/35740966
http://dx.doi.org/10.3390/biom12060841
_version_ 1784732506208075776
author Saqib, Mohammad N.
Kryś, Justyna D.
Gront, Dominik
author_facet Saqib, Mohammad N.
Kryś, Justyna D.
Gront, Dominik
author_sort Saqib, Mohammad N.
collection PubMed
description The assignment of secondary structure elements in protein conformations is necessary to interpret a protein model that has been established by computational methods. The process essentially involves labeling the amino acid residues with H (Helix), E (Strand), or C (Coil, also known as Loop). When particular atoms are absent from an input protein structure, the procedure becomes more complicated, especially when only the alpha carbon locations are known. Various techniques have been tested and applied to this problem during the last forty years. The application of machine learning techniques is the most recent trend. This contribution presents the HECA classifier, which uses neural networks to assign protein secondary structure types. The technique exclusively employs C [Formula: see text] coordinates. The Keras (TensorFlow) library was used to implement and train the neural network model. The BioShell toolkit was used to calculate the neural network input features from raw coordinates. The study’s findings show that neural network-based methods may be successfully used to take on structure assignment challenges when only C [Formula: see text] trace is available. Thanks to the careful selection of input features, our approach’s accuracy (above 97%) exceeded that of the existing methods.
format Online
Article
Text
id pubmed-9220970
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-92209702022-06-24 Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks Saqib, Mohammad N. Kryś, Justyna D. Gront, Dominik Biomolecules Article The assignment of secondary structure elements in protein conformations is necessary to interpret a protein model that has been established by computational methods. The process essentially involves labeling the amino acid residues with H (Helix), E (Strand), or C (Coil, also known as Loop). When particular atoms are absent from an input protein structure, the procedure becomes more complicated, especially when only the alpha carbon locations are known. Various techniques have been tested and applied to this problem during the last forty years. The application of machine learning techniques is the most recent trend. This contribution presents the HECA classifier, which uses neural networks to assign protein secondary structure types. The technique exclusively employs C [Formula: see text] coordinates. The Keras (TensorFlow) library was used to implement and train the neural network model. The BioShell toolkit was used to calculate the neural network input features from raw coordinates. The study’s findings show that neural network-based methods may be successfully used to take on structure assignment challenges when only C [Formula: see text] trace is available. Thanks to the careful selection of input features, our approach’s accuracy (above 97%) exceeded that of the existing methods. MDPI 2022-06-17 /pmc/articles/PMC9220970/ /pubmed/35740966 http://dx.doi.org/10.3390/biom12060841 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Saqib, Mohammad N.
Kryś, Justyna D.
Gront, Dominik
Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks
title Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks
title_full Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks
title_fullStr Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks
title_full_unstemmed Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks
title_short Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks
title_sort automated protein secondary structure assignment from cα positions using neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9220970/
https://www.ncbi.nlm.nih.gov/pubmed/35740966
http://dx.doi.org/10.3390/biom12060841
work_keys_str_mv AT saqibmohammadn automatedproteinsecondarystructureassignmentfromcapositionsusingneuralnetworks
AT krysjustynad automatedproteinsecondarystructureassignmentfromcapositionsusingneuralnetworks
AT grontdominik automatedproteinsecondarystructureassignmentfromcapositionsusingneuralnetworks