Cargando…

A novel method for achieving an optimal classification of the proteinogenic amino acids

The classification of proteinogenic amino acids is crucial for understanding their commonalities as well as their differences to provide a hint for why life settled on the usage of precisely those amino acids. It is also crucial for predicting electrostatic, hydrophobic, stacking and other interacti...

Descripción completa

Detalles Bibliográficos
Autores principales: Then, Andre, Mácha, Karel, Ibrahim, Bashar, Schuster, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7501307/
https://www.ncbi.nlm.nih.gov/pubmed/32948819
http://dx.doi.org/10.1038/s41598-020-72174-5
_version_ 1783584012336365568
author Then, Andre
Mácha, Karel
Ibrahim, Bashar
Schuster, Stefan
author_facet Then, Andre
Mácha, Karel
Ibrahim, Bashar
Schuster, Stefan
author_sort Then, Andre
collection PubMed
description The classification of proteinogenic amino acids is crucial for understanding their commonalities as well as their differences to provide a hint for why life settled on the usage of precisely those amino acids. It is also crucial for predicting electrostatic, hydrophobic, stacking and other interactions, for assessing conservation in multiple alignments and many other applications. While several methods have been proposed to find “the” optimal classification, they have several shortcomings, such as the lack of efficiency and interpretability or an unnecessarily high number of discriminating features. In this study, we propose a novel method involving a repeated binary separation via a minimum amount of five features (such as hydrophobicity or volume) expressed by numerical values for amino acid characteristics. The features are extracted from the AAindex database. By simple separation at the medians, we successfully derive the five properties volume, electron–ion-interaction potential, hydrophobicity, α-helix propensity, and π-helix propensity. We extend our analysis to separations other than by the median. We further score our combinations based on how natural the separations are.
format Online
Article
Text
id pubmed-7501307
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-75013072020-09-22 A novel method for achieving an optimal classification of the proteinogenic amino acids Then, Andre Mácha, Karel Ibrahim, Bashar Schuster, Stefan Sci Rep Article The classification of proteinogenic amino acids is crucial for understanding their commonalities as well as their differences to provide a hint for why life settled on the usage of precisely those amino acids. It is also crucial for predicting electrostatic, hydrophobic, stacking and other interactions, for assessing conservation in multiple alignments and many other applications. While several methods have been proposed to find “the” optimal classification, they have several shortcomings, such as the lack of efficiency and interpretability or an unnecessarily high number of discriminating features. In this study, we propose a novel method involving a repeated binary separation via a minimum amount of five features (such as hydrophobicity or volume) expressed by numerical values for amino acid characteristics. The features are extracted from the AAindex database. By simple separation at the medians, we successfully derive the five properties volume, electron–ion-interaction potential, hydrophobicity, α-helix propensity, and π-helix propensity. We extend our analysis to separations other than by the median. We further score our combinations based on how natural the separations are. Nature Publishing Group UK 2020-09-18 /pmc/articles/PMC7501307/ /pubmed/32948819 http://dx.doi.org/10.1038/s41598-020-72174-5 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Then, Andre
Mácha, Karel
Ibrahim, Bashar
Schuster, Stefan
A novel method for achieving an optimal classification of the proteinogenic amino acids
title A novel method for achieving an optimal classification of the proteinogenic amino acids
title_full A novel method for achieving an optimal classification of the proteinogenic amino acids
title_fullStr A novel method for achieving an optimal classification of the proteinogenic amino acids
title_full_unstemmed A novel method for achieving an optimal classification of the proteinogenic amino acids
title_short A novel method for achieving an optimal classification of the proteinogenic amino acids
title_sort novel method for achieving an optimal classification of the proteinogenic amino acids
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7501307/
https://www.ncbi.nlm.nih.gov/pubmed/32948819
http://dx.doi.org/10.1038/s41598-020-72174-5
work_keys_str_mv AT thenandre anovelmethodforachievinganoptimalclassificationoftheproteinogenicaminoacids
AT machakarel anovelmethodforachievinganoptimalclassificationoftheproteinogenicaminoacids
AT ibrahimbashar anovelmethodforachievinganoptimalclassificationoftheproteinogenicaminoacids
AT schusterstefan anovelmethodforachievinganoptimalclassificationoftheproteinogenicaminoacids
AT thenandre novelmethodforachievinganoptimalclassificationoftheproteinogenicaminoacids
AT machakarel novelmethodforachievinganoptimalclassificationoftheproteinogenicaminoacids
AT ibrahimbashar novelmethodforachievinganoptimalclassificationoftheproteinogenicaminoacids
AT schusterstefan novelmethodforachievinganoptimalclassificationoftheproteinogenicaminoacids