Cargando…
Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions
This study introduces a set of fuzzy spherically truncated three-dimensional (3D) multi-linear descriptors for proteins. These indices codify geometric structural information from kth spherically truncated spatial-(dis)similarity two-tuple and three-tuple tensors. The coefficients of these truncated...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585278/ https://www.ncbi.nlm.nih.gov/pubmed/36277354 http://dx.doi.org/10.3389/fchem.2022.959143 |
_version_ | 1784813457300783104 |
---|---|
author | Contreras-Torres, Ernesto Marrero-Ponce, Yovani Terán, Julio E. Agüero-Chapin, Guillermin Antunes, Agostinho García-Jacas, César R. |
author_facet | Contreras-Torres, Ernesto Marrero-Ponce, Yovani Terán, Julio E. Agüero-Chapin, Guillermin Antunes, Agostinho García-Jacas, César R. |
author_sort | Contreras-Torres, Ernesto |
collection | PubMed |
description | This study introduces a set of fuzzy spherically truncated three-dimensional (3D) multi-linear descriptors for proteins. These indices codify geometric structural information from kth spherically truncated spatial-(dis)similarity two-tuple and three-tuple tensors. The coefficients of these truncated tensors are calculated by applying a smoothing value to the 3D structural encoding based on the relationships between two and three amino acids of a protein embedded into a sphere. At considering, the geometrical center of the protein matches with center of the sphere, the distance between each amino acid involved in any specific interaction and the geometrical center of the protein can be computed. Then, the fuzzy membership degree of each amino acid from an spherical region of interest is computed by fuzzy membership functions (FMFs). The truncation value is finally a combination of the membership degrees from interacting amino acids, by applying the arithmetic mean as fusion rule. Several fuzzy membership functions with diverse biases on the calculation of amino acids memberships (e.g., Z-shaped (close to the center), PI-shaped (middle region), and A-Gaussian (far from the center)) were considered as well as traditional truncation functions (e.g., Switching). Such truncation functions were comparatively evaluated by exploring: 1) the frequency of membership degrees, 2) the variability and orthogonality analyses among them based on the Shannon Entropy’s and Principal Component’s methods, respectively, and 3) the prediction performance of alignment-free prediction of protein folding rates and structural classes. These analyses unraveled the singularity of the proposed fuzzy spherically truncated MDs with respect to the classical (non-truncated) ones and respect to the MDs truncated with traditional functions. They also showed an improved prediction power by attaining an external correlation coefficient of 95.82% in the folding rate modelling and an accuracy of 100% in distinguishing structural protein classes. These outcomes are better than the ones attained by existing approaches, justifying the theoretical contribution of this report. Thus, the fuzzy spherically truncated-based protein descriptors from MuLiMs-MCoMPAs (http://tomocomd.com/mulims-mcompas) are promising alignment-free predictors for modeling protein functions and properties. |
format | Online Article Text |
id | pubmed-9585278 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95852782022-10-22 Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions Contreras-Torres, Ernesto Marrero-Ponce, Yovani Terán, Julio E. Agüero-Chapin, Guillermin Antunes, Agostinho García-Jacas, César R. Front Chem Chemistry This study introduces a set of fuzzy spherically truncated three-dimensional (3D) multi-linear descriptors for proteins. These indices codify geometric structural information from kth spherically truncated spatial-(dis)similarity two-tuple and three-tuple tensors. The coefficients of these truncated tensors are calculated by applying a smoothing value to the 3D structural encoding based on the relationships between two and three amino acids of a protein embedded into a sphere. At considering, the geometrical center of the protein matches with center of the sphere, the distance between each amino acid involved in any specific interaction and the geometrical center of the protein can be computed. Then, the fuzzy membership degree of each amino acid from an spherical region of interest is computed by fuzzy membership functions (FMFs). The truncation value is finally a combination of the membership degrees from interacting amino acids, by applying the arithmetic mean as fusion rule. Several fuzzy membership functions with diverse biases on the calculation of amino acids memberships (e.g., Z-shaped (close to the center), PI-shaped (middle region), and A-Gaussian (far from the center)) were considered as well as traditional truncation functions (e.g., Switching). Such truncation functions were comparatively evaluated by exploring: 1) the frequency of membership degrees, 2) the variability and orthogonality analyses among them based on the Shannon Entropy’s and Principal Component’s methods, respectively, and 3) the prediction performance of alignment-free prediction of protein folding rates and structural classes. These analyses unraveled the singularity of the proposed fuzzy spherically truncated MDs with respect to the classical (non-truncated) ones and respect to the MDs truncated with traditional functions. They also showed an improved prediction power by attaining an external correlation coefficient of 95.82% in the folding rate modelling and an accuracy of 100% in distinguishing structural protein classes. These outcomes are better than the ones attained by existing approaches, justifying the theoretical contribution of this report. Thus, the fuzzy spherically truncated-based protein descriptors from MuLiMs-MCoMPAs (http://tomocomd.com/mulims-mcompas) are promising alignment-free predictors for modeling protein functions and properties. Frontiers Media S.A. 2022-10-07 /pmc/articles/PMC9585278/ /pubmed/36277354 http://dx.doi.org/10.3389/fchem.2022.959143 Text en Copyright © 2022 Contreras-Torres, Marrero-Ponce, Terán, Agüero-Chapin, Antunes and García-Jacas. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Chemistry Contreras-Torres, Ernesto Marrero-Ponce, Yovani Terán, Julio E. Agüero-Chapin, Guillermin Antunes, Agostinho García-Jacas, César R. Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions |
title | Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions |
title_full | Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions |
title_fullStr | Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions |
title_full_unstemmed | Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions |
title_short | Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions |
title_sort | fuzzy spherical truncation-based multi-linear protein descriptors: from their definition to application in structural-related predictions |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585278/ https://www.ncbi.nlm.nih.gov/pubmed/36277354 http://dx.doi.org/10.3389/fchem.2022.959143 |
work_keys_str_mv | AT contrerastorresernesto fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions AT marreroponceyovani fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions AT teranjulioe fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions AT aguerochapinguillermin fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions AT antunesagostinho fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions AT garciajacascesarr fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions |