Cargando…

Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions

This study introduces a set of fuzzy spherically truncated three-dimensional (3D) multi-linear descriptors for proteins. These indices codify geometric structural information from kth spherically truncated spatial-(dis)similarity two-tuple and three-tuple tensors. The coefficients of these truncated...

Descripción completa

Detalles Bibliográficos
Autores principales: Contreras-Torres, Ernesto, Marrero-Ponce, Yovani, Terán, Julio E., Agüero-Chapin, Guillermin, Antunes, Agostinho, García-Jacas, César R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585278/
https://www.ncbi.nlm.nih.gov/pubmed/36277354
http://dx.doi.org/10.3389/fchem.2022.959143
_version_ 1784813457300783104
author Contreras-Torres, Ernesto
Marrero-Ponce, Yovani
Terán, Julio E.
Agüero-Chapin, Guillermin
Antunes, Agostinho
García-Jacas, César R.
author_facet Contreras-Torres, Ernesto
Marrero-Ponce, Yovani
Terán, Julio E.
Agüero-Chapin, Guillermin
Antunes, Agostinho
García-Jacas, César R.
author_sort Contreras-Torres, Ernesto
collection PubMed
description This study introduces a set of fuzzy spherically truncated three-dimensional (3D) multi-linear descriptors for proteins. These indices codify geometric structural information from kth spherically truncated spatial-(dis)similarity two-tuple and three-tuple tensors. The coefficients of these truncated tensors are calculated by applying a smoothing value to the 3D structural encoding based on the relationships between two and three amino acids of a protein embedded into a sphere. At considering, the geometrical center of the protein matches with center of the sphere, the distance between each amino acid involved in any specific interaction and the geometrical center of the protein can be computed. Then, the fuzzy membership degree of each amino acid from an spherical region of interest is computed by fuzzy membership functions (FMFs). The truncation value is finally a combination of the membership degrees from interacting amino acids, by applying the arithmetic mean as fusion rule. Several fuzzy membership functions with diverse biases on the calculation of amino acids memberships (e.g., Z-shaped (close to the center), PI-shaped (middle region), and A-Gaussian (far from the center)) were considered as well as traditional truncation functions (e.g., Switching). Such truncation functions were comparatively evaluated by exploring: 1) the frequency of membership degrees, 2) the variability and orthogonality analyses among them based on the Shannon Entropy’s and Principal Component’s methods, respectively, and 3) the prediction performance of alignment-free prediction of protein folding rates and structural classes. These analyses unraveled the singularity of the proposed fuzzy spherically truncated MDs with respect to the classical (non-truncated) ones and respect to the MDs truncated with traditional functions. They also showed an improved prediction power by attaining an external correlation coefficient of 95.82% in the folding rate modelling and an accuracy of 100% in distinguishing structural protein classes. These outcomes are better than the ones attained by existing approaches, justifying the theoretical contribution of this report. Thus, the fuzzy spherically truncated-based protein descriptors from MuLiMs-MCoMPAs (http://tomocomd.com/mulims-mcompas) are promising alignment-free predictors for modeling protein functions and properties.
format Online
Article
Text
id pubmed-9585278
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95852782022-10-22 Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions Contreras-Torres, Ernesto Marrero-Ponce, Yovani Terán, Julio E. Agüero-Chapin, Guillermin Antunes, Agostinho García-Jacas, César R. Front Chem Chemistry This study introduces a set of fuzzy spherically truncated three-dimensional (3D) multi-linear descriptors for proteins. These indices codify geometric structural information from kth spherically truncated spatial-(dis)similarity two-tuple and three-tuple tensors. The coefficients of these truncated tensors are calculated by applying a smoothing value to the 3D structural encoding based on the relationships between two and three amino acids of a protein embedded into a sphere. At considering, the geometrical center of the protein matches with center of the sphere, the distance between each amino acid involved in any specific interaction and the geometrical center of the protein can be computed. Then, the fuzzy membership degree of each amino acid from an spherical region of interest is computed by fuzzy membership functions (FMFs). The truncation value is finally a combination of the membership degrees from interacting amino acids, by applying the arithmetic mean as fusion rule. Several fuzzy membership functions with diverse biases on the calculation of amino acids memberships (e.g., Z-shaped (close to the center), PI-shaped (middle region), and A-Gaussian (far from the center)) were considered as well as traditional truncation functions (e.g., Switching). Such truncation functions were comparatively evaluated by exploring: 1) the frequency of membership degrees, 2) the variability and orthogonality analyses among them based on the Shannon Entropy’s and Principal Component’s methods, respectively, and 3) the prediction performance of alignment-free prediction of protein folding rates and structural classes. These analyses unraveled the singularity of the proposed fuzzy spherically truncated MDs with respect to the classical (non-truncated) ones and respect to the MDs truncated with traditional functions. They also showed an improved prediction power by attaining an external correlation coefficient of 95.82% in the folding rate modelling and an accuracy of 100% in distinguishing structural protein classes. These outcomes are better than the ones attained by existing approaches, justifying the theoretical contribution of this report. Thus, the fuzzy spherically truncated-based protein descriptors from MuLiMs-MCoMPAs (http://tomocomd.com/mulims-mcompas) are promising alignment-free predictors for modeling protein functions and properties. Frontiers Media S.A. 2022-10-07 /pmc/articles/PMC9585278/ /pubmed/36277354 http://dx.doi.org/10.3389/fchem.2022.959143 Text en Copyright © 2022 Contreras-Torres, Marrero-Ponce, Terán, Agüero-Chapin, Antunes and García-Jacas. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Chemistry
Contreras-Torres, Ernesto
Marrero-Ponce, Yovani
Terán, Julio E.
Agüero-Chapin, Guillermin
Antunes, Agostinho
García-Jacas, César R.
Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions
title Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions
title_full Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions
title_fullStr Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions
title_full_unstemmed Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions
title_short Fuzzy spherical truncation-based multi-linear protein descriptors: From their definition to application in structural-related predictions
title_sort fuzzy spherical truncation-based multi-linear protein descriptors: from their definition to application in structural-related predictions
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585278/
https://www.ncbi.nlm.nih.gov/pubmed/36277354
http://dx.doi.org/10.3389/fchem.2022.959143
work_keys_str_mv AT contrerastorresernesto fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions
AT marreroponceyovani fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions
AT teranjulioe fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions
AT aguerochapinguillermin fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions
AT antunesagostinho fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions
AT garciajacascesarr fuzzysphericaltruncationbasedmultilinearproteindescriptorsfromtheirdefinitiontoapplicationinstructuralrelatedpredictions