Cargando…

Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods

In this report, a new type of tridimensional (3D) biomacro-molecular descriptors for proteins are proposed. These descriptors make use of multi-linear algebra concepts based on the application of 3-linear forms (i.e., Canonical Trilinear (Tr), Trilinear Cubic (TrC), Trilinear-Quadratic-Bilinear (TrQ...

Descripción completa

Detalles Bibliográficos
Autores principales: Terán, Julio E., Marrero-Ponce, Yovani, Contreras-Torres, Ernesto, García-Jacas, César R., Vivas-Reyes, Ricardo, Terán, Enrique, Torres, F. Javier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6684663/
https://www.ncbi.nlm.nih.gov/pubmed/31388082
http://dx.doi.org/10.1038/s41598-019-47858-2
_version_ 1783442292514750464
author Terán, Julio E.
Marrero-Ponce, Yovani
Contreras-Torres, Ernesto
García-Jacas, César R.
Vivas-Reyes, Ricardo
Terán, Enrique
Torres, F. Javier
author_facet Terán, Julio E.
Marrero-Ponce, Yovani
Contreras-Torres, Ernesto
García-Jacas, César R.
Vivas-Reyes, Ricardo
Terán, Enrique
Torres, F. Javier
author_sort Terán, Julio E.
collection PubMed
description In this report, a new type of tridimensional (3D) biomacro-molecular descriptors for proteins are proposed. These descriptors make use of multi-linear algebra concepts based on the application of 3-linear forms (i.e., Canonical Trilinear (Tr), Trilinear Cubic (TrC), Trilinear-Quadratic-Bilinear (TrQB) and so on) as a specific case of the N-linear algebraic forms. The definition of the k(th) 3-tuple similarity-dissimilarity spatial matrices (Tensor’s Form) are used for the transformation and for the representation of the existing chemical information available in the relationships between three amino acids of a protein. Several metrics (Minkowski-type, wave-edge, etc) and multi-metrics (Triangle area, Bond-angle, etc) are proposed for the interaction information extraction, as well as probabilistic transformations (e.g., simple stochastic and mutual probability) to achieve matrix normalization. A generalized procedure considering amino acid level-based indices that can be fused together by using aggregator operators for descriptors calculations is proposed. The obtained results demonstrated that the new proposed 3D biomacro-molecular indices perform better than other approaches in the SCOP-based discrimination and the prediction of folding rate of proteins by using simple linear parametrical models. It can be concluded that the proposed method allows the definition of 3D biomacro-molecular descriptors that contain orthogonal information capable of providing better models for applications in protein science.
format Online
Article
Text
id pubmed-6684663
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-66846632019-08-11 Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods Terán, Julio E. Marrero-Ponce, Yovani Contreras-Torres, Ernesto García-Jacas, César R. Vivas-Reyes, Ricardo Terán, Enrique Torres, F. Javier Sci Rep Article In this report, a new type of tridimensional (3D) biomacro-molecular descriptors for proteins are proposed. These descriptors make use of multi-linear algebra concepts based on the application of 3-linear forms (i.e., Canonical Trilinear (Tr), Trilinear Cubic (TrC), Trilinear-Quadratic-Bilinear (TrQB) and so on) as a specific case of the N-linear algebraic forms. The definition of the k(th) 3-tuple similarity-dissimilarity spatial matrices (Tensor’s Form) are used for the transformation and for the representation of the existing chemical information available in the relationships between three amino acids of a protein. Several metrics (Minkowski-type, wave-edge, etc) and multi-metrics (Triangle area, Bond-angle, etc) are proposed for the interaction information extraction, as well as probabilistic transformations (e.g., simple stochastic and mutual probability) to achieve matrix normalization. A generalized procedure considering amino acid level-based indices that can be fused together by using aggregator operators for descriptors calculations is proposed. The obtained results demonstrated that the new proposed 3D biomacro-molecular indices perform better than other approaches in the SCOP-based discrimination and the prediction of folding rate of proteins by using simple linear parametrical models. It can be concluded that the proposed method allows the definition of 3D biomacro-molecular descriptors that contain orthogonal information capable of providing better models for applications in protein science. Nature Publishing Group UK 2019-08-06 /pmc/articles/PMC6684663/ /pubmed/31388082 http://dx.doi.org/10.1038/s41598-019-47858-2 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Terán, Julio E.
Marrero-Ponce, Yovani
Contreras-Torres, Ernesto
García-Jacas, César R.
Vivas-Reyes, Ricardo
Terán, Enrique
Torres, F. Javier
Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods
title Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods
title_full Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods
title_fullStr Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods
title_full_unstemmed Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods
title_short Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods
title_sort tensor algebra-based geometrical (3d) biomacro-molecular descriptors for protein research: theory, applications and comparison with other methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6684663/
https://www.ncbi.nlm.nih.gov/pubmed/31388082
http://dx.doi.org/10.1038/s41598-019-47858-2
work_keys_str_mv AT teranjulioe tensoralgebrabasedgeometrical3dbiomacromoleculardescriptorsforproteinresearchtheoryapplicationsandcomparisonwithothermethods
AT marreroponceyovani tensoralgebrabasedgeometrical3dbiomacromoleculardescriptorsforproteinresearchtheoryapplicationsandcomparisonwithothermethods
AT contrerastorresernesto tensoralgebrabasedgeometrical3dbiomacromoleculardescriptorsforproteinresearchtheoryapplicationsandcomparisonwithothermethods
AT garciajacascesarr tensoralgebrabasedgeometrical3dbiomacromoleculardescriptorsforproteinresearchtheoryapplicationsandcomparisonwithothermethods
AT vivasreyesricardo tensoralgebrabasedgeometrical3dbiomacromoleculardescriptorsforproteinresearchtheoryapplicationsandcomparisonwithothermethods
AT teranenrique tensoralgebrabasedgeometrical3dbiomacromoleculardescriptorsforproteinresearchtheoryapplicationsandcomparisonwithothermethods
AT torresfjavier tensoralgebrabasedgeometrical3dbiomacromoleculardescriptorsforproteinresearchtheoryapplicationsandcomparisonwithothermethods