Cargando…

Structural alignment of protein descriptors – a combinatorial model

BACKGROUND: Structural alignment of proteins is one of the most challenging problems in molecular biology. The tertiary structure of a protein strictly correlates with its function and computationally predicted structures are nowadays a main premise for understanding the latter. However, computation...

Descripción completa

Detalles Bibliográficos
Autores principales: Antczak, Maciej, Kasprzak, Marta, Lukasiak, Piotr, Blazewicz, Jacek
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5027075/
https://www.ncbi.nlm.nih.gov/pubmed/27639380
http://dx.doi.org/10.1186/s12859-016-1237-9
_version_ 1782454179712729088
author Antczak, Maciej
Kasprzak, Marta
Lukasiak, Piotr
Blazewicz, Jacek
author_facet Antczak, Maciej
Kasprzak, Marta
Lukasiak, Piotr
Blazewicz, Jacek
author_sort Antczak, Maciej
collection PubMed
description BACKGROUND: Structural alignment of proteins is one of the most challenging problems in molecular biology. The tertiary structure of a protein strictly correlates with its function and computationally predicted structures are nowadays a main premise for understanding the latter. However, computationally derived 3D models often exhibit deviations from the native structure. A way to confirm a model is a comparison with other structures. The structural alignment of a pair of proteins can be defined with the use of a concept of protein descriptors. The protein descriptors are local substructures of protein molecules, which allow us to divide the original problem into a set of subproblems and, consequently, to propose a more efficient algorithmic solution. In the literature, one can find many applications of the descriptors concept that prove its usefulness for insight into protein 3D structures, but the proposed approaches are presented rather from the biological perspective than from the computational or algorithmic point of view. Efficient algorithms for identification and structural comparison of descriptors can become crucial components of methods for structural quality assessment as well as tertiary structure prediction. RESULTS: In this paper, we propose a new combinatorial model and new polynomial-time algorithms for the structural alignment of descriptors. The model is based on the maximum-size assignment problem, which we define here and prove that it can be solved in polynomial time. We demonstrate suitability of this approach by comparison with an exact backtracking algorithm. Besides a simplification coming from the combinatorial modeling, both on the conceptual and complexity level, we gain with this approach high quality of obtained results, in terms of 3D alignment accuracy and processing efficiency. CONCLUSIONS: All the proposed algorithms were developed and integrated in a computationally efficient tool descs-standalone, which allows the user to identify and structurally compare descriptors of biological molecules, such as proteins and RNAs. Both PDB (Protein Data Bank) and mmCIF (macromolecular Crystallographic Information File) formats are supported. The proposed tool is available as an open source project stored on GitHub (https://github.com/mantczak/descs-standalone). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1237-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5027075
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50270752016-09-22 Structural alignment of protein descriptors – a combinatorial model Antczak, Maciej Kasprzak, Marta Lukasiak, Piotr Blazewicz, Jacek BMC Bioinformatics Methodology Article BACKGROUND: Structural alignment of proteins is one of the most challenging problems in molecular biology. The tertiary structure of a protein strictly correlates with its function and computationally predicted structures are nowadays a main premise for understanding the latter. However, computationally derived 3D models often exhibit deviations from the native structure. A way to confirm a model is a comparison with other structures. The structural alignment of a pair of proteins can be defined with the use of a concept of protein descriptors. The protein descriptors are local substructures of protein molecules, which allow us to divide the original problem into a set of subproblems and, consequently, to propose a more efficient algorithmic solution. In the literature, one can find many applications of the descriptors concept that prove its usefulness for insight into protein 3D structures, but the proposed approaches are presented rather from the biological perspective than from the computational or algorithmic point of view. Efficient algorithms for identification and structural comparison of descriptors can become crucial components of methods for structural quality assessment as well as tertiary structure prediction. RESULTS: In this paper, we propose a new combinatorial model and new polynomial-time algorithms for the structural alignment of descriptors. The model is based on the maximum-size assignment problem, which we define here and prove that it can be solved in polynomial time. We demonstrate suitability of this approach by comparison with an exact backtracking algorithm. Besides a simplification coming from the combinatorial modeling, both on the conceptual and complexity level, we gain with this approach high quality of obtained results, in terms of 3D alignment accuracy and processing efficiency. CONCLUSIONS: All the proposed algorithms were developed and integrated in a computationally efficient tool descs-standalone, which allows the user to identify and structurally compare descriptors of biological molecules, such as proteins and RNAs. Both PDB (Protein Data Bank) and mmCIF (macromolecular Crystallographic Information File) formats are supported. The proposed tool is available as an open source project stored on GitHub (https://github.com/mantczak/descs-standalone). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1237-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-09-17 /pmc/articles/PMC5027075/ /pubmed/27639380 http://dx.doi.org/10.1186/s12859-016-1237-9 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Antczak, Maciej
Kasprzak, Marta
Lukasiak, Piotr
Blazewicz, Jacek
Structural alignment of protein descriptors – a combinatorial model
title Structural alignment of protein descriptors – a combinatorial model
title_full Structural alignment of protein descriptors – a combinatorial model
title_fullStr Structural alignment of protein descriptors – a combinatorial model
title_full_unstemmed Structural alignment of protein descriptors – a combinatorial model
title_short Structural alignment of protein descriptors – a combinatorial model
title_sort structural alignment of protein descriptors – a combinatorial model
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5027075/
https://www.ncbi.nlm.nih.gov/pubmed/27639380
http://dx.doi.org/10.1186/s12859-016-1237-9
work_keys_str_mv AT antczakmaciej structuralalignmentofproteindescriptorsacombinatorialmodel
AT kasprzakmarta structuralalignmentofproteindescriptorsacombinatorialmodel
AT lukasiakpiotr structuralalignmentofproteindescriptorsacombinatorialmodel
AT blazewiczjacek structuralalignmentofproteindescriptorsacombinatorialmodel