Cargando…
Amplitude spectrum distance: measuring the global shape divergence of protein fragments
BACKGROUND: In structural bioinformatics, there is an increasing interest in identifying and understanding the evolution of local protein structures regarded as key structural or functional protein building blocks. A central need is then to compare these, possibly short, fragments by measuring effic...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4535829/ https://www.ncbi.nlm.nih.gov/pubmed/26268224 http://dx.doi.org/10.1186/s12859-015-0693-y |
_version_ | 1782385654342090752 |
---|---|
author | Galiez, Clovis Coste, François |
author_facet | Galiez, Clovis Coste, François |
author_sort | Galiez, Clovis |
collection | PubMed |
description | BACKGROUND: In structural bioinformatics, there is an increasing interest in identifying and understanding the evolution of local protein structures regarded as key structural or functional protein building blocks. A central need is then to compare these, possibly short, fragments by measuring efficiently and accurately their (dis)similarity. Progress towards this goal has given rise to scores enabling to assess the strong similarity of fragments. Yet, there is still a lack of more progressive scores, with meaningful intermediate values, for the comparison, retrieval or clustering of distantly related fragments. RESULTS: We introduce here the Amplitude Spectrum Distance (ASD), a novel way of comparing protein fragments based on the discrete Fourier transform of their C (α) distance matrix. Defined as the distance between their amplitude spectra, ASD can be computed efficiently and provides a parameter-free measure of the global shape dissimilarity of two fragments. ASD inherits from nice theoretical properties, making it tolerant to shifts, insertions, deletions, circular permutations or sequence reversals while satisfying the triangle inequality. The practical interest of ASD with respect to RMSD, RMSD(d), BC and TM scores is illustrated through zinc finger retrieval experiments and concrete structure examples. The benefits of ASD are also illustrated by two additional clustering experiments: domain linkers fragments and complementarity-determining regions of antibodies. CONCLUSIONS: Taking advantage of the Fourier transform to compare fragments at a global shape level, ASD is an objective and progressive measure taking into account the whole fragments. Its practical computation time and its properties make ASD particularly relevant for applications requiring meaningful measures on distantly related protein fragments, such as similar fragments retrieval asking for high recalls as shown in the experiments, or for any application taking also advantage of triangle inequality, such as fragments clustering. ASD program and source code are freely available at: http://www.irisa.fr/dyliss/public/ASD/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0693-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4535829 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45358292015-08-14 Amplitude spectrum distance: measuring the global shape divergence of protein fragments Galiez, Clovis Coste, François BMC Bioinformatics Research Article BACKGROUND: In structural bioinformatics, there is an increasing interest in identifying and understanding the evolution of local protein structures regarded as key structural or functional protein building blocks. A central need is then to compare these, possibly short, fragments by measuring efficiently and accurately their (dis)similarity. Progress towards this goal has given rise to scores enabling to assess the strong similarity of fragments. Yet, there is still a lack of more progressive scores, with meaningful intermediate values, for the comparison, retrieval or clustering of distantly related fragments. RESULTS: We introduce here the Amplitude Spectrum Distance (ASD), a novel way of comparing protein fragments based on the discrete Fourier transform of their C (α) distance matrix. Defined as the distance between their amplitude spectra, ASD can be computed efficiently and provides a parameter-free measure of the global shape dissimilarity of two fragments. ASD inherits from nice theoretical properties, making it tolerant to shifts, insertions, deletions, circular permutations or sequence reversals while satisfying the triangle inequality. The practical interest of ASD with respect to RMSD, RMSD(d), BC and TM scores is illustrated through zinc finger retrieval experiments and concrete structure examples. The benefits of ASD are also illustrated by two additional clustering experiments: domain linkers fragments and complementarity-determining regions of antibodies. CONCLUSIONS: Taking advantage of the Fourier transform to compare fragments at a global shape level, ASD is an objective and progressive measure taking into account the whole fragments. Its practical computation time and its properties make ASD particularly relevant for applications requiring meaningful measures on distantly related protein fragments, such as similar fragments retrieval asking for high recalls as shown in the experiments, or for any application taking also advantage of triangle inequality, such as fragments clustering. ASD program and source code are freely available at: http://www.irisa.fr/dyliss/public/ASD/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0693-y) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-14 /pmc/articles/PMC4535829/ /pubmed/26268224 http://dx.doi.org/10.1186/s12859-015-0693-y Text en © Galiez and Coste. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Galiez, Clovis Coste, François Amplitude spectrum distance: measuring the global shape divergence of protein fragments |
title | Amplitude spectrum distance: measuring the global shape divergence of protein fragments |
title_full | Amplitude spectrum distance: measuring the global shape divergence of protein fragments |
title_fullStr | Amplitude spectrum distance: measuring the global shape divergence of protein fragments |
title_full_unstemmed | Amplitude spectrum distance: measuring the global shape divergence of protein fragments |
title_short | Amplitude spectrum distance: measuring the global shape divergence of protein fragments |
title_sort | amplitude spectrum distance: measuring the global shape divergence of protein fragments |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4535829/ https://www.ncbi.nlm.nih.gov/pubmed/26268224 http://dx.doi.org/10.1186/s12859-015-0693-y |
work_keys_str_mv | AT galiezclovis amplitudespectrumdistancemeasuringtheglobalshapedivergenceofproteinfragments AT costefrancois amplitudespectrumdistancemeasuringtheglobalshapedivergenceofproteinfragments |