Cargando…

Amplitude spectrum distance: measuring the global shape divergence of protein fragments

BACKGROUND: In structural bioinformatics, there is an increasing interest in identifying and understanding the evolution of local protein structures regarded as key structural or functional protein building blocks. A central need is then to compare these, possibly short, fragments by measuring effic...

Descripción completa

Detalles Bibliográficos
Autores principales: Galiez, Clovis, Coste, François
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4535829/
https://www.ncbi.nlm.nih.gov/pubmed/26268224
http://dx.doi.org/10.1186/s12859-015-0693-y
_version_ 1782385654342090752
author Galiez, Clovis
Coste, François
author_facet Galiez, Clovis
Coste, François
author_sort Galiez, Clovis
collection PubMed
description BACKGROUND: In structural bioinformatics, there is an increasing interest in identifying and understanding the evolution of local protein structures regarded as key structural or functional protein building blocks. A central need is then to compare these, possibly short, fragments by measuring efficiently and accurately their (dis)similarity. Progress towards this goal has given rise to scores enabling to assess the strong similarity of fragments. Yet, there is still a lack of more progressive scores, with meaningful intermediate values, for the comparison, retrieval or clustering of distantly related fragments. RESULTS: We introduce here the Amplitude Spectrum Distance (ASD), a novel way of comparing protein fragments based on the discrete Fourier transform of their C (α) distance matrix. Defined as the distance between their amplitude spectra, ASD can be computed efficiently and provides a parameter-free measure of the global shape dissimilarity of two fragments. ASD inherits from nice theoretical properties, making it tolerant to shifts, insertions, deletions, circular permutations or sequence reversals while satisfying the triangle inequality. The practical interest of ASD with respect to RMSD, RMSD(d), BC and TM scores is illustrated through zinc finger retrieval experiments and concrete structure examples. The benefits of ASD are also illustrated by two additional clustering experiments: domain linkers fragments and complementarity-determining regions of antibodies. CONCLUSIONS: Taking advantage of the Fourier transform to compare fragments at a global shape level, ASD is an objective and progressive measure taking into account the whole fragments. Its practical computation time and its properties make ASD particularly relevant for applications requiring meaningful measures on distantly related protein fragments, such as similar fragments retrieval asking for high recalls as shown in the experiments, or for any application taking also advantage of triangle inequality, such as fragments clustering. ASD program and source code are freely available at: http://www.irisa.fr/dyliss/public/ASD/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0693-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4535829
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45358292015-08-14 Amplitude spectrum distance: measuring the global shape divergence of protein fragments Galiez, Clovis Coste, François BMC Bioinformatics Research Article BACKGROUND: In structural bioinformatics, there is an increasing interest in identifying and understanding the evolution of local protein structures regarded as key structural or functional protein building blocks. A central need is then to compare these, possibly short, fragments by measuring efficiently and accurately their (dis)similarity. Progress towards this goal has given rise to scores enabling to assess the strong similarity of fragments. Yet, there is still a lack of more progressive scores, with meaningful intermediate values, for the comparison, retrieval or clustering of distantly related fragments. RESULTS: We introduce here the Amplitude Spectrum Distance (ASD), a novel way of comparing protein fragments based on the discrete Fourier transform of their C (α) distance matrix. Defined as the distance between their amplitude spectra, ASD can be computed efficiently and provides a parameter-free measure of the global shape dissimilarity of two fragments. ASD inherits from nice theoretical properties, making it tolerant to shifts, insertions, deletions, circular permutations or sequence reversals while satisfying the triangle inequality. The practical interest of ASD with respect to RMSD, RMSD(d), BC and TM scores is illustrated through zinc finger retrieval experiments and concrete structure examples. The benefits of ASD are also illustrated by two additional clustering experiments: domain linkers fragments and complementarity-determining regions of antibodies. CONCLUSIONS: Taking advantage of the Fourier transform to compare fragments at a global shape level, ASD is an objective and progressive measure taking into account the whole fragments. Its practical computation time and its properties make ASD particularly relevant for applications requiring meaningful measures on distantly related protein fragments, such as similar fragments retrieval asking for high recalls as shown in the experiments, or for any application taking also advantage of triangle inequality, such as fragments clustering. ASD program and source code are freely available at: http://www.irisa.fr/dyliss/public/ASD/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0693-y) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-14 /pmc/articles/PMC4535829/ /pubmed/26268224 http://dx.doi.org/10.1186/s12859-015-0693-y Text en © Galiez and Coste. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Galiez, Clovis
Coste, François
Amplitude spectrum distance: measuring the global shape divergence of protein fragments
title Amplitude spectrum distance: measuring the global shape divergence of protein fragments
title_full Amplitude spectrum distance: measuring the global shape divergence of protein fragments
title_fullStr Amplitude spectrum distance: measuring the global shape divergence of protein fragments
title_full_unstemmed Amplitude spectrum distance: measuring the global shape divergence of protein fragments
title_short Amplitude spectrum distance: measuring the global shape divergence of protein fragments
title_sort amplitude spectrum distance: measuring the global shape divergence of protein fragments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4535829/
https://www.ncbi.nlm.nih.gov/pubmed/26268224
http://dx.doi.org/10.1186/s12859-015-0693-y
work_keys_str_mv AT galiezclovis amplitudespectrumdistancemeasuringtheglobalshapedivergenceofproteinfragments
AT costefrancois amplitudespectrumdistancemeasuringtheglobalshapedivergenceofproteinfragments