Cargando…

Island method for estimating the statistical significance of profile-profile alignment scores

BACKGROUND: In the last decade, a significant improvement in detecting remote similarity between protein sequences has been made by utilizing alignment profiles in place of amino-acid strings. Unfortunately, no analytical theory is available for estimating the significance of a gapped alignment of t...

Descripción completa

Detalles Bibliográficos
Autor principal: Poleksic, Aleksandar
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2678096/
https://www.ncbi.nlm.nih.gov/pubmed/19379500
http://dx.doi.org/10.1186/1471-2105-10-112
_version_ 1782166816685031424
author Poleksic, Aleksandar
author_facet Poleksic, Aleksandar
author_sort Poleksic, Aleksandar
collection PubMed
description BACKGROUND: In the last decade, a significant improvement in detecting remote similarity between protein sequences has been made by utilizing alignment profiles in place of amino-acid strings. Unfortunately, no analytical theory is available for estimating the significance of a gapped alignment of two profiles. Many experiments suggest that the distribution of local profile-profile alignment scores is of the Gumbel form. However, estimating distribution parameters by random simulations turns out to be computationally very expensive. RESULTS: We demonstrate that the background distribution of profile-profile alignment scores heavily depends on profiles' composition and thus the distribution parameters must be estimated independently, for each pair of profiles of interest. We also show that accurate estimates of statistical parameters can be obtained using the "island statistics" for profile-profile alignments. CONCLUSION: The island statistics can be generalized to profile-profile alignments to provide an efficient method for the alignment score normalization. Since multiple island scores can be extracted from a single comparison of two profiles, the island method has a clear speed advantage over the direct shuffling method for comparable accuracy in parameter estimates.
format Text
id pubmed-2678096
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26780962009-05-07 Island method for estimating the statistical significance of profile-profile alignment scores Poleksic, Aleksandar BMC Bioinformatics Methodology Article BACKGROUND: In the last decade, a significant improvement in detecting remote similarity between protein sequences has been made by utilizing alignment profiles in place of amino-acid strings. Unfortunately, no analytical theory is available for estimating the significance of a gapped alignment of two profiles. Many experiments suggest that the distribution of local profile-profile alignment scores is of the Gumbel form. However, estimating distribution parameters by random simulations turns out to be computationally very expensive. RESULTS: We demonstrate that the background distribution of profile-profile alignment scores heavily depends on profiles' composition and thus the distribution parameters must be estimated independently, for each pair of profiles of interest. We also show that accurate estimates of statistical parameters can be obtained using the "island statistics" for profile-profile alignments. CONCLUSION: The island statistics can be generalized to profile-profile alignments to provide an efficient method for the alignment score normalization. Since multiple island scores can be extracted from a single comparison of two profiles, the island method has a clear speed advantage over the direct shuffling method for comparable accuracy in parameter estimates. BioMed Central 2009-04-20 /pmc/articles/PMC2678096/ /pubmed/19379500 http://dx.doi.org/10.1186/1471-2105-10-112 Text en Copyright © 2009 Poleksic; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Poleksic, Aleksandar
Island method for estimating the statistical significance of profile-profile alignment scores
title Island method for estimating the statistical significance of profile-profile alignment scores
title_full Island method for estimating the statistical significance of profile-profile alignment scores
title_fullStr Island method for estimating the statistical significance of profile-profile alignment scores
title_full_unstemmed Island method for estimating the statistical significance of profile-profile alignment scores
title_short Island method for estimating the statistical significance of profile-profile alignment scores
title_sort island method for estimating the statistical significance of profile-profile alignment scores
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2678096/
https://www.ncbi.nlm.nih.gov/pubmed/19379500
http://dx.doi.org/10.1186/1471-2105-10-112
work_keys_str_mv AT poleksicaleksandar islandmethodforestimatingthestatisticalsignificanceofprofileprofilealignmentscores