Cargando…

AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support

BACKGROUND: Masking of multiple sequence alignment blocks has become a powerful method to enhance the tree-likeness of the underlying data. However, existing masking approaches are insensitive to heterogeneous sequence divergence which can mislead tree reconstructions. We present AliGROOVE, a new me...

Descripción completa

Detalles Bibliográficos
Autores principales: Kück, Patrick, Meid, Sandra A, Groß, Christian, Wägele, Johann W, Misof, Bernhard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4167143/
https://www.ncbi.nlm.nih.gov/pubmed/25176556
http://dx.doi.org/10.1186/1471-2105-15-294
_version_ 1782335374384693248
author Kück, Patrick
Meid, Sandra A
Groß, Christian
Wägele, Johann W
Misof, Bernhard
author_facet Kück, Patrick
Meid, Sandra A
Groß, Christian
Wägele, Johann W
Misof, Bernhard
author_sort Kück, Patrick
collection PubMed
description BACKGROUND: Masking of multiple sequence alignment blocks has become a powerful method to enhance the tree-likeness of the underlying data. However, existing masking approaches are insensitive to heterogeneous sequence divergence which can mislead tree reconstructions. We present AliGROOVE, a new method based on a sliding window and a Monte Carlo resampling approach, that visualizes heterogeneous sequence divergence or alignment ambiguity related to single taxa or subsets of taxa within a multiple sequence alignment and tags suspicious branches on a given tree. RESULTS: We used simulated multiple sequence alignments to show that the extent of alignment ambiguity in pairwise sequence comparison is correlated with the frequency of misplaced taxa in tree reconstructions. The approach implemented in AliGROOVE allows to detect nodes within a tree that are supported despite the absence of phylogenetic signal in the underlying multiple sequence alignment. We show that AliGROOVE equally well detects heterogeneous sequence divergence in a case study based on an empirical data set of mitochondrial DNA sequences of chelicerates. CONCLUSIONS: The AliGROOVE approach has the potential to identify single taxa or subsets of taxa which show predominantly randomized sequence similarity in comparison with other taxa in a multiple sequence alignment. It further allows to evaluate the reliability of node support in a novel way. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-294) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4167143
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41671432014-09-19 AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support Kück, Patrick Meid, Sandra A Groß, Christian Wägele, Johann W Misof, Bernhard BMC Bioinformatics Methodology Article BACKGROUND: Masking of multiple sequence alignment blocks has become a powerful method to enhance the tree-likeness of the underlying data. However, existing masking approaches are insensitive to heterogeneous sequence divergence which can mislead tree reconstructions. We present AliGROOVE, a new method based on a sliding window and a Monte Carlo resampling approach, that visualizes heterogeneous sequence divergence or alignment ambiguity related to single taxa or subsets of taxa within a multiple sequence alignment and tags suspicious branches on a given tree. RESULTS: We used simulated multiple sequence alignments to show that the extent of alignment ambiguity in pairwise sequence comparison is correlated with the frequency of misplaced taxa in tree reconstructions. The approach implemented in AliGROOVE allows to detect nodes within a tree that are supported despite the absence of phylogenetic signal in the underlying multiple sequence alignment. We show that AliGROOVE equally well detects heterogeneous sequence divergence in a case study based on an empirical data set of mitochondrial DNA sequences of chelicerates. CONCLUSIONS: The AliGROOVE approach has the potential to identify single taxa or subsets of taxa which show predominantly randomized sequence similarity in comparison with other taxa in a multiple sequence alignment. It further allows to evaluate the reliability of node support in a novel way. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-294) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-30 /pmc/articles/PMC4167143/ /pubmed/25176556 http://dx.doi.org/10.1186/1471-2105-15-294 Text en © Kück et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Kück, Patrick
Meid, Sandra A
Groß, Christian
Wägele, Johann W
Misof, Bernhard
AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support
title AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support
title_full AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support
title_fullStr AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support
title_full_unstemmed AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support
title_short AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support
title_sort aligroove – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4167143/
https://www.ncbi.nlm.nih.gov/pubmed/25176556
http://dx.doi.org/10.1186/1471-2105-15-294
work_keys_str_mv AT kuckpatrick aligroovevisualizationofheterogeneoussequencedivergencewithinmultiplesequencealignmentsanddetectionofinflatedbranchsupport
AT meidsandraa aligroovevisualizationofheterogeneoussequencedivergencewithinmultiplesequencealignmentsanddetectionofinflatedbranchsupport
AT großchristian aligroovevisualizationofheterogeneoussequencedivergencewithinmultiplesequencealignmentsanddetectionofinflatedbranchsupport
AT wagelejohannw aligroovevisualizationofheterogeneoussequencedivergencewithinmultiplesequencealignmentsanddetectionofinflatedbranchsupport
AT misofbernhard aligroovevisualizationofheterogeneoussequencedivergencewithinmultiplesequencealignmentsanddetectionofinflatedbranchsupport