Cargando…
AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support
BACKGROUND: Masking of multiple sequence alignment blocks has become a powerful method to enhance the tree-likeness of the underlying data. However, existing masking approaches are insensitive to heterogeneous sequence divergence which can mislead tree reconstructions. We present AliGROOVE, a new me...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4167143/ https://www.ncbi.nlm.nih.gov/pubmed/25176556 http://dx.doi.org/10.1186/1471-2105-15-294 |
_version_ | 1782335374384693248 |
---|---|
author | Kück, Patrick Meid, Sandra A Groß, Christian Wägele, Johann W Misof, Bernhard |
author_facet | Kück, Patrick Meid, Sandra A Groß, Christian Wägele, Johann W Misof, Bernhard |
author_sort | Kück, Patrick |
collection | PubMed |
description | BACKGROUND: Masking of multiple sequence alignment blocks has become a powerful method to enhance the tree-likeness of the underlying data. However, existing masking approaches are insensitive to heterogeneous sequence divergence which can mislead tree reconstructions. We present AliGROOVE, a new method based on a sliding window and a Monte Carlo resampling approach, that visualizes heterogeneous sequence divergence or alignment ambiguity related to single taxa or subsets of taxa within a multiple sequence alignment and tags suspicious branches on a given tree. RESULTS: We used simulated multiple sequence alignments to show that the extent of alignment ambiguity in pairwise sequence comparison is correlated with the frequency of misplaced taxa in tree reconstructions. The approach implemented in AliGROOVE allows to detect nodes within a tree that are supported despite the absence of phylogenetic signal in the underlying multiple sequence alignment. We show that AliGROOVE equally well detects heterogeneous sequence divergence in a case study based on an empirical data set of mitochondrial DNA sequences of chelicerates. CONCLUSIONS: The AliGROOVE approach has the potential to identify single taxa or subsets of taxa which show predominantly randomized sequence similarity in comparison with other taxa in a multiple sequence alignment. It further allows to evaluate the reliability of node support in a novel way. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-294) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4167143 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41671432014-09-19 AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support Kück, Patrick Meid, Sandra A Groß, Christian Wägele, Johann W Misof, Bernhard BMC Bioinformatics Methodology Article BACKGROUND: Masking of multiple sequence alignment blocks has become a powerful method to enhance the tree-likeness of the underlying data. However, existing masking approaches are insensitive to heterogeneous sequence divergence which can mislead tree reconstructions. We present AliGROOVE, a new method based on a sliding window and a Monte Carlo resampling approach, that visualizes heterogeneous sequence divergence or alignment ambiguity related to single taxa or subsets of taxa within a multiple sequence alignment and tags suspicious branches on a given tree. RESULTS: We used simulated multiple sequence alignments to show that the extent of alignment ambiguity in pairwise sequence comparison is correlated with the frequency of misplaced taxa in tree reconstructions. The approach implemented in AliGROOVE allows to detect nodes within a tree that are supported despite the absence of phylogenetic signal in the underlying multiple sequence alignment. We show that AliGROOVE equally well detects heterogeneous sequence divergence in a case study based on an empirical data set of mitochondrial DNA sequences of chelicerates. CONCLUSIONS: The AliGROOVE approach has the potential to identify single taxa or subsets of taxa which show predominantly randomized sequence similarity in comparison with other taxa in a multiple sequence alignment. It further allows to evaluate the reliability of node support in a novel way. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-294) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-30 /pmc/articles/PMC4167143/ /pubmed/25176556 http://dx.doi.org/10.1186/1471-2105-15-294 Text en © Kück et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Kück, Patrick Meid, Sandra A Groß, Christian Wägele, Johann W Misof, Bernhard AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support |
title | AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support |
title_full | AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support |
title_fullStr | AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support |
title_full_unstemmed | AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support |
title_short | AliGROOVE – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support |
title_sort | aligroove – visualization of heterogeneous sequence divergence within multiple sequence alignments and detection of inflated branch support |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4167143/ https://www.ncbi.nlm.nih.gov/pubmed/25176556 http://dx.doi.org/10.1186/1471-2105-15-294 |
work_keys_str_mv | AT kuckpatrick aligroovevisualizationofheterogeneoussequencedivergencewithinmultiplesequencealignmentsanddetectionofinflatedbranchsupport AT meidsandraa aligroovevisualizationofheterogeneoussequencedivergencewithinmultiplesequencealignmentsanddetectionofinflatedbranchsupport AT großchristian aligroovevisualizationofheterogeneoussequencedivergencewithinmultiplesequencealignmentsanddetectionofinflatedbranchsupport AT wagelejohannw aligroovevisualizationofheterogeneoussequencedivergencewithinmultiplesequencealignmentsanddetectionofinflatedbranchsupport AT misofbernhard aligroovevisualizationofheterogeneoussequencedivergencewithinmultiplesequencealignmentsanddetectionofinflatedbranchsupport |