Cargando…
Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms
BACKGROUND: Phylogenetic analyses of angiosperm relationships have used only a small percentage of available sequence data, but phylogenetic data matrices often can be augmented with existing data, especially if one allows missing characters. We explore the effects on phylogenetic analyses of adding...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674047/ https://www.ncbi.nlm.nih.gov/pubmed/19292928 http://dx.doi.org/10.1186/1471-2148-9-61 |
_version_ | 1782166613108195328 |
---|---|
author | Burleigh, J Gordon Hilu, Khidir W Soltis, Douglas E |
author_facet | Burleigh, J Gordon Hilu, Khidir W Soltis, Douglas E |
author_sort | Burleigh, J Gordon |
collection | PubMed |
description | BACKGROUND: Phylogenetic analyses of angiosperm relationships have used only a small percentage of available sequence data, but phylogenetic data matrices often can be augmented with existing data, especially if one allows missing characters. We explore the effects on phylogenetic analyses of adding 378 matK sequences and 240 26S rDNA sequences to the complete 3-gene, 567-taxon angiosperm phylogenetic matrix of Soltis et al. RESULTS: We performed maximum likelihood bootstrap analyses of the complete, 3-gene 567-taxon data matrix and the incomplete, 5-gene 567-taxon data matrix. Although the 5-gene matrix has more missing data (27.5%) than the 3-gene data matrix (2.9%), the 5-gene analysis resulted in higher levels of bootstrap support. Within the 567-taxon tree, the increase in support is most evident for relationships among the 170 taxa for which both matK and 26S rDNA sequences were added, and there is little gain in support for relationships among the 119 taxa having neither matK nor 26S rDNA sequences. The 5-gene analysis also places the enigmatic Hydrostachys in Lamiales (BS = 97%) rather than in Cornales (BS = 100% in 3-gene analysis). The placement of Hydrostachys in Lamiales is unprecedented in molecular analyses, but it is consistent with embryological and morphological data. CONCLUSION: Adding available, and often incomplete, sets of sequences to existing data sets can be a fast and inexpensive way to increase support for phylogenetic relationships and produce novel and credible new phylogenetic hypotheses. |
format | Text |
id | pubmed-2674047 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26740472009-04-28 Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms Burleigh, J Gordon Hilu, Khidir W Soltis, Douglas E BMC Evol Biol Research Article BACKGROUND: Phylogenetic analyses of angiosperm relationships have used only a small percentage of available sequence data, but phylogenetic data matrices often can be augmented with existing data, especially if one allows missing characters. We explore the effects on phylogenetic analyses of adding 378 matK sequences and 240 26S rDNA sequences to the complete 3-gene, 567-taxon angiosperm phylogenetic matrix of Soltis et al. RESULTS: We performed maximum likelihood bootstrap analyses of the complete, 3-gene 567-taxon data matrix and the incomplete, 5-gene 567-taxon data matrix. Although the 5-gene matrix has more missing data (27.5%) than the 3-gene data matrix (2.9%), the 5-gene analysis resulted in higher levels of bootstrap support. Within the 567-taxon tree, the increase in support is most evident for relationships among the 170 taxa for which both matK and 26S rDNA sequences were added, and there is little gain in support for relationships among the 119 taxa having neither matK nor 26S rDNA sequences. The 5-gene analysis also places the enigmatic Hydrostachys in Lamiales (BS = 97%) rather than in Cornales (BS = 100% in 3-gene analysis). The placement of Hydrostachys in Lamiales is unprecedented in molecular analyses, but it is consistent with embryological and morphological data. CONCLUSION: Adding available, and often incomplete, sets of sequences to existing data sets can be a fast and inexpensive way to increase support for phylogenetic relationships and produce novel and credible new phylogenetic hypotheses. BioMed Central 2009-03-17 /pmc/articles/PMC2674047/ /pubmed/19292928 http://dx.doi.org/10.1186/1471-2148-9-61 Text en Copyright © 2009 Burleigh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Burleigh, J Gordon Hilu, Khidir W Soltis, Douglas E Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms |
title | Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms |
title_full | Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms |
title_fullStr | Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms |
title_full_unstemmed | Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms |
title_short | Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms |
title_sort | inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674047/ https://www.ncbi.nlm.nih.gov/pubmed/19292928 http://dx.doi.org/10.1186/1471-2148-9-61 |
work_keys_str_mv | AT burleighjgordon inferringphylogenieswithincompletedatasetsa5gene567taxonanalysisofangiosperms AT hilukhidirw inferringphylogenieswithincompletedatasetsa5gene567taxonanalysisofangiosperms AT soltisdouglase inferringphylogenieswithincompletedatasetsa5gene567taxonanalysisofangiosperms |