Cargando…

Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms

BACKGROUND: Phylogenetic analyses of angiosperm relationships have used only a small percentage of available sequence data, but phylogenetic data matrices often can be augmented with existing data, especially if one allows missing characters. We explore the effects on phylogenetic analyses of adding...

Descripción completa

Detalles Bibliográficos
Autores principales: Burleigh, J Gordon, Hilu, Khidir W, Soltis, Douglas E
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674047/
https://www.ncbi.nlm.nih.gov/pubmed/19292928
http://dx.doi.org/10.1186/1471-2148-9-61
_version_ 1782166613108195328
author Burleigh, J Gordon
Hilu, Khidir W
Soltis, Douglas E
author_facet Burleigh, J Gordon
Hilu, Khidir W
Soltis, Douglas E
author_sort Burleigh, J Gordon
collection PubMed
description BACKGROUND: Phylogenetic analyses of angiosperm relationships have used only a small percentage of available sequence data, but phylogenetic data matrices often can be augmented with existing data, especially if one allows missing characters. We explore the effects on phylogenetic analyses of adding 378 matK sequences and 240 26S rDNA sequences to the complete 3-gene, 567-taxon angiosperm phylogenetic matrix of Soltis et al. RESULTS: We performed maximum likelihood bootstrap analyses of the complete, 3-gene 567-taxon data matrix and the incomplete, 5-gene 567-taxon data matrix. Although the 5-gene matrix has more missing data (27.5%) than the 3-gene data matrix (2.9%), the 5-gene analysis resulted in higher levels of bootstrap support. Within the 567-taxon tree, the increase in support is most evident for relationships among the 170 taxa for which both matK and 26S rDNA sequences were added, and there is little gain in support for relationships among the 119 taxa having neither matK nor 26S rDNA sequences. The 5-gene analysis also places the enigmatic Hydrostachys in Lamiales (BS = 97%) rather than in Cornales (BS = 100% in 3-gene analysis). The placement of Hydrostachys in Lamiales is unprecedented in molecular analyses, but it is consistent with embryological and morphological data. CONCLUSION: Adding available, and often incomplete, sets of sequences to existing data sets can be a fast and inexpensive way to increase support for phylogenetic relationships and produce novel and credible new phylogenetic hypotheses.
format Text
id pubmed-2674047
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26740472009-04-28 Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms Burleigh, J Gordon Hilu, Khidir W Soltis, Douglas E BMC Evol Biol Research Article BACKGROUND: Phylogenetic analyses of angiosperm relationships have used only a small percentage of available sequence data, but phylogenetic data matrices often can be augmented with existing data, especially if one allows missing characters. We explore the effects on phylogenetic analyses of adding 378 matK sequences and 240 26S rDNA sequences to the complete 3-gene, 567-taxon angiosperm phylogenetic matrix of Soltis et al. RESULTS: We performed maximum likelihood bootstrap analyses of the complete, 3-gene 567-taxon data matrix and the incomplete, 5-gene 567-taxon data matrix. Although the 5-gene matrix has more missing data (27.5%) than the 3-gene data matrix (2.9%), the 5-gene analysis resulted in higher levels of bootstrap support. Within the 567-taxon tree, the increase in support is most evident for relationships among the 170 taxa for which both matK and 26S rDNA sequences were added, and there is little gain in support for relationships among the 119 taxa having neither matK nor 26S rDNA sequences. The 5-gene analysis also places the enigmatic Hydrostachys in Lamiales (BS = 97%) rather than in Cornales (BS = 100% in 3-gene analysis). The placement of Hydrostachys in Lamiales is unprecedented in molecular analyses, but it is consistent with embryological and morphological data. CONCLUSION: Adding available, and often incomplete, sets of sequences to existing data sets can be a fast and inexpensive way to increase support for phylogenetic relationships and produce novel and credible new phylogenetic hypotheses. BioMed Central 2009-03-17 /pmc/articles/PMC2674047/ /pubmed/19292928 http://dx.doi.org/10.1186/1471-2148-9-61 Text en Copyright © 2009 Burleigh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Burleigh, J Gordon
Hilu, Khidir W
Soltis, Douglas E
Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms
title Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms
title_full Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms
title_fullStr Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms
title_full_unstemmed Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms
title_short Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms
title_sort inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674047/
https://www.ncbi.nlm.nih.gov/pubmed/19292928
http://dx.doi.org/10.1186/1471-2148-9-61
work_keys_str_mv AT burleighjgordon inferringphylogenieswithincompletedatasetsa5gene567taxonanalysisofangiosperms
AT hilukhidirw inferringphylogenieswithincompletedatasetsa5gene567taxonanalysisofangiosperms
AT soltisdouglase inferringphylogenieswithincompletedatasetsa5gene567taxonanalysisofangiosperms