Cargando…

ESTimating plant phylogeny: lessons from partitioning

BACKGROUND: While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems whe...

Descripción completa

Detalles Bibliográficos
Autores principales: de la Torre, Jose EB, Egan, Mary G, Katari, Manpreet S, Brenner, Eric D, Stevenson, Dennis W, Coruzzi, Gloria M, DeSalle, Rob
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1564041/
https://www.ncbi.nlm.nih.gov/pubmed/16776834
http://dx.doi.org/10.1186/1471-2148-6-48
_version_ 1782129539907846144
author de la Torre, Jose EB
Egan, Mary G
Katari, Manpreet S
Brenner, Eric D
Stevenson, Dennis W
Coruzzi, Gloria M
DeSalle, Rob
author_facet de la Torre, Jose EB
Egan, Mary G
Katari, Manpreet S
Brenner, Eric D
Stevenson, Dennis W
Coruzzi, Gloria M
DeSalle, Rob
author_sort de la Torre, Jose EB
collection PubMed
description BACKGROUND: While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems when using ESTs as a source of characters for systematics. Here we develop methods to incorporate EST sequence information in a simultaneous analysis framework to address controversial phylogenetic questions regarding the relationships among the major groups of seed plants. We use an automated, phylogenetically derived approach to orthology determination called OrthologID generate a phylogeny based on 43 process partitions, many of which are derived from ESTs, and examine several measures of support to assess the utility of EST data for phylogenies. RESULTS: A maximum parsimony (MP) analysis resulted in a single tree with relatively high support at all nodes in the tree despite rampant conflict among trees generated from the separate analysis of individual partitions. In a comparison of broader-scale groupings based on cellular compartment (ie: chloroplast, mitochondrial or nuclear) or function, only the nuclear partition tree (based largely on EST data) was found to be topologically identical to the tree based on the simultaneous analysis of all data. Despite topological conflict among the broader-scale groupings examined, only the tree based on morphological data showed statistically significant differences. CONCLUSION: Based on the amount of character support contributed by EST data which make up a majority of the nuclear data set, and the lack of conflict of the nuclear data set with the simultaneous analysis tree, we conclude that the inclusion of EST data does provide a viable and efficient approach to address phylogenetic questions within a parsimony framework on a genomic scale, if problems of orthology determination and potential sequencing errors can be overcome. In addition, approaches that examine conflict and support in a simultaneous analysis framework allow for a more precise understanding of the evolutionary history of individual process partitions and may be a novel way to understand functional aspects of different kinds of cellular classes of gene products.
format Text
id pubmed-1564041
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15640412006-09-12 ESTimating plant phylogeny: lessons from partitioning de la Torre, Jose EB Egan, Mary G Katari, Manpreet S Brenner, Eric D Stevenson, Dennis W Coruzzi, Gloria M DeSalle, Rob BMC Evol Biol Research Article BACKGROUND: While Expressed Sequence Tags (ESTs) have proven a viable and efficient way to sample genomes, particularly those for which whole-genome sequencing is impractical, phylogenetic analysis using ESTs remains difficult. Sequencing errors and orthology determination are the major problems when using ESTs as a source of characters for systematics. Here we develop methods to incorporate EST sequence information in a simultaneous analysis framework to address controversial phylogenetic questions regarding the relationships among the major groups of seed plants. We use an automated, phylogenetically derived approach to orthology determination called OrthologID generate a phylogeny based on 43 process partitions, many of which are derived from ESTs, and examine several measures of support to assess the utility of EST data for phylogenies. RESULTS: A maximum parsimony (MP) analysis resulted in a single tree with relatively high support at all nodes in the tree despite rampant conflict among trees generated from the separate analysis of individual partitions. In a comparison of broader-scale groupings based on cellular compartment (ie: chloroplast, mitochondrial or nuclear) or function, only the nuclear partition tree (based largely on EST data) was found to be topologically identical to the tree based on the simultaneous analysis of all data. Despite topological conflict among the broader-scale groupings examined, only the tree based on morphological data showed statistically significant differences. CONCLUSION: Based on the amount of character support contributed by EST data which make up a majority of the nuclear data set, and the lack of conflict of the nuclear data set with the simultaneous analysis tree, we conclude that the inclusion of EST data does provide a viable and efficient approach to address phylogenetic questions within a parsimony framework on a genomic scale, if problems of orthology determination and potential sequencing errors can be overcome. In addition, approaches that examine conflict and support in a simultaneous analysis framework allow for a more precise understanding of the evolutionary history of individual process partitions and may be a novel way to understand functional aspects of different kinds of cellular classes of gene products. BioMed Central 2006-06-15 /pmc/articles/PMC1564041/ /pubmed/16776834 http://dx.doi.org/10.1186/1471-2148-6-48 Text en Copyright © 2006 de la Torre et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
de la Torre, Jose EB
Egan, Mary G
Katari, Manpreet S
Brenner, Eric D
Stevenson, Dennis W
Coruzzi, Gloria M
DeSalle, Rob
ESTimating plant phylogeny: lessons from partitioning
title ESTimating plant phylogeny: lessons from partitioning
title_full ESTimating plant phylogeny: lessons from partitioning
title_fullStr ESTimating plant phylogeny: lessons from partitioning
title_full_unstemmed ESTimating plant phylogeny: lessons from partitioning
title_short ESTimating plant phylogeny: lessons from partitioning
title_sort estimating plant phylogeny: lessons from partitioning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1564041/
https://www.ncbi.nlm.nih.gov/pubmed/16776834
http://dx.doi.org/10.1186/1471-2148-6-48
work_keys_str_mv AT delatorrejoseeb estimatingplantphylogenylessonsfrompartitioning
AT eganmaryg estimatingplantphylogenylessonsfrompartitioning
AT katarimanpreets estimatingplantphylogenylessonsfrompartitioning
AT brennerericd estimatingplantphylogenylessonsfrompartitioning
AT stevensondennisw estimatingplantphylogenylessonsfrompartitioning
AT coruzzigloriam estimatingplantphylogenylessonsfrompartitioning
AT desallerob estimatingplantphylogenylessonsfrompartitioning