Cargando…

Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales

Advances in high-throughput sequencing have generated a vast amount of transcriptomic data that are being increasingly used in phylogenetic reconstruction. However, processing the vast datasets for a huge number of genes and even identifying optimal analytical methodology are challenging. Through de...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Lu, Wu, Wei, Yan, Hai-Fei, Ge, Xue-Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4718149/
https://www.ncbi.nlm.nih.gov/pubmed/26819541
http://dx.doi.org/10.4137/EBO.S22448
_version_ 1782410749542400000
author Zhang, Lu
Wu, Wei
Yan, Hai-Fei
Ge, Xue-Jun
author_facet Zhang, Lu
Wu, Wei
Yan, Hai-Fei
Ge, Xue-Jun
author_sort Zhang, Lu
collection PubMed
description Advances in high-throughput sequencing have generated a vast amount of transcriptomic data that are being increasingly used in phylogenetic reconstruction. However, processing the vast datasets for a huge number of genes and even identifying optimal analytical methodology are challenging. Through de novo sequenced and retrieved data from public databases, we identified 221 orthologous protein-coding genes to reconstruct the phylogeny of Ericales, an order characterized by rapid ancient radiation. Seven species representing different families in Ericales were used as in-groups. Both concatenation and coalescence methods yielded the same well-supported topology as previous studies, with only two nodes conflicting with previously reported relationships. The results revealed that a partitioning strategy could improve the traditional concatenation methodology. Rapidly evolving genes negatively affected the concatenation analysis, while slowly evolving genes slightly affected the coalescence analysis. The coalescence methods usually accommodated rate heterogeneity better and required fewer genes to yield well-supported topologies than the concatenation methods with both real and simulated data.
format Online
Article
Text
id pubmed-4718149
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-47181492016-01-27 Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales Zhang, Lu Wu, Wei Yan, Hai-Fei Ge, Xue-Jun Evol Bioinform Online Original Research Advances in high-throughput sequencing have generated a vast amount of transcriptomic data that are being increasingly used in phylogenetic reconstruction. However, processing the vast datasets for a huge number of genes and even identifying optimal analytical methodology are challenging. Through de novo sequenced and retrieved data from public databases, we identified 221 orthologous protein-coding genes to reconstruct the phylogeny of Ericales, an order characterized by rapid ancient radiation. Seven species representing different families in Ericales were used as in-groups. Both concatenation and coalescence methods yielded the same well-supported topology as previous studies, with only two nodes conflicting with previously reported relationships. The results revealed that a partitioning strategy could improve the traditional concatenation methodology. Rapidly evolving genes negatively affected the concatenation analysis, while slowly evolving genes slightly affected the coalescence analysis. The coalescence methods usually accommodated rate heterogeneity better and required fewer genes to yield well-supported topologies than the concatenation methods with both real and simulated data. Libertas Academica 2016-01-18 /pmc/articles/PMC4718149/ /pubmed/26819541 http://dx.doi.org/10.4137/EBO.S22448 Text en © 2015 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License.
spellingShingle Original Research
Zhang, Lu
Wu, Wei
Yan, Hai-Fei
Ge, Xue-Jun
Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales
title Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales
title_full Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales
title_fullStr Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales
title_full_unstemmed Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales
title_short Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales
title_sort phylotranscriptomic analysis based on coalescence was less influenced by the evolving rates and the number of genes: a case study in ericales
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4718149/
https://www.ncbi.nlm.nih.gov/pubmed/26819541
http://dx.doi.org/10.4137/EBO.S22448
work_keys_str_mv AT zhanglu phylotranscriptomicanalysisbasedoncoalescencewaslessinfluencedbytheevolvingratesandthenumberofgenesacasestudyinericales
AT wuwei phylotranscriptomicanalysisbasedoncoalescencewaslessinfluencedbytheevolvingratesandthenumberofgenesacasestudyinericales
AT yanhaifei phylotranscriptomicanalysisbasedoncoalescencewaslessinfluencedbytheevolvingratesandthenumberofgenesacasestudyinericales
AT gexuejun phylotranscriptomicanalysisbasedoncoalescencewaslessinfluencedbytheevolvingratesandthenumberofgenesacasestudyinericales