Cargando…
Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales
Advances in high-throughput sequencing have generated a vast amount of transcriptomic data that are being increasingly used in phylogenetic reconstruction. However, processing the vast datasets for a huge number of genes and even identifying optimal analytical methodology are challenging. Through de...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4718149/ https://www.ncbi.nlm.nih.gov/pubmed/26819541 http://dx.doi.org/10.4137/EBO.S22448 |
_version_ | 1782410749542400000 |
---|---|
author | Zhang, Lu Wu, Wei Yan, Hai-Fei Ge, Xue-Jun |
author_facet | Zhang, Lu Wu, Wei Yan, Hai-Fei Ge, Xue-Jun |
author_sort | Zhang, Lu |
collection | PubMed |
description | Advances in high-throughput sequencing have generated a vast amount of transcriptomic data that are being increasingly used in phylogenetic reconstruction. However, processing the vast datasets for a huge number of genes and even identifying optimal analytical methodology are challenging. Through de novo sequenced and retrieved data from public databases, we identified 221 orthologous protein-coding genes to reconstruct the phylogeny of Ericales, an order characterized by rapid ancient radiation. Seven species representing different families in Ericales were used as in-groups. Both concatenation and coalescence methods yielded the same well-supported topology as previous studies, with only two nodes conflicting with previously reported relationships. The results revealed that a partitioning strategy could improve the traditional concatenation methodology. Rapidly evolving genes negatively affected the concatenation analysis, while slowly evolving genes slightly affected the coalescence analysis. The coalescence methods usually accommodated rate heterogeneity better and required fewer genes to yield well-supported topologies than the concatenation methods with both real and simulated data. |
format | Online Article Text |
id | pubmed-4718149 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-47181492016-01-27 Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales Zhang, Lu Wu, Wei Yan, Hai-Fei Ge, Xue-Jun Evol Bioinform Online Original Research Advances in high-throughput sequencing have generated a vast amount of transcriptomic data that are being increasingly used in phylogenetic reconstruction. However, processing the vast datasets for a huge number of genes and even identifying optimal analytical methodology are challenging. Through de novo sequenced and retrieved data from public databases, we identified 221 orthologous protein-coding genes to reconstruct the phylogeny of Ericales, an order characterized by rapid ancient radiation. Seven species representing different families in Ericales were used as in-groups. Both concatenation and coalescence methods yielded the same well-supported topology as previous studies, with only two nodes conflicting with previously reported relationships. The results revealed that a partitioning strategy could improve the traditional concatenation methodology. Rapidly evolving genes negatively affected the concatenation analysis, while slowly evolving genes slightly affected the coalescence analysis. The coalescence methods usually accommodated rate heterogeneity better and required fewer genes to yield well-supported topologies than the concatenation methods with both real and simulated data. Libertas Academica 2016-01-18 /pmc/articles/PMC4718149/ /pubmed/26819541 http://dx.doi.org/10.4137/EBO.S22448 Text en © 2015 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License. |
spellingShingle | Original Research Zhang, Lu Wu, Wei Yan, Hai-Fei Ge, Xue-Jun Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales |
title | Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales |
title_full | Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales |
title_fullStr | Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales |
title_full_unstemmed | Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales |
title_short | Phylotranscriptomic Analysis Based on Coalescence was Less Influenced by the Evolving Rates and the Number of Genes: A Case Study in Ericales |
title_sort | phylotranscriptomic analysis based on coalescence was less influenced by the evolving rates and the number of genes: a case study in ericales |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4718149/ https://www.ncbi.nlm.nih.gov/pubmed/26819541 http://dx.doi.org/10.4137/EBO.S22448 |
work_keys_str_mv | AT zhanglu phylotranscriptomicanalysisbasedoncoalescencewaslessinfluencedbytheevolvingratesandthenumberofgenesacasestudyinericales AT wuwei phylotranscriptomicanalysisbasedoncoalescencewaslessinfluencedbytheevolvingratesandthenumberofgenesacasestudyinericales AT yanhaifei phylotranscriptomicanalysisbasedoncoalescencewaslessinfluencedbytheevolvingratesandthenumberofgenesacasestudyinericales AT gexuejun phylotranscriptomicanalysisbasedoncoalescencewaslessinfluencedbytheevolvingratesandthenumberofgenesacasestudyinericales |