Cargando…

Impact of the Partitioning Scheme on Divergence Times Inferred from Mammalian Genomic Data Sets

Data partitioning has long been regarded as an important parameter for phylogenetic inference. The division of heterogeneous multigene data sets into partitions with similar substitution patterns is known to increase the performance of probabilistic phylogenetic methods. However, the effect of the p...

Descripción completa

Detalles Bibliográficos
Autores principales: Voloch, Carolina M., Schrago, Carlos G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3362329/
https://www.ncbi.nlm.nih.gov/pubmed/22654486
http://dx.doi.org/10.4137/EBO.S9627
_version_ 1782234212364976128
author Voloch, Carolina M.
Schrago, Carlos G.
author_facet Voloch, Carolina M.
Schrago, Carlos G.
author_sort Voloch, Carolina M.
collection PubMed
description Data partitioning has long been regarded as an important parameter for phylogenetic inference. The division of heterogeneous multigene data sets into partitions with similar substitution patterns is known to increase the performance of probabilistic phylogenetic methods. However, the effect of the partitioning scheme on divergence time estimates has generally been ignored. To investigate the impact of data partitioning on the estimation of divergence times, we have constructed two genomic data sets. The first one with 15 nuclear genes comprising 50,928 bp were selected from the OrthoMam database; the second set was composed of complete mitochondrial genomes. We studied two partitioning schemes: concatenated supermatrices and partitioned gene analysis. We have also measured the impact of taxonomic sampling on the estimates. After drawing divergence time inferences using the uncorrelated relaxed clock in BEAST, we have compared the age estimates between the partitioning schemes. Our results show that, in general, both schemes resulted in similar chronological estimates, however the concatenated data sets were more efficient than the partitioned ones in attaining suitable effective sample sizes.
format Online
Article
Text
id pubmed-3362329
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-33623292012-05-31 Impact of the Partitioning Scheme on Divergence Times Inferred from Mammalian Genomic Data Sets Voloch, Carolina M. Schrago, Carlos G. Evol Bioinform Online Original Research Data partitioning has long been regarded as an important parameter for phylogenetic inference. The division of heterogeneous multigene data sets into partitions with similar substitution patterns is known to increase the performance of probabilistic phylogenetic methods. However, the effect of the partitioning scheme on divergence time estimates has generally been ignored. To investigate the impact of data partitioning on the estimation of divergence times, we have constructed two genomic data sets. The first one with 15 nuclear genes comprising 50,928 bp were selected from the OrthoMam database; the second set was composed of complete mitochondrial genomes. We studied two partitioning schemes: concatenated supermatrices and partitioned gene analysis. We have also measured the impact of taxonomic sampling on the estimates. After drawing divergence time inferences using the uncorrelated relaxed clock in BEAST, we have compared the age estimates between the partitioning schemes. Our results show that, in general, both schemes resulted in similar chronological estimates, however the concatenated data sets were more efficient than the partitioned ones in attaining suitable effective sample sizes. Libertas Academica 2012-05-14 /pmc/articles/PMC3362329/ /pubmed/22654486 http://dx.doi.org/10.4137/EBO.S9627 Text en © the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.
spellingShingle Original Research
Voloch, Carolina M.
Schrago, Carlos G.
Impact of the Partitioning Scheme on Divergence Times Inferred from Mammalian Genomic Data Sets
title Impact of the Partitioning Scheme on Divergence Times Inferred from Mammalian Genomic Data Sets
title_full Impact of the Partitioning Scheme on Divergence Times Inferred from Mammalian Genomic Data Sets
title_fullStr Impact of the Partitioning Scheme on Divergence Times Inferred from Mammalian Genomic Data Sets
title_full_unstemmed Impact of the Partitioning Scheme on Divergence Times Inferred from Mammalian Genomic Data Sets
title_short Impact of the Partitioning Scheme on Divergence Times Inferred from Mammalian Genomic Data Sets
title_sort impact of the partitioning scheme on divergence times inferred from mammalian genomic data sets
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3362329/
https://www.ncbi.nlm.nih.gov/pubmed/22654486
http://dx.doi.org/10.4137/EBO.S9627
work_keys_str_mv AT volochcarolinam impactofthepartitioningschemeondivergencetimesinferredfrommammaliangenomicdatasets
AT schragocarlosg impactofthepartitioningschemeondivergencetimesinferredfrommammaliangenomicdatasets