Cargando…
Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data
Many evolutionary relationships remain controversial despite whole-genome sequencing data. These controversies arise, in part, due to challenges associated with accurately modeling the complex phylogenetic signal coming from genomic regions experiencing distinct evolutionary forces. Here, we examine...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8136493/ https://www.ncbi.nlm.nih.gov/pubmed/33528497 http://dx.doi.org/10.1093/molbev/msab026 |
_version_ | 1783695454446288896 |
---|---|
author | Literman, Robert Schwartz, Rachel |
author_facet | Literman, Robert Schwartz, Rachel |
author_sort | Literman, Robert |
collection | PubMed |
description | Many evolutionary relationships remain controversial despite whole-genome sequencing data. These controversies arise, in part, due to challenges associated with accurately modeling the complex phylogenetic signal coming from genomic regions experiencing distinct evolutionary forces. Here, we examine how different regions of the genome support or contradict well-established relationships among three mammal groups using millions of orthologous parsimony-informative biallelic sites (PIBS) distributed across primate, rodent, and Pecora genomes. We compared PIBS concordance percentages among locus types (e.g. coding sequences (CDS), introns, intergenic regions), and contrasted PIBS utility over evolutionary timescales. Sites derived from noncoding sequences provided more data and proportionally more concordant sites compared with those from CDS in all clades. CDS PIBS were also predominant drivers of tree incongruence in two cases of topological conflict. PIBS derived from most locus types provided surprisingly consistent support for splitting events spread across the timescales we examined, although we find evidence that CDS and intronic PIBS may, respectively and to a limited degree, inform disproportionately about older and younger splits. In this era of accessible wholegenome sequence data, these results:1) suggest benefits to more intentionally focusing on noncoding loci as robust data for tree inference and 2) reinforce the importance of accurate modeling, especially when using CDS data. |
format | Online Article Text |
id | pubmed-8136493 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-81364932021-05-25 Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data Literman, Robert Schwartz, Rachel Mol Biol Evol Discoveries Many evolutionary relationships remain controversial despite whole-genome sequencing data. These controversies arise, in part, due to challenges associated with accurately modeling the complex phylogenetic signal coming from genomic regions experiencing distinct evolutionary forces. Here, we examine how different regions of the genome support or contradict well-established relationships among three mammal groups using millions of orthologous parsimony-informative biallelic sites (PIBS) distributed across primate, rodent, and Pecora genomes. We compared PIBS concordance percentages among locus types (e.g. coding sequences (CDS), introns, intergenic regions), and contrasted PIBS utility over evolutionary timescales. Sites derived from noncoding sequences provided more data and proportionally more concordant sites compared with those from CDS in all clades. CDS PIBS were also predominant drivers of tree incongruence in two cases of topological conflict. PIBS derived from most locus types provided surprisingly consistent support for splitting events spread across the timescales we examined, although we find evidence that CDS and intronic PIBS may, respectively and to a limited degree, inform disproportionately about older and younger splits. In this era of accessible wholegenome sequence data, these results:1) suggest benefits to more intentionally focusing on noncoding loci as robust data for tree inference and 2) reinforce the importance of accurate modeling, especially when using CDS data. Oxford University Press 2021-02-02 /pmc/articles/PMC8136493/ /pubmed/33528497 http://dx.doi.org/10.1093/molbev/msab026 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Discoveries Literman, Robert Schwartz, Rachel Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data |
title | Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data |
title_full | Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data |
title_fullStr | Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data |
title_full_unstemmed | Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data |
title_short | Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data |
title_sort | genome-scale profiling reveals noncoding loci carry higher proportions of concordant data |
topic | Discoveries |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8136493/ https://www.ncbi.nlm.nih.gov/pubmed/33528497 http://dx.doi.org/10.1093/molbev/msab026 |
work_keys_str_mv | AT litermanrobert genomescaleprofilingrevealsnoncodinglocicarryhigherproportionsofconcordantdata AT schwartzrachel genomescaleprofilingrevealsnoncodinglocicarryhigherproportionsofconcordantdata |