Cargando…

Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data

Many evolutionary relationships remain controversial despite whole-genome sequencing data. These controversies arise, in part, due to challenges associated with accurately modeling the complex phylogenetic signal coming from genomic regions experiencing distinct evolutionary forces. Here, we examine...

Descripción completa

Detalles Bibliográficos
Autores principales: Literman, Robert, Schwartz, Rachel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8136493/
https://www.ncbi.nlm.nih.gov/pubmed/33528497
http://dx.doi.org/10.1093/molbev/msab026
_version_ 1783695454446288896
author Literman, Robert
Schwartz, Rachel
author_facet Literman, Robert
Schwartz, Rachel
author_sort Literman, Robert
collection PubMed
description Many evolutionary relationships remain controversial despite whole-genome sequencing data. These controversies arise, in part, due to challenges associated with accurately modeling the complex phylogenetic signal coming from genomic regions experiencing distinct evolutionary forces. Here, we examine how different regions of the genome support or contradict well-established relationships among three mammal groups using millions of orthologous parsimony-informative biallelic sites (PIBS) distributed across primate, rodent, and Pecora genomes. We compared PIBS concordance percentages among locus types (e.g. coding sequences (CDS), introns, intergenic regions), and contrasted PIBS utility over evolutionary timescales. Sites derived from noncoding sequences provided more data and proportionally more concordant sites compared with those from CDS in all clades. CDS PIBS were also predominant drivers of tree incongruence in two cases of topological conflict. PIBS derived from most locus types provided surprisingly consistent support for splitting events spread across the timescales we examined, although we find evidence that CDS and intronic PIBS may, respectively and to a limited degree, inform disproportionately about older and younger splits. In this era of accessible wholegenome sequence data, these results:1) suggest benefits to more intentionally focusing on noncoding loci as robust data for tree inference and 2) reinforce the importance of accurate modeling, especially when using CDS data.
format Online
Article
Text
id pubmed-8136493
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-81364932021-05-25 Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data Literman, Robert Schwartz, Rachel Mol Biol Evol Discoveries Many evolutionary relationships remain controversial despite whole-genome sequencing data. These controversies arise, in part, due to challenges associated with accurately modeling the complex phylogenetic signal coming from genomic regions experiencing distinct evolutionary forces. Here, we examine how different regions of the genome support or contradict well-established relationships among three mammal groups using millions of orthologous parsimony-informative biallelic sites (PIBS) distributed across primate, rodent, and Pecora genomes. We compared PIBS concordance percentages among locus types (e.g. coding sequences (CDS), introns, intergenic regions), and contrasted PIBS utility over evolutionary timescales. Sites derived from noncoding sequences provided more data and proportionally more concordant sites compared with those from CDS in all clades. CDS PIBS were also predominant drivers of tree incongruence in two cases of topological conflict. PIBS derived from most locus types provided surprisingly consistent support for splitting events spread across the timescales we examined, although we find evidence that CDS and intronic PIBS may, respectively and to a limited degree, inform disproportionately about older and younger splits. In this era of accessible wholegenome sequence data, these results:1) suggest benefits to more intentionally focusing on noncoding loci as robust data for tree inference and 2) reinforce the importance of accurate modeling, especially when using CDS data. Oxford University Press 2021-02-02 /pmc/articles/PMC8136493/ /pubmed/33528497 http://dx.doi.org/10.1093/molbev/msab026 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Discoveries
Literman, Robert
Schwartz, Rachel
Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data
title Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data
title_full Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data
title_fullStr Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data
title_full_unstemmed Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data
title_short Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data
title_sort genome-scale profiling reveals noncoding loci carry higher proportions of concordant data
topic Discoveries
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8136493/
https://www.ncbi.nlm.nih.gov/pubmed/33528497
http://dx.doi.org/10.1093/molbev/msab026
work_keys_str_mv AT litermanrobert genomescaleprofilingrevealsnoncodinglocicarryhigherproportionsofconcordantdata
AT schwartzrachel genomescaleprofilingrevealsnoncodinglocicarryhigherproportionsofconcordantdata