Cargando…
Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations
Inferred ancestral nucleotide states are increasingly employed in analyses of within- and between -species genome variation. Although numerous studies have focused on ancestral inference among distantly related lineages, approaches to infer ancestral states in polymorphism data have received less at...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5940166/ https://www.ncbi.nlm.nih.gov/pubmed/29588382 http://dx.doi.org/10.1534/g3.117.300512 |
_version_ | 1783321060947525632 |
---|---|
author | Matsumoto, Tomotaka Akashi, Hiroshi |
author_facet | Matsumoto, Tomotaka Akashi, Hiroshi |
author_sort | Matsumoto, Tomotaka |
collection | PubMed |
description | Inferred ancestral nucleotide states are increasingly employed in analyses of within- and between -species genome variation. Although numerous studies have focused on ancestral inference among distantly related lineages, approaches to infer ancestral states in polymorphism data have received less attention. Recently developed approaches that employ complex transition matrices allow us to infer ancestral nucleotide sequence in various evolutionary scenarios of base composition. However, the requirement of a single gene tree to calculate a likelihood is an important limitation for conducting ancestral inference using within-species variation in recombining genomes. To resolve this problem, and to extend the applicability of ancestral inference in studies of base composition evolution, we first evaluate three previously proposed methods to infer ancestral nucleotide sequences among within- and between-species sequence variation data. The methods employ a single allele, bifurcating tree, or a star tree for within-species variation data. Using simulated nucleotide sequences, we employ ancestral inference to infer fixations and polymorphisms. We find that all three methods show biased inference. We modify the bifurcating tree method to include weights to adjust for an expected site frequency spectrum, “bifurcating tree with weighting” (BTW). Our simulation analysis show that the BTW method can substantially improve the reliability and robustness of ancestral inference in a range of scenarios that include non-neutral and/or non-stationary base composition evolution. |
format | Online Article Text |
id | pubmed-5940166 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-59401662018-05-10 Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations Matsumoto, Tomotaka Akashi, Hiroshi G3 (Bethesda) Investigations Inferred ancestral nucleotide states are increasingly employed in analyses of within- and between -species genome variation. Although numerous studies have focused on ancestral inference among distantly related lineages, approaches to infer ancestral states in polymorphism data have received less attention. Recently developed approaches that employ complex transition matrices allow us to infer ancestral nucleotide sequence in various evolutionary scenarios of base composition. However, the requirement of a single gene tree to calculate a likelihood is an important limitation for conducting ancestral inference using within-species variation in recombining genomes. To resolve this problem, and to extend the applicability of ancestral inference in studies of base composition evolution, we first evaluate three previously proposed methods to infer ancestral nucleotide sequences among within- and between-species sequence variation data. The methods employ a single allele, bifurcating tree, or a star tree for within-species variation data. Using simulated nucleotide sequences, we employ ancestral inference to infer fixations and polymorphisms. We find that all three methods show biased inference. We modify the bifurcating tree method to include weights to adjust for an expected site frequency spectrum, “bifurcating tree with weighting” (BTW). Our simulation analysis show that the BTW method can substantially improve the reliability and robustness of ancestral inference in a range of scenarios that include non-neutral and/or non-stationary base composition evolution. Genetics Society of America 2018-03-27 /pmc/articles/PMC5940166/ /pubmed/29588382 http://dx.doi.org/10.1534/g3.117.300512 Text en Copyright © 2018 Tomotaka Matsumoto, Hiroshi Akashi http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigations Matsumoto, Tomotaka Akashi, Hiroshi Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations |
title | Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations |
title_full | Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations |
title_fullStr | Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations |
title_full_unstemmed | Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations |
title_short | Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations |
title_sort | distinguishing among evolutionary forces acting on genome-wide base composition: computer simulation analysis of approximate methods for inferring site frequency spectra of derived mutations |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5940166/ https://www.ncbi.nlm.nih.gov/pubmed/29588382 http://dx.doi.org/10.1534/g3.117.300512 |
work_keys_str_mv | AT matsumototomotaka distinguishingamongevolutionaryforcesactingongenomewidebasecompositioncomputersimulationanalysisofapproximatemethodsforinferringsitefrequencyspectraofderivedmutations AT akashihiroshi distinguishingamongevolutionaryforcesactingongenomewidebasecompositioncomputersimulationanalysisofapproximatemethodsforinferringsitefrequencyspectraofderivedmutations |