Cargando…
The effect of mutation subtypes on the allele frequency spectrum and population genetics inference
Population genetics has adapted as technological advances in next-generation sequencing have resulted in an exponential increase of genetic data. A common approach to efficiently analyze genetic variation present in large sequencing data is through the allele frequency spectrum, defined as the distr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10085755/ https://www.ncbi.nlm.nih.gov/pubmed/36759699 http://dx.doi.org/10.1093/g3journal/jkad035 |
_version_ | 1785022004220395520 |
---|---|
author | Liao, Kevin Carlson, Jedidiah Zöllner, Sebastian |
author_facet | Liao, Kevin Carlson, Jedidiah Zöllner, Sebastian |
author_sort | Liao, Kevin |
collection | PubMed |
description | Population genetics has adapted as technological advances in next-generation sequencing have resulted in an exponential increase of genetic data. A common approach to efficiently analyze genetic variation present in large sequencing data is through the allele frequency spectrum, defined as the distribution of allele frequencies in a sample. While the frequency spectrum serves to summarize patterns of genetic variation, it implicitly assumes mutation types (A→C vs C→T) as interchangeable. However, mutations of different types arise and spread due to spatial and temporal variation in forces such as mutation rate and biased gene conversion that result in heterogeneity in the distribution of allele frequencies across sites. In this work, we explore the impact of this simplification on multiple aspects of population genetic modeling. As a site’s mutation rate is strongly affected by flanking nucleotides, we defined a mutation subtype by the base pair change and adjacent nucleotides (e.g. AAA→ATA) and systematically assessed the heterogeneity in the frequency spectrum across 96 distinct 3-mer mutation subtypes using n = 3556 whole-genome sequenced individuals of European ancestry. We observed substantial variation across the subtype-specific frequency spectra, with some of the variation being influenced by molecular factors previously identified for single base mutation types. Estimates of model parameters from demographic inference performed for each mutation subtype’s AFS individually varied drastically across the 96 subtypes. In local patterns of variation, a combination of regional subtype composition and local genomic factors shaped the regional frequency spectrum across genomic regions. Our results illustrate how treating variants in large sequencing samples as interchangeable may confound population genetic frameworks and encourages us to consider the unique evolutionary mechanisms of analyzed polymorphisms. |
format | Online Article Text |
id | pubmed-10085755 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-100857552023-04-12 The effect of mutation subtypes on the allele frequency spectrum and population genetics inference Liao, Kevin Carlson, Jedidiah Zöllner, Sebastian G3 (Bethesda) Investigation Population genetics has adapted as technological advances in next-generation sequencing have resulted in an exponential increase of genetic data. A common approach to efficiently analyze genetic variation present in large sequencing data is through the allele frequency spectrum, defined as the distribution of allele frequencies in a sample. While the frequency spectrum serves to summarize patterns of genetic variation, it implicitly assumes mutation types (A→C vs C→T) as interchangeable. However, mutations of different types arise and spread due to spatial and temporal variation in forces such as mutation rate and biased gene conversion that result in heterogeneity in the distribution of allele frequencies across sites. In this work, we explore the impact of this simplification on multiple aspects of population genetic modeling. As a site’s mutation rate is strongly affected by flanking nucleotides, we defined a mutation subtype by the base pair change and adjacent nucleotides (e.g. AAA→ATA) and systematically assessed the heterogeneity in the frequency spectrum across 96 distinct 3-mer mutation subtypes using n = 3556 whole-genome sequenced individuals of European ancestry. We observed substantial variation across the subtype-specific frequency spectra, with some of the variation being influenced by molecular factors previously identified for single base mutation types. Estimates of model parameters from demographic inference performed for each mutation subtype’s AFS individually varied drastically across the 96 subtypes. In local patterns of variation, a combination of regional subtype composition and local genomic factors shaped the regional frequency spectrum across genomic regions. Our results illustrate how treating variants in large sequencing samples as interchangeable may confound population genetic frameworks and encourages us to consider the unique evolutionary mechanisms of analyzed polymorphisms. Oxford University Press 2023-02-10 /pmc/articles/PMC10085755/ /pubmed/36759699 http://dx.doi.org/10.1093/g3journal/jkad035 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigation Liao, Kevin Carlson, Jedidiah Zöllner, Sebastian The effect of mutation subtypes on the allele frequency spectrum and population genetics inference |
title | The effect of mutation subtypes on the allele frequency spectrum and population genetics inference |
title_full | The effect of mutation subtypes on the allele frequency spectrum and population genetics inference |
title_fullStr | The effect of mutation subtypes on the allele frequency spectrum and population genetics inference |
title_full_unstemmed | The effect of mutation subtypes on the allele frequency spectrum and population genetics inference |
title_short | The effect of mutation subtypes on the allele frequency spectrum and population genetics inference |
title_sort | effect of mutation subtypes on the allele frequency spectrum and population genetics inference |
topic | Investigation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10085755/ https://www.ncbi.nlm.nih.gov/pubmed/36759699 http://dx.doi.org/10.1093/g3journal/jkad035 |
work_keys_str_mv | AT liaokevin theeffectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference AT carlsonjedidiah theeffectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference AT zollnersebastian theeffectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference AT liaokevin effectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference AT carlsonjedidiah effectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference AT zollnersebastian effectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference |