Cargando…

The effect of mutation subtypes on the allele frequency spectrum and population genetics inference

Population genetics has adapted as technological advances in next-generation sequencing have resulted in an exponential increase of genetic data. A common approach to efficiently analyze genetic variation present in large sequencing data is through the allele frequency spectrum, defined as the distr...

Descripción completa

Detalles Bibliográficos
Autores principales: Liao, Kevin, Carlson, Jedidiah, Zöllner, Sebastian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10085755/
https://www.ncbi.nlm.nih.gov/pubmed/36759699
http://dx.doi.org/10.1093/g3journal/jkad035
_version_ 1785022004220395520
author Liao, Kevin
Carlson, Jedidiah
Zöllner, Sebastian
author_facet Liao, Kevin
Carlson, Jedidiah
Zöllner, Sebastian
author_sort Liao, Kevin
collection PubMed
description Population genetics has adapted as technological advances in next-generation sequencing have resulted in an exponential increase of genetic data. A common approach to efficiently analyze genetic variation present in large sequencing data is through the allele frequency spectrum, defined as the distribution of allele frequencies in a sample. While the frequency spectrum serves to summarize patterns of genetic variation, it implicitly assumes mutation types (A→C vs C→T) as interchangeable. However, mutations of different types arise and spread due to spatial and temporal variation in forces such as mutation rate and biased gene conversion that result in heterogeneity in the distribution of allele frequencies across sites. In this work, we explore the impact of this simplification on multiple aspects of population genetic modeling. As a site’s mutation rate is strongly affected by flanking nucleotides, we defined a mutation subtype by the base pair change and adjacent nucleotides (e.g. AAA→ATA) and systematically assessed the heterogeneity in the frequency spectrum across 96 distinct 3-mer mutation subtypes using n = 3556 whole-genome sequenced individuals of European ancestry. We observed substantial variation across the subtype-specific frequency spectra, with some of the variation being influenced by molecular factors previously identified for single base mutation types. Estimates of model parameters from demographic inference performed for each mutation subtype’s AFS individually varied drastically across the 96 subtypes. In local patterns of variation, a combination of regional subtype composition and local genomic factors shaped the regional frequency spectrum across genomic regions. Our results illustrate how treating variants in large sequencing samples as interchangeable may confound population genetic frameworks and encourages us to consider the unique evolutionary mechanisms of analyzed polymorphisms.
format Online
Article
Text
id pubmed-10085755
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100857552023-04-12 The effect of mutation subtypes on the allele frequency spectrum and population genetics inference Liao, Kevin Carlson, Jedidiah Zöllner, Sebastian G3 (Bethesda) Investigation Population genetics has adapted as technological advances in next-generation sequencing have resulted in an exponential increase of genetic data. A common approach to efficiently analyze genetic variation present in large sequencing data is through the allele frequency spectrum, defined as the distribution of allele frequencies in a sample. While the frequency spectrum serves to summarize patterns of genetic variation, it implicitly assumes mutation types (A→C vs C→T) as interchangeable. However, mutations of different types arise and spread due to spatial and temporal variation in forces such as mutation rate and biased gene conversion that result in heterogeneity in the distribution of allele frequencies across sites. In this work, we explore the impact of this simplification on multiple aspects of population genetic modeling. As a site’s mutation rate is strongly affected by flanking nucleotides, we defined a mutation subtype by the base pair change and adjacent nucleotides (e.g. AAA→ATA) and systematically assessed the heterogeneity in the frequency spectrum across 96 distinct 3-mer mutation subtypes using n = 3556 whole-genome sequenced individuals of European ancestry. We observed substantial variation across the subtype-specific frequency spectra, with some of the variation being influenced by molecular factors previously identified for single base mutation types. Estimates of model parameters from demographic inference performed for each mutation subtype’s AFS individually varied drastically across the 96 subtypes. In local patterns of variation, a combination of regional subtype composition and local genomic factors shaped the regional frequency spectrum across genomic regions. Our results illustrate how treating variants in large sequencing samples as interchangeable may confound population genetic frameworks and encourages us to consider the unique evolutionary mechanisms of analyzed polymorphisms. Oxford University Press 2023-02-10 /pmc/articles/PMC10085755/ /pubmed/36759699 http://dx.doi.org/10.1093/g3journal/jkad035 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Liao, Kevin
Carlson, Jedidiah
Zöllner, Sebastian
The effect of mutation subtypes on the allele frequency spectrum and population genetics inference
title The effect of mutation subtypes on the allele frequency spectrum and population genetics inference
title_full The effect of mutation subtypes on the allele frequency spectrum and population genetics inference
title_fullStr The effect of mutation subtypes on the allele frequency spectrum and population genetics inference
title_full_unstemmed The effect of mutation subtypes on the allele frequency spectrum and population genetics inference
title_short The effect of mutation subtypes on the allele frequency spectrum and population genetics inference
title_sort effect of mutation subtypes on the allele frequency spectrum and population genetics inference
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10085755/
https://www.ncbi.nlm.nih.gov/pubmed/36759699
http://dx.doi.org/10.1093/g3journal/jkad035
work_keys_str_mv AT liaokevin theeffectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference
AT carlsonjedidiah theeffectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference
AT zollnersebastian theeffectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference
AT liaokevin effectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference
AT carlsonjedidiah effectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference
AT zollnersebastian effectofmutationsubtypesontheallelefrequencyspectrumandpopulationgeneticsinference