Cargando…

Functional genomic effects of indels using Bayesian genome-phenome wide association studies in sorghum

High-throughput genomic and phenomic data have enhanced the ability to detect genotype-to-phenotype associations that can resolve broad pleiotropic effects of mutations on plant phenotypes. As the scale of genotyping and phenotyping has advanced, rigorous methodologies have been developed to accommo...

Descripción completa

Detalles Bibliográficos
Autores principales: Boatwright, J. Lucas, Sapkota, Sirjan, Kresovich, Stephen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102435/
https://www.ncbi.nlm.nih.gov/pubmed/37065477
http://dx.doi.org/10.3389/fgene.2023.1143395
_version_ 1785025691001028608
author Boatwright, J. Lucas
Sapkota, Sirjan
Kresovich, Stephen
author_facet Boatwright, J. Lucas
Sapkota, Sirjan
Kresovich, Stephen
author_sort Boatwright, J. Lucas
collection PubMed
description High-throughput genomic and phenomic data have enhanced the ability to detect genotype-to-phenotype associations that can resolve broad pleiotropic effects of mutations on plant phenotypes. As the scale of genotyping and phenotyping has advanced, rigorous methodologies have been developed to accommodate larger datasets and maintain statistical precision. However, determining the functional effects of associated genes/loci is expensive and limited due to the complexity associated with cloning and subsequent characterization. Here, we utilized phenomic imputation of a multi-year, multi-environment dataset using PHENIX which imputes missing data using kinship and correlated traits, and we screened insertions and deletions (InDels) from the recently whole-genome sequenced Sorghum Association Panel for putative loss-of-function effects. Candidate loci from genome-wide association results were screened for potential loss of function using a Bayesian Genome-Phenome Wide Association Study (BGPWAS) model across both functionally characterized and uncharacterized loci. Our approach is designed to facilitate in silico validation of associations beyond traditional candidate gene and literature-search approaches and to facilitate the identification of putative variants for functional analysis and reduce the incidence of false-positive candidates in current functional validation methods. Using this Bayesian GPWAS model, we identified associations for previously characterized genes with known loss-of-function alleles, specific genes falling within known quantitative trait loci, and genes without any previous genome-wide associations while additionally detecting putative pleiotropic effects. In particular, we were able to identify the major tannin haplotypes at the Tan1 locus and effects of InDels on the protein folding. Depending on the haplotype present, heterodimer formation with Tan2 was significantly affected. We also identified major effect InDels in Dw2 and Ma1, where proteins were truncated due to frameshift mutations that resulted in early stop codons. These truncated proteins also lost most of their functional domains, suggesting that these indels likely result in loss of function. Here, we show that the Bayesian GPWAS model is able to identify loss-of-function alleles that can have significant effects upon protein structure and folding as well as multimer formation. Our approach to characterize loss-of-function mutations and their functional repercussions will facilitate precision genomics and breeding by identifying key targets for gene editing and trait integration.
format Online
Article
Text
id pubmed-10102435
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-101024352023-04-15 Functional genomic effects of indels using Bayesian genome-phenome wide association studies in sorghum Boatwright, J. Lucas Sapkota, Sirjan Kresovich, Stephen Front Genet Genetics High-throughput genomic and phenomic data have enhanced the ability to detect genotype-to-phenotype associations that can resolve broad pleiotropic effects of mutations on plant phenotypes. As the scale of genotyping and phenotyping has advanced, rigorous methodologies have been developed to accommodate larger datasets and maintain statistical precision. However, determining the functional effects of associated genes/loci is expensive and limited due to the complexity associated with cloning and subsequent characterization. Here, we utilized phenomic imputation of a multi-year, multi-environment dataset using PHENIX which imputes missing data using kinship and correlated traits, and we screened insertions and deletions (InDels) from the recently whole-genome sequenced Sorghum Association Panel for putative loss-of-function effects. Candidate loci from genome-wide association results were screened for potential loss of function using a Bayesian Genome-Phenome Wide Association Study (BGPWAS) model across both functionally characterized and uncharacterized loci. Our approach is designed to facilitate in silico validation of associations beyond traditional candidate gene and literature-search approaches and to facilitate the identification of putative variants for functional analysis and reduce the incidence of false-positive candidates in current functional validation methods. Using this Bayesian GPWAS model, we identified associations for previously characterized genes with known loss-of-function alleles, specific genes falling within known quantitative trait loci, and genes without any previous genome-wide associations while additionally detecting putative pleiotropic effects. In particular, we were able to identify the major tannin haplotypes at the Tan1 locus and effects of InDels on the protein folding. Depending on the haplotype present, heterodimer formation with Tan2 was significantly affected. We also identified major effect InDels in Dw2 and Ma1, where proteins were truncated due to frameshift mutations that resulted in early stop codons. These truncated proteins also lost most of their functional domains, suggesting that these indels likely result in loss of function. Here, we show that the Bayesian GPWAS model is able to identify loss-of-function alleles that can have significant effects upon protein structure and folding as well as multimer formation. Our approach to characterize loss-of-function mutations and their functional repercussions will facilitate precision genomics and breeding by identifying key targets for gene editing and trait integration. Frontiers Media S.A. 2023-03-31 /pmc/articles/PMC10102435/ /pubmed/37065477 http://dx.doi.org/10.3389/fgene.2023.1143395 Text en Copyright © 2023 Boatwright, Sapkota and Kresovich. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Boatwright, J. Lucas
Sapkota, Sirjan
Kresovich, Stephen
Functional genomic effects of indels using Bayesian genome-phenome wide association studies in sorghum
title Functional genomic effects of indels using Bayesian genome-phenome wide association studies in sorghum
title_full Functional genomic effects of indels using Bayesian genome-phenome wide association studies in sorghum
title_fullStr Functional genomic effects of indels using Bayesian genome-phenome wide association studies in sorghum
title_full_unstemmed Functional genomic effects of indels using Bayesian genome-phenome wide association studies in sorghum
title_short Functional genomic effects of indels using Bayesian genome-phenome wide association studies in sorghum
title_sort functional genomic effects of indels using bayesian genome-phenome wide association studies in sorghum
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102435/
https://www.ncbi.nlm.nih.gov/pubmed/37065477
http://dx.doi.org/10.3389/fgene.2023.1143395
work_keys_str_mv AT boatwrightjlucas functionalgenomiceffectsofindelsusingbayesiangenomephenomewideassociationstudiesinsorghum
AT sapkotasirjan functionalgenomiceffectsofindelsusingbayesiangenomephenomewideassociationstudiesinsorghum
AT kresovichstephen functionalgenomiceffectsofindelsusingbayesiangenomephenomewideassociationstudiesinsorghum