Cargando…

Highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity

Understanding the genetic causes of trait variation is a primary goal of genetic research. One way that individuals can vary genetically is through variable pangenomic genes: genes that are only present in some individuals in a population. The presence or absence of entire genes could have large eff...

Descripción completa

Detalles Bibliográficos
Autores principales: Weller, Cory A., Andreev, Ilya, Chambers, Michael J., Park, Morgan, Bloom, Joshua S., Sadhu, Meru J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10317115/
https://www.ncbi.nlm.nih.gov/pubmed/37127330
http://dx.doi.org/10.1101/gr.277515.122
_version_ 1785067836987670528
author Weller, Cory A.
Andreev, Ilya
Chambers, Michael J.
Park, Morgan
Bloom, Joshua S.
Sadhu, Meru J.
author_facet Weller, Cory A.
Andreev, Ilya
Chambers, Michael J.
Park, Morgan
Bloom, Joshua S.
Sadhu, Meru J.
author_sort Weller, Cory A.
collection PubMed
description Understanding the genetic causes of trait variation is a primary goal of genetic research. One way that individuals can vary genetically is through variable pangenomic genes: genes that are only present in some individuals in a population. The presence or absence of entire genes could have large effects on trait variation. However, variable pangenomic genes can be missed in standard genotyping workflows, owing to reliance on aligning short-read sequencing to reference genomes. A popular method for studying the genetic basis of trait variation is linkage mapping, which identifies quantitative trait loci (QTLs), regions of the genome that harbor causative genetic variants. Large-scale linkage mapping in the budding yeast Saccharomyces cerevisiae has found thousands of QTLs affecting myriad yeast phenotypes. To enable the resolution of QTLs caused by variable pangenomic genes, we used long-read sequencing to generate highly complete de novo genome assemblies of 16 diverse yeast isolates. With these assemblies, we resolved QTLs for growth on maltose, sucrose, raffinose, and oxidative stress to specific genes that are absent from the reference genome but present in the broader yeast population at appreciable frequency. Copies of genes also duplicate onto chromosomes where they are absent in the reference genome, and we found that these copies generate additional QTLs whose resolution requires pangenome characterization. Our findings show the need for highly complete genome assemblies to identify the genetic basis of trait variation.
format Online
Article
Text
id pubmed-10317115
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-103171152023-11-01 Highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity Weller, Cory A. Andreev, Ilya Chambers, Michael J. Park, Morgan Bloom, Joshua S. Sadhu, Meru J. Genome Res Research Understanding the genetic causes of trait variation is a primary goal of genetic research. One way that individuals can vary genetically is through variable pangenomic genes: genes that are only present in some individuals in a population. The presence or absence of entire genes could have large effects on trait variation. However, variable pangenomic genes can be missed in standard genotyping workflows, owing to reliance on aligning short-read sequencing to reference genomes. A popular method for studying the genetic basis of trait variation is linkage mapping, which identifies quantitative trait loci (QTLs), regions of the genome that harbor causative genetic variants. Large-scale linkage mapping in the budding yeast Saccharomyces cerevisiae has found thousands of QTLs affecting myriad yeast phenotypes. To enable the resolution of QTLs caused by variable pangenomic genes, we used long-read sequencing to generate highly complete de novo genome assemblies of 16 diverse yeast isolates. With these assemblies, we resolved QTLs for growth on maltose, sucrose, raffinose, and oxidative stress to specific genes that are absent from the reference genome but present in the broader yeast population at appreciable frequency. Copies of genes also duplicate onto chromosomes where they are absent in the reference genome, and we found that these copies generate additional QTLs whose resolution requires pangenome characterization. Our findings show the need for highly complete genome assemblies to identify the genetic basis of trait variation. Cold Spring Harbor Laboratory Press 2023-05 /pmc/articles/PMC10317115/ /pubmed/37127330 http://dx.doi.org/10.1101/gr.277515.122 Text en © 2023 Weller et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Research
Weller, Cory A.
Andreev, Ilya
Chambers, Michael J.
Park, Morgan
Bloom, Joshua S.
Sadhu, Meru J.
Highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity
title Highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity
title_full Highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity
title_fullStr Highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity
title_full_unstemmed Highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity
title_short Highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity
title_sort highly complete long-read genomes reveal pangenomic variation underlying yeast phenotypic diversity
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10317115/
https://www.ncbi.nlm.nih.gov/pubmed/37127330
http://dx.doi.org/10.1101/gr.277515.122
work_keys_str_mv AT wellercorya highlycompletelongreadgenomesrevealpangenomicvariationunderlyingyeastphenotypicdiversity
AT andreevilya highlycompletelongreadgenomesrevealpangenomicvariationunderlyingyeastphenotypicdiversity
AT chambersmichaelj highlycompletelongreadgenomesrevealpangenomicvariationunderlyingyeastphenotypicdiversity
AT parkmorgan highlycompletelongreadgenomesrevealpangenomicvariationunderlyingyeastphenotypicdiversity
AT highlycompletelongreadgenomesrevealpangenomicvariationunderlyingyeastphenotypicdiversity
AT bloomjoshuas highlycompletelongreadgenomesrevealpangenomicvariationunderlyingyeastphenotypicdiversity
AT sadhumeruj highlycompletelongreadgenomesrevealpangenomicvariationunderlyingyeastphenotypicdiversity