Cargando…

Evolution of Conserved Noncoding Sequences in Arabidopsis thaliana

Recent pangenome studies have revealed a large fraction of the gene content within a species exhibits presence–absence variation (PAV). However, coding regions alone provide an incomplete assessment of functional genomic sequence variation at the species level. Little to no attention has been paid t...

Descripción completa

Detalles Bibliográficos
Autores principales: Yocca, Alan E., Lu, Zefu, Schmitz, Robert J., Freeling, Michael, Edger, Patrick P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8233505/
https://www.ncbi.nlm.nih.gov/pubmed/33565589
http://dx.doi.org/10.1093/molbev/msab042
_version_ 1783713866993106944
author Yocca, Alan E.
Lu, Zefu
Schmitz, Robert J.
Freeling, Michael
Edger, Patrick P.
author_facet Yocca, Alan E.
Lu, Zefu
Schmitz, Robert J.
Freeling, Michael
Edger, Patrick P.
author_sort Yocca, Alan E.
collection PubMed
description Recent pangenome studies have revealed a large fraction of the gene content within a species exhibits presence–absence variation (PAV). However, coding regions alone provide an incomplete assessment of functional genomic sequence variation at the species level. Little to no attention has been paid to noncoding regulatory regions in pangenome studies, though these sequences directly modulate gene expression and phenotype. To uncover regulatory genetic variation, we generated chromosome-scale genome assemblies for thirty Arabidopsis thaliana accessions from multiple distinct habitats and characterized species level variation in Conserved Noncoding Sequences (CNS). Our analyses uncovered not only PAV and positional variation (PosV) but that diversity in CNS is nonrandom, with variants shared across different accessions. Using evolutionary analyses and chromatin accessibility data, we provide further evidence supporting roles for conserved and variable CNS in gene regulation. Additionally, our data suggests that transposable elements contribute to CNS variation. Characterizing species-level diversity in all functional genomic sequences may later uncover previously unknown mechanistic links between genotype and phenotype.
format Online
Article
Text
id pubmed-8233505
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-82335052021-06-28 Evolution of Conserved Noncoding Sequences in Arabidopsis thaliana Yocca, Alan E. Lu, Zefu Schmitz, Robert J. Freeling, Michael Edger, Patrick P. Mol Biol Evol Discoveries Recent pangenome studies have revealed a large fraction of the gene content within a species exhibits presence–absence variation (PAV). However, coding regions alone provide an incomplete assessment of functional genomic sequence variation at the species level. Little to no attention has been paid to noncoding regulatory regions in pangenome studies, though these sequences directly modulate gene expression and phenotype. To uncover regulatory genetic variation, we generated chromosome-scale genome assemblies for thirty Arabidopsis thaliana accessions from multiple distinct habitats and characterized species level variation in Conserved Noncoding Sequences (CNS). Our analyses uncovered not only PAV and positional variation (PosV) but that diversity in CNS is nonrandom, with variants shared across different accessions. Using evolutionary analyses and chromatin accessibility data, we provide further evidence supporting roles for conserved and variable CNS in gene regulation. Additionally, our data suggests that transposable elements contribute to CNS variation. Characterizing species-level diversity in all functional genomic sequences may later uncover previously unknown mechanistic links between genotype and phenotype. Oxford University Press 2021-02-10 /pmc/articles/PMC8233505/ /pubmed/33565589 http://dx.doi.org/10.1093/molbev/msab042 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Discoveries
Yocca, Alan E.
Lu, Zefu
Schmitz, Robert J.
Freeling, Michael
Edger, Patrick P.
Evolution of Conserved Noncoding Sequences in Arabidopsis thaliana
title Evolution of Conserved Noncoding Sequences in Arabidopsis thaliana
title_full Evolution of Conserved Noncoding Sequences in Arabidopsis thaliana
title_fullStr Evolution of Conserved Noncoding Sequences in Arabidopsis thaliana
title_full_unstemmed Evolution of Conserved Noncoding Sequences in Arabidopsis thaliana
title_short Evolution of Conserved Noncoding Sequences in Arabidopsis thaliana
title_sort evolution of conserved noncoding sequences in arabidopsis thaliana
topic Discoveries
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8233505/
https://www.ncbi.nlm.nih.gov/pubmed/33565589
http://dx.doi.org/10.1093/molbev/msab042
work_keys_str_mv AT yoccaalane evolutionofconservednoncodingsequencesinarabidopsisthaliana
AT luzefu evolutionofconservednoncodingsequencesinarabidopsisthaliana
AT schmitzrobertj evolutionofconservednoncodingsequencesinarabidopsisthaliana
AT freelingmichael evolutionofconservednoncodingsequencesinarabidopsisthaliana
AT edgerpatrickp evolutionofconservednoncodingsequencesinarabidopsisthaliana