Cargando…

Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes

Conserved noncoding sequences (CNSs) are evolutionarily conserved DNA sequences that do not encode proteins but may have potential regulatory roles in gene expression. CNS in crop genomes could be linked to many important agronomic traits and ecological adaptations. Compared with the relatively matu...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Pingping, Saqib, Hafiz Sohaib Ahmed, Zhang, Xingtan, Zhang, Liangsheng, Tang, Haibao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5798027/
https://www.ncbi.nlm.nih.gov/pubmed/29378032
http://dx.doi.org/10.1093/gbe/evy006
_version_ 1783297794631532544
author Liang, Pingping
Saqib, Hafiz Sohaib Ahmed
Zhang, Xingtan
Zhang, Liangsheng
Tang, Haibao
author_facet Liang, Pingping
Saqib, Hafiz Sohaib Ahmed
Zhang, Xingtan
Zhang, Liangsheng
Tang, Haibao
author_sort Liang, Pingping
collection PubMed
description Conserved noncoding sequences (CNSs) are evolutionarily conserved DNA sequences that do not encode proteins but may have potential regulatory roles in gene expression. CNS in crop genomes could be linked to many important agronomic traits and ecological adaptations. Compared with the relatively mature exon annotation protocols, efficient methods are lacking to predict the location of noncoding sequences in the plant genomes. We implemented a computational pipeline that is tailored to the comparisons of plant genomes, yielding a large number of conserved sequences using rice genome as the reference. In this study, we used 17 published grass genomes, along with five monocot genomes as well as the basal angiosperm genome of Amborella trichopoda. Genome alignments among these genomes suggest that at least 12.05% of the rice genome appears to be evolving under constraints in the Poaceae lineage, with close to half of the evolutionarily constrained sequences located outside protein-coding regions. We found evidence for purifying selection acting on the conserved sequences by analyzing segregating SNPs within the rice population. Furthermore, we found that known functional motifs were significantly enriched within CNS, with many motifs associated with the preferred binding of ubiquitous transcription factors. The conserved elements that we have curated are accessible through our public database and the JBrowse server. In-depth functional annotations and evolutionary dynamics of the identified conserved sequences provide a solid foundation for studying gene regulation, genome evolution, as well as to inform gene isolation for cereal biologists.
format Online
Article
Text
id pubmed-5798027
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57980272018-02-08 Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes Liang, Pingping Saqib, Hafiz Sohaib Ahmed Zhang, Xingtan Zhang, Liangsheng Tang, Haibao Genome Biol Evol Research Article Conserved noncoding sequences (CNSs) are evolutionarily conserved DNA sequences that do not encode proteins but may have potential regulatory roles in gene expression. CNS in crop genomes could be linked to many important agronomic traits and ecological adaptations. Compared with the relatively mature exon annotation protocols, efficient methods are lacking to predict the location of noncoding sequences in the plant genomes. We implemented a computational pipeline that is tailored to the comparisons of plant genomes, yielding a large number of conserved sequences using rice genome as the reference. In this study, we used 17 published grass genomes, along with five monocot genomes as well as the basal angiosperm genome of Amborella trichopoda. Genome alignments among these genomes suggest that at least 12.05% of the rice genome appears to be evolving under constraints in the Poaceae lineage, with close to half of the evolutionarily constrained sequences located outside protein-coding regions. We found evidence for purifying selection acting on the conserved sequences by analyzing segregating SNPs within the rice population. Furthermore, we found that known functional motifs were significantly enriched within CNS, with many motifs associated with the preferred binding of ubiquitous transcription factors. The conserved elements that we have curated are accessible through our public database and the JBrowse server. In-depth functional annotations and evolutionary dynamics of the identified conserved sequences provide a solid foundation for studying gene regulation, genome evolution, as well as to inform gene isolation for cereal biologists. Oxford University Press 2018-01-25 /pmc/articles/PMC5798027/ /pubmed/29378032 http://dx.doi.org/10.1093/gbe/evy006 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Liang, Pingping
Saqib, Hafiz Sohaib Ahmed
Zhang, Xingtan
Zhang, Liangsheng
Tang, Haibao
Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes
title Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes
title_full Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes
title_fullStr Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes
title_full_unstemmed Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes
title_short Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes
title_sort single-base resolution map of evolutionary constraints and annotation of conserved elements across major grass genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5798027/
https://www.ncbi.nlm.nih.gov/pubmed/29378032
http://dx.doi.org/10.1093/gbe/evy006
work_keys_str_mv AT liangpingping singlebaseresolutionmapofevolutionaryconstraintsandannotationofconservedelementsacrossmajorgrassgenomes
AT saqibhafizsohaibahmed singlebaseresolutionmapofevolutionaryconstraintsandannotationofconservedelementsacrossmajorgrassgenomes
AT zhangxingtan singlebaseresolutionmapofevolutionaryconstraintsandannotationofconservedelementsacrossmajorgrassgenomes
AT zhangliangsheng singlebaseresolutionmapofevolutionaryconstraintsandannotationofconservedelementsacrossmajorgrassgenomes
AT tanghaibao singlebaseresolutionmapofevolutionaryconstraintsandannotationofconservedelementsacrossmajorgrassgenomes