Cargando…
Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes
Conserved noncoding sequences (CNSs) are evolutionarily conserved DNA sequences that do not encode proteins but may have potential regulatory roles in gene expression. CNS in crop genomes could be linked to many important agronomic traits and ecological adaptations. Compared with the relatively matu...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5798027/ https://www.ncbi.nlm.nih.gov/pubmed/29378032 http://dx.doi.org/10.1093/gbe/evy006 |
_version_ | 1783297794631532544 |
---|---|
author | Liang, Pingping Saqib, Hafiz Sohaib Ahmed Zhang, Xingtan Zhang, Liangsheng Tang, Haibao |
author_facet | Liang, Pingping Saqib, Hafiz Sohaib Ahmed Zhang, Xingtan Zhang, Liangsheng Tang, Haibao |
author_sort | Liang, Pingping |
collection | PubMed |
description | Conserved noncoding sequences (CNSs) are evolutionarily conserved DNA sequences that do not encode proteins but may have potential regulatory roles in gene expression. CNS in crop genomes could be linked to many important agronomic traits and ecological adaptations. Compared with the relatively mature exon annotation protocols, efficient methods are lacking to predict the location of noncoding sequences in the plant genomes. We implemented a computational pipeline that is tailored to the comparisons of plant genomes, yielding a large number of conserved sequences using rice genome as the reference. In this study, we used 17 published grass genomes, along with five monocot genomes as well as the basal angiosperm genome of Amborella trichopoda. Genome alignments among these genomes suggest that at least 12.05% of the rice genome appears to be evolving under constraints in the Poaceae lineage, with close to half of the evolutionarily constrained sequences located outside protein-coding regions. We found evidence for purifying selection acting on the conserved sequences by analyzing segregating SNPs within the rice population. Furthermore, we found that known functional motifs were significantly enriched within CNS, with many motifs associated with the preferred binding of ubiquitous transcription factors. The conserved elements that we have curated are accessible through our public database and the JBrowse server. In-depth functional annotations and evolutionary dynamics of the identified conserved sequences provide a solid foundation for studying gene regulation, genome evolution, as well as to inform gene isolation for cereal biologists. |
format | Online Article Text |
id | pubmed-5798027 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-57980272018-02-08 Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes Liang, Pingping Saqib, Hafiz Sohaib Ahmed Zhang, Xingtan Zhang, Liangsheng Tang, Haibao Genome Biol Evol Research Article Conserved noncoding sequences (CNSs) are evolutionarily conserved DNA sequences that do not encode proteins but may have potential regulatory roles in gene expression. CNS in crop genomes could be linked to many important agronomic traits and ecological adaptations. Compared with the relatively mature exon annotation protocols, efficient methods are lacking to predict the location of noncoding sequences in the plant genomes. We implemented a computational pipeline that is tailored to the comparisons of plant genomes, yielding a large number of conserved sequences using rice genome as the reference. In this study, we used 17 published grass genomes, along with five monocot genomes as well as the basal angiosperm genome of Amborella trichopoda. Genome alignments among these genomes suggest that at least 12.05% of the rice genome appears to be evolving under constraints in the Poaceae lineage, with close to half of the evolutionarily constrained sequences located outside protein-coding regions. We found evidence for purifying selection acting on the conserved sequences by analyzing segregating SNPs within the rice population. Furthermore, we found that known functional motifs were significantly enriched within CNS, with many motifs associated with the preferred binding of ubiquitous transcription factors. The conserved elements that we have curated are accessible through our public database and the JBrowse server. In-depth functional annotations and evolutionary dynamics of the identified conserved sequences provide a solid foundation for studying gene regulation, genome evolution, as well as to inform gene isolation for cereal biologists. Oxford University Press 2018-01-25 /pmc/articles/PMC5798027/ /pubmed/29378032 http://dx.doi.org/10.1093/gbe/evy006 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Liang, Pingping Saqib, Hafiz Sohaib Ahmed Zhang, Xingtan Zhang, Liangsheng Tang, Haibao Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes |
title | Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes |
title_full | Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes |
title_fullStr | Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes |
title_full_unstemmed | Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes |
title_short | Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes |
title_sort | single-base resolution map of evolutionary constraints and annotation of conserved elements across major grass genomes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5798027/ https://www.ncbi.nlm.nih.gov/pubmed/29378032 http://dx.doi.org/10.1093/gbe/evy006 |
work_keys_str_mv | AT liangpingping singlebaseresolutionmapofevolutionaryconstraintsandannotationofconservedelementsacrossmajorgrassgenomes AT saqibhafizsohaibahmed singlebaseresolutionmapofevolutionaryconstraintsandannotationofconservedelementsacrossmajorgrassgenomes AT zhangxingtan singlebaseresolutionmapofevolutionaryconstraintsandannotationofconservedelementsacrossmajorgrassgenomes AT zhangliangsheng singlebaseresolutionmapofevolutionaryconstraintsandannotationofconservedelementsacrossmajorgrassgenomes AT tanghaibao singlebaseresolutionmapofevolutionaryconstraintsandannotationofconservedelementsacrossmajorgrassgenomes |