Cargando…

Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs)

Instances of highly conserved plant short interspersed nuclear element (SINE) families and their enrichment near genes have been well documented, but little is known about the general patterns of such conservation and enrichment and underlying mechanisms. Here, we perform a comprehensive investigati...

Descripción completa

Detalles Bibliográficos
Autores principales: Mao, Hongliang, Wang, Hao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5585668/
https://www.ncbi.nlm.nih.gov/pubmed/28903462
http://dx.doi.org/10.1093/gbe/evx145
_version_ 1783261679180578816
author Mao, Hongliang
Wang, Hao
author_facet Mao, Hongliang
Wang, Hao
author_sort Mao, Hongliang
collection PubMed
description Instances of highly conserved plant short interspersed nuclear element (SINE) families and their enrichment near genes have been well documented, but little is known about the general patterns of such conservation and enrichment and underlying mechanisms. Here, we perform a comprehensive investigation of the structure, distribution, and evolution of SINEs in the grass family by analyzing 14 grass and 5 other flowering plant genomes using comparative genomics methods. We identify 61 SINE families composed of 29,572 copies, in which 46 families are first described. We find that comparing with other grass TEs, grass SINEs show much higher level of conservation in terms of genomic retention: The origin of at least 26% families can be traced to early grass diversification and these families are among most abundant SINE families in 86% species. We find that these families show much higher level of enrichment near protein coding genes than families of relatively recent origin (51%:28%), and that 40% of all grass SINEs are near gene and the percentage is higher than other types of grass TEs. The pattern of enrichment suggests that differential removal of SINE copies in gene-poor regions plays an important role in shaping the genomic distribution of these elements. We also identify a sequence motif located at 3′ SINE end which is shared in 17 families. In short, this study provides insights into structure and evolution of SINEs in the grass family.
format Online
Article
Text
id pubmed-5585668
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-55856682017-09-11 Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs) Mao, Hongliang Wang, Hao Genome Biol Evol Research Article Instances of highly conserved plant short interspersed nuclear element (SINE) families and their enrichment near genes have been well documented, but little is known about the general patterns of such conservation and enrichment and underlying mechanisms. Here, we perform a comprehensive investigation of the structure, distribution, and evolution of SINEs in the grass family by analyzing 14 grass and 5 other flowering plant genomes using comparative genomics methods. We identify 61 SINE families composed of 29,572 copies, in which 46 families are first described. We find that comparing with other grass TEs, grass SINEs show much higher level of conservation in terms of genomic retention: The origin of at least 26% families can be traced to early grass diversification and these families are among most abundant SINE families in 86% species. We find that these families show much higher level of enrichment near protein coding genes than families of relatively recent origin (51%:28%), and that 40% of all grass SINEs are near gene and the percentage is higher than other types of grass TEs. The pattern of enrichment suggests that differential removal of SINE copies in gene-poor regions plays an important role in shaping the genomic distribution of these elements. We also identify a sequence motif located at 3′ SINE end which is shared in 17 families. In short, this study provides insights into structure and evolution of SINEs in the grass family. Oxford University Press 2017-08-17 /pmc/articles/PMC5585668/ /pubmed/28903462 http://dx.doi.org/10.1093/gbe/evx145 Text en © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research Article
Mao, Hongliang
Wang, Hao
Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs)
title Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs)
title_full Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs)
title_fullStr Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs)
title_full_unstemmed Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs)
title_short Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs)
title_sort distribution, diversity, and long-term retention of grass short interspersed nuclear elements (sines)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5585668/
https://www.ncbi.nlm.nih.gov/pubmed/28903462
http://dx.doi.org/10.1093/gbe/evx145
work_keys_str_mv AT maohongliang distributiondiversityandlongtermretentionofgrassshortinterspersednuclearelementssines
AT wanghao distributiondiversityandlongtermretentionofgrassshortinterspersednuclearelementssines