Cargando…

Nucleotide diversity analysis highlights functionally important genomic regions

We analyzed functionality and relative distribution of genetic variants across the complete Oryza sativa genome, using the 40 million single nucleotide polymorphisms (SNPs) dataset from the 3,000 Rice Genomes Project (http://snp-seek.irri.org), the largest and highest density SNP collection for any...

Descripción completa

Detalles Bibliográficos
Autores principales: Tatarinova, Tatiana V., Chekalin, Evgeny, Nikolsky, Yuri, Bruskin, Sergey, Chebotarov, Dmitry, McNally, Kenneth L., Alexandrov, Nickolai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5075931/
https://www.ncbi.nlm.nih.gov/pubmed/27774999
http://dx.doi.org/10.1038/srep35730
_version_ 1782461950935957504
author Tatarinova, Tatiana V.
Chekalin, Evgeny
Nikolsky, Yuri
Bruskin, Sergey
Chebotarov, Dmitry
McNally, Kenneth L.
Alexandrov, Nickolai
author_facet Tatarinova, Tatiana V.
Chekalin, Evgeny
Nikolsky, Yuri
Bruskin, Sergey
Chebotarov, Dmitry
McNally, Kenneth L.
Alexandrov, Nickolai
author_sort Tatarinova, Tatiana V.
collection PubMed
description We analyzed functionality and relative distribution of genetic variants across the complete Oryza sativa genome, using the 40 million single nucleotide polymorphisms (SNPs) dataset from the 3,000 Rice Genomes Project (http://snp-seek.irri.org), the largest and highest density SNP collection for any higher plant. We have shown that the DNA-binding transcription factors (TFs) are the most conserved group of genes, whereas kinases and membrane-localized transporters are the most variable ones. TFs may be conserved because they belong to some of the most connected regulatory hubs that modulate transcription of vast downstream gene networks, whereas signaling kinases and transporters need to adapt rapidly to changing environmental conditions. In general, the observed profound patterns of nucleotide variability reveal functionally important genomic regions. As expected, nucleotide diversity is much higher in intergenic regions than within gene bodies (regions spanning gene models), and protein-coding sequences are more conserved than untranslated gene regions. We have observed a sharp decline in nucleotide diversity that begins at about 250 nucleotides upstream of the transcription start and reaches minimal diversity exactly at the transcription start. We found the transcription termination sites to have remarkably symmetrical patterns of SNP density, implying presence of functional sites near transcription termination. Also, nucleotide diversity was significantly lower near 3′ UTRs, the area rich with regulatory regions.
format Online
Article
Text
id pubmed-5075931
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-50759312016-10-28 Nucleotide diversity analysis highlights functionally important genomic regions Tatarinova, Tatiana V. Chekalin, Evgeny Nikolsky, Yuri Bruskin, Sergey Chebotarov, Dmitry McNally, Kenneth L. Alexandrov, Nickolai Sci Rep Article We analyzed functionality and relative distribution of genetic variants across the complete Oryza sativa genome, using the 40 million single nucleotide polymorphisms (SNPs) dataset from the 3,000 Rice Genomes Project (http://snp-seek.irri.org), the largest and highest density SNP collection for any higher plant. We have shown that the DNA-binding transcription factors (TFs) are the most conserved group of genes, whereas kinases and membrane-localized transporters are the most variable ones. TFs may be conserved because they belong to some of the most connected regulatory hubs that modulate transcription of vast downstream gene networks, whereas signaling kinases and transporters need to adapt rapidly to changing environmental conditions. In general, the observed profound patterns of nucleotide variability reveal functionally important genomic regions. As expected, nucleotide diversity is much higher in intergenic regions than within gene bodies (regions spanning gene models), and protein-coding sequences are more conserved than untranslated gene regions. We have observed a sharp decline in nucleotide diversity that begins at about 250 nucleotides upstream of the transcription start and reaches minimal diversity exactly at the transcription start. We found the transcription termination sites to have remarkably symmetrical patterns of SNP density, implying presence of functional sites near transcription termination. Also, nucleotide diversity was significantly lower near 3′ UTRs, the area rich with regulatory regions. Nature Publishing Group 2016-10-24 /pmc/articles/PMC5075931/ /pubmed/27774999 http://dx.doi.org/10.1038/srep35730 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Tatarinova, Tatiana V.
Chekalin, Evgeny
Nikolsky, Yuri
Bruskin, Sergey
Chebotarov, Dmitry
McNally, Kenneth L.
Alexandrov, Nickolai
Nucleotide diversity analysis highlights functionally important genomic regions
title Nucleotide diversity analysis highlights functionally important genomic regions
title_full Nucleotide diversity analysis highlights functionally important genomic regions
title_fullStr Nucleotide diversity analysis highlights functionally important genomic regions
title_full_unstemmed Nucleotide diversity analysis highlights functionally important genomic regions
title_short Nucleotide diversity analysis highlights functionally important genomic regions
title_sort nucleotide diversity analysis highlights functionally important genomic regions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5075931/
https://www.ncbi.nlm.nih.gov/pubmed/27774999
http://dx.doi.org/10.1038/srep35730
work_keys_str_mv AT tatarinovatatianav nucleotidediversityanalysishighlightsfunctionallyimportantgenomicregions
AT chekalinevgeny nucleotidediversityanalysishighlightsfunctionallyimportantgenomicregions
AT nikolskyyuri nucleotidediversityanalysishighlightsfunctionallyimportantgenomicregions
AT bruskinsergey nucleotidediversityanalysishighlightsfunctionallyimportantgenomicregions
AT chebotarovdmitry nucleotidediversityanalysishighlightsfunctionallyimportantgenomicregions
AT mcnallykennethl nucleotidediversityanalysishighlightsfunctionallyimportantgenomicregions
AT alexandrovnickolai nucleotidediversityanalysishighlightsfunctionallyimportantgenomicregions