Cargando…

SNPs Occur in Regions with Less Genomic Sequence Conservation

Rates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less...

Descripción completa

Detalles Bibliográficos
Autor principal: Castle, John C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3108954/
https://www.ncbi.nlm.nih.gov/pubmed/21674007
http://dx.doi.org/10.1371/journal.pone.0020660
_version_ 1782205392857595904
author Castle, John C.
author_facet Castle, John C.
author_sort Castle, John C.
collection PubMed
description Rates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less conserved regions and that functional regions have fewer SNPs. Results are confirmed using both mouse and human data. Regions include protein start codons, 3′ splice sites, 5′ splice sites, protein stop codons, predicted miRNA binding sites, and polyadenylation sites. Throughout, SNP rates are lower and conservation is higher at regulatory sites. Within coding regions, SNP rates are highest and conservation is lowest at codon position three and the fewest SNPs are found at codon position two, reflecting codon degeneracy for amino acid encoding. Exon splice sites show high conservation and very low SNP rates, reflecting both splicing signals and protein coding. Relaxed constraint on the codon third position is dramatically seen when separating exonic SNP rates based on intron phase. At polyadenylation sites, a peak of conservation and low SNP rate occurs from 30 to 17 nt preceding the site. This region is highly enriched for the sequence AAUAAA, reflecting the location of the conserved polyA signal. miRNA 3′ UTR target sites are predicted incorporating interspecies genomic sequence conservation; SNP rates are low in these sites, again showing fewer SNPs in conserved regions. Together, these results confirm that SNPs, reflecting recent genetic variation, occur more frequently in regions with less evolutionarily conservation.
format Online
Article
Text
id pubmed-3108954
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31089542011-06-13 SNPs Occur in Regions with Less Genomic Sequence Conservation Castle, John C. PLoS One Research Article Rates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less conserved regions and that functional regions have fewer SNPs. Results are confirmed using both mouse and human data. Regions include protein start codons, 3′ splice sites, 5′ splice sites, protein stop codons, predicted miRNA binding sites, and polyadenylation sites. Throughout, SNP rates are lower and conservation is higher at regulatory sites. Within coding regions, SNP rates are highest and conservation is lowest at codon position three and the fewest SNPs are found at codon position two, reflecting codon degeneracy for amino acid encoding. Exon splice sites show high conservation and very low SNP rates, reflecting both splicing signals and protein coding. Relaxed constraint on the codon third position is dramatically seen when separating exonic SNP rates based on intron phase. At polyadenylation sites, a peak of conservation and low SNP rate occurs from 30 to 17 nt preceding the site. This region is highly enriched for the sequence AAUAAA, reflecting the location of the conserved polyA signal. miRNA 3′ UTR target sites are predicted incorporating interspecies genomic sequence conservation; SNP rates are low in these sites, again showing fewer SNPs in conserved regions. Together, these results confirm that SNPs, reflecting recent genetic variation, occur more frequently in regions with less evolutionarily conservation. Public Library of Science 2011-06-06 /pmc/articles/PMC3108954/ /pubmed/21674007 http://dx.doi.org/10.1371/journal.pone.0020660 Text en John C. Castle. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Castle, John C.
SNPs Occur in Regions with Less Genomic Sequence Conservation
title SNPs Occur in Regions with Less Genomic Sequence Conservation
title_full SNPs Occur in Regions with Less Genomic Sequence Conservation
title_fullStr SNPs Occur in Regions with Less Genomic Sequence Conservation
title_full_unstemmed SNPs Occur in Regions with Less Genomic Sequence Conservation
title_short SNPs Occur in Regions with Less Genomic Sequence Conservation
title_sort snps occur in regions with less genomic sequence conservation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3108954/
https://www.ncbi.nlm.nih.gov/pubmed/21674007
http://dx.doi.org/10.1371/journal.pone.0020660
work_keys_str_mv AT castlejohnc snpsoccurinregionswithlessgenomicsequenceconservation