Cargando…
SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome
Single-nucleotide polymorphisms (SNPs) are the most common form of genetic variation in humans and drive phenotypic variation. Due to evolutionary conservation, SNPs and indels (insertion and deletions) are depleted in functionally important sequence elements. Recently, population-scale sequencing e...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6461226/ https://www.ncbi.nlm.nih.gov/pubmed/30978217 http://dx.doi.org/10.1371/journal.pone.0214816 |
_version_ | 1783410464707837952 |
---|---|
author | Neininger, Kerstin Marschall, Tobias Helms, Volkhard |
author_facet | Neininger, Kerstin Marschall, Tobias Helms, Volkhard |
author_sort | Neininger, Kerstin |
collection | PubMed |
description | Single-nucleotide polymorphisms (SNPs) are the most common form of genetic variation in humans and drive phenotypic variation. Due to evolutionary conservation, SNPs and indels (insertion and deletions) are depleted in functionally important sequence elements. Recently, population-scale sequencing efforts such as the 1000 Genomes Project and the Genome of the Netherlands Project have catalogued large numbers of sequence variants. Here, we present a systematic analysis of the polymorphisms reported by these two projects in different coding and non-coding genomic elements of the human genome (intergenic regions, CpG islands, promoters, 5’ UTRs, coding exons, 3’ UTRs, introns, and intragenic regions). Furthermore, we were especially interested in the distribution of SNPs and indels in direct vicinity to the transcription start site (TSS) and translation start site (CSS). Thereby, we discovered an enrichment of dinucleotides CpG and CpA and an accumulation of SNPs at base position −1 relative to the TSS that involved primarily CpG and CpA dinucleotides. Genes having a CpG dinucleotide at TSS position -1 were enriched in the functional GO terms “Phosphoprotein”, “Alternative splicing”, and “Protein binding”. Focusing on the CSS, we compared SNP patterns in the flanking regions of canonical and alternative AUG and near-cognate start sites where we considered alternative starts previously identified by experimental ribosome profiling. We observed similar conservation patterns of canonical and alternative translation start sites, which underlines the importance of alternative translation mechanisms for cellular function. |
format | Online Article Text |
id | pubmed-6461226 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-64612262019-05-03 SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome Neininger, Kerstin Marschall, Tobias Helms, Volkhard PLoS One Research Article Single-nucleotide polymorphisms (SNPs) are the most common form of genetic variation in humans and drive phenotypic variation. Due to evolutionary conservation, SNPs and indels (insertion and deletions) are depleted in functionally important sequence elements. Recently, population-scale sequencing efforts such as the 1000 Genomes Project and the Genome of the Netherlands Project have catalogued large numbers of sequence variants. Here, we present a systematic analysis of the polymorphisms reported by these two projects in different coding and non-coding genomic elements of the human genome (intergenic regions, CpG islands, promoters, 5’ UTRs, coding exons, 3’ UTRs, introns, and intragenic regions). Furthermore, we were especially interested in the distribution of SNPs and indels in direct vicinity to the transcription start site (TSS) and translation start site (CSS). Thereby, we discovered an enrichment of dinucleotides CpG and CpA and an accumulation of SNPs at base position −1 relative to the TSS that involved primarily CpG and CpA dinucleotides. Genes having a CpG dinucleotide at TSS position -1 were enriched in the functional GO terms “Phosphoprotein”, “Alternative splicing”, and “Protein binding”. Focusing on the CSS, we compared SNP patterns in the flanking regions of canonical and alternative AUG and near-cognate start sites where we considered alternative starts previously identified by experimental ribosome profiling. We observed similar conservation patterns of canonical and alternative translation start sites, which underlines the importance of alternative translation mechanisms for cellular function. Public Library of Science 2019-04-12 /pmc/articles/PMC6461226/ /pubmed/30978217 http://dx.doi.org/10.1371/journal.pone.0214816 Text en © 2019 Neininger et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Neininger, Kerstin Marschall, Tobias Helms, Volkhard SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome |
title | SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome |
title_full | SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome |
title_fullStr | SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome |
title_full_unstemmed | SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome |
title_short | SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome |
title_sort | snp and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6461226/ https://www.ncbi.nlm.nih.gov/pubmed/30978217 http://dx.doi.org/10.1371/journal.pone.0214816 |
work_keys_str_mv | AT neiningerkerstin snpandindelfrequenciesattranscriptionstartsitesandatcanonicalandalternativetranslationinitiationsitesinthehumangenome AT marschalltobias snpandindelfrequenciesattranscriptionstartsitesandatcanonicalandalternativetranslationinitiationsitesinthehumangenome AT helmsvolkhard snpandindelfrequenciesattranscriptionstartsitesandatcanonicalandalternativetranslationinitiationsitesinthehumangenome |