Cargando…

SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome

Single-nucleotide polymorphisms (SNPs) are the most common form of genetic variation in humans and drive phenotypic variation. Due to evolutionary conservation, SNPs and indels (insertion and deletions) are depleted in functionally important sequence elements. Recently, population-scale sequencing e...

Descripción completa

Detalles Bibliográficos
Autores principales: Neininger, Kerstin, Marschall, Tobias, Helms, Volkhard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6461226/
https://www.ncbi.nlm.nih.gov/pubmed/30978217
http://dx.doi.org/10.1371/journal.pone.0214816
_version_ 1783410464707837952
author Neininger, Kerstin
Marschall, Tobias
Helms, Volkhard
author_facet Neininger, Kerstin
Marschall, Tobias
Helms, Volkhard
author_sort Neininger, Kerstin
collection PubMed
description Single-nucleotide polymorphisms (SNPs) are the most common form of genetic variation in humans and drive phenotypic variation. Due to evolutionary conservation, SNPs and indels (insertion and deletions) are depleted in functionally important sequence elements. Recently, population-scale sequencing efforts such as the 1000 Genomes Project and the Genome of the Netherlands Project have catalogued large numbers of sequence variants. Here, we present a systematic analysis of the polymorphisms reported by these two projects in different coding and non-coding genomic elements of the human genome (intergenic regions, CpG islands, promoters, 5’ UTRs, coding exons, 3’ UTRs, introns, and intragenic regions). Furthermore, we were especially interested in the distribution of SNPs and indels in direct vicinity to the transcription start site (TSS) and translation start site (CSS). Thereby, we discovered an enrichment of dinucleotides CpG and CpA and an accumulation of SNPs at base position −1 relative to the TSS that involved primarily CpG and CpA dinucleotides. Genes having a CpG dinucleotide at TSS position -1 were enriched in the functional GO terms “Phosphoprotein”, “Alternative splicing”, and “Protein binding”. Focusing on the CSS, we compared SNP patterns in the flanking regions of canonical and alternative AUG and near-cognate start sites where we considered alternative starts previously identified by experimental ribosome profiling. We observed similar conservation patterns of canonical and alternative translation start sites, which underlines the importance of alternative translation mechanisms for cellular function.
format Online
Article
Text
id pubmed-6461226
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-64612262019-05-03 SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome Neininger, Kerstin Marschall, Tobias Helms, Volkhard PLoS One Research Article Single-nucleotide polymorphisms (SNPs) are the most common form of genetic variation in humans and drive phenotypic variation. Due to evolutionary conservation, SNPs and indels (insertion and deletions) are depleted in functionally important sequence elements. Recently, population-scale sequencing efforts such as the 1000 Genomes Project and the Genome of the Netherlands Project have catalogued large numbers of sequence variants. Here, we present a systematic analysis of the polymorphisms reported by these two projects in different coding and non-coding genomic elements of the human genome (intergenic regions, CpG islands, promoters, 5’ UTRs, coding exons, 3’ UTRs, introns, and intragenic regions). Furthermore, we were especially interested in the distribution of SNPs and indels in direct vicinity to the transcription start site (TSS) and translation start site (CSS). Thereby, we discovered an enrichment of dinucleotides CpG and CpA and an accumulation of SNPs at base position −1 relative to the TSS that involved primarily CpG and CpA dinucleotides. Genes having a CpG dinucleotide at TSS position -1 were enriched in the functional GO terms “Phosphoprotein”, “Alternative splicing”, and “Protein binding”. Focusing on the CSS, we compared SNP patterns in the flanking regions of canonical and alternative AUG and near-cognate start sites where we considered alternative starts previously identified by experimental ribosome profiling. We observed similar conservation patterns of canonical and alternative translation start sites, which underlines the importance of alternative translation mechanisms for cellular function. Public Library of Science 2019-04-12 /pmc/articles/PMC6461226/ /pubmed/30978217 http://dx.doi.org/10.1371/journal.pone.0214816 Text en © 2019 Neininger et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Neininger, Kerstin
Marschall, Tobias
Helms, Volkhard
SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome
title SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome
title_full SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome
title_fullStr SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome
title_full_unstemmed SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome
title_short SNP and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome
title_sort snp and indel frequencies at transcription start sites and at canonical and alternative translation initiation sites in the human genome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6461226/
https://www.ncbi.nlm.nih.gov/pubmed/30978217
http://dx.doi.org/10.1371/journal.pone.0214816
work_keys_str_mv AT neiningerkerstin snpandindelfrequenciesattranscriptionstartsitesandatcanonicalandalternativetranslationinitiationsitesinthehumangenome
AT marschalltobias snpandindelfrequenciesattranscriptionstartsitesandatcanonicalandalternativetranslationinitiationsitesinthehumangenome
AT helmsvolkhard snpandindelfrequenciesattranscriptionstartsitesandatcanonicalandalternativetranslationinitiationsitesinthehumangenome