Cargando…

Genome-Wide Data-Mining of Candidate Human Splice Translational Efficiency Polymorphisms (STEPs) and an Online Database

BACKGROUND: Variation in pre-mRNA splicing is common and in some cases caused by genetic variants in intronic splicing motifs. Recent studies into the insulin gene (INS) discovered a polymorphism in a 5′ non-coding intron that influences the likelihood of intron retention in the final mRNA, extendin...

Descripción completa

Detalles Bibliográficos
Autores principales: Raistrick, Christopher A., Day, Ian N. M., Gaunt, Tom R.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2952627/
https://www.ncbi.nlm.nih.gov/pubmed/20948966
http://dx.doi.org/10.1371/journal.pone.0013340
_version_ 1782187795079495680
author Raistrick, Christopher A.
Day, Ian N. M.
Gaunt, Tom R.
author_facet Raistrick, Christopher A.
Day, Ian N. M.
Gaunt, Tom R.
author_sort Raistrick, Christopher A.
collection PubMed
description BACKGROUND: Variation in pre-mRNA splicing is common and in some cases caused by genetic variants in intronic splicing motifs. Recent studies into the insulin gene (INS) discovered a polymorphism in a 5′ non-coding intron that influences the likelihood of intron retention in the final mRNA, extending the 5′ untranslated region and maintaining protein quality. Retention was also associated with increased insulin levels, suggesting that such variants - splice translational efficiency polymorphisms (STEPs) - may relate to disease phenotypes through differential protein expression. We set out to explore the prevalence of STEPs in the human genome and validate this new category of protein quantitative trait loci (pQTL) using publicly available data. METHODOLOGY/PRINCIPAL FINDINGS: Gene transcript and variant data were collected and mined for candidate STEPs in motif regions. Sequences from transcripts containing potential STEPs were analysed for evidence of splice site recognition and an effect in expressed sequence tags (ESTs). 16 publicly released genome-wide association data sets of common diseases were searched for association to candidate polymorphisms with HapMap frequency data. Our study found 3324 candidate STEPs lying in motif sequences of 5′ non-coding introns and further mining revealed 170 with transcript evidence of intron retention. 21 potential STEPs had EST evidence of intron retention or exon extension, as well as population frequency data for comparison. CONCLUSIONS/SIGNIFICANCE: Results suggest that the insulin STEP was not a unique example and that many STEPs may occur genome-wide with potentially causal effects in complex disease. An online database of STEPs is freely accessible at http://dbstep.genes.org.uk/.
format Text
id pubmed-2952627
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-29526272010-10-14 Genome-Wide Data-Mining of Candidate Human Splice Translational Efficiency Polymorphisms (STEPs) and an Online Database Raistrick, Christopher A. Day, Ian N. M. Gaunt, Tom R. PLoS One Research Article BACKGROUND: Variation in pre-mRNA splicing is common and in some cases caused by genetic variants in intronic splicing motifs. Recent studies into the insulin gene (INS) discovered a polymorphism in a 5′ non-coding intron that influences the likelihood of intron retention in the final mRNA, extending the 5′ untranslated region and maintaining protein quality. Retention was also associated with increased insulin levels, suggesting that such variants - splice translational efficiency polymorphisms (STEPs) - may relate to disease phenotypes through differential protein expression. We set out to explore the prevalence of STEPs in the human genome and validate this new category of protein quantitative trait loci (pQTL) using publicly available data. METHODOLOGY/PRINCIPAL FINDINGS: Gene transcript and variant data were collected and mined for candidate STEPs in motif regions. Sequences from transcripts containing potential STEPs were analysed for evidence of splice site recognition and an effect in expressed sequence tags (ESTs). 16 publicly released genome-wide association data sets of common diseases were searched for association to candidate polymorphisms with HapMap frequency data. Our study found 3324 candidate STEPs lying in motif sequences of 5′ non-coding introns and further mining revealed 170 with transcript evidence of intron retention. 21 potential STEPs had EST evidence of intron retention or exon extension, as well as population frequency data for comparison. CONCLUSIONS/SIGNIFICANCE: Results suggest that the insulin STEP was not a unique example and that many STEPs may occur genome-wide with potentially causal effects in complex disease. An online database of STEPs is freely accessible at http://dbstep.genes.org.uk/. Public Library of Science 2010-10-11 /pmc/articles/PMC2952627/ /pubmed/20948966 http://dx.doi.org/10.1371/journal.pone.0013340 Text en Raistrick et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Raistrick, Christopher A.
Day, Ian N. M.
Gaunt, Tom R.
Genome-Wide Data-Mining of Candidate Human Splice Translational Efficiency Polymorphisms (STEPs) and an Online Database
title Genome-Wide Data-Mining of Candidate Human Splice Translational Efficiency Polymorphisms (STEPs) and an Online Database
title_full Genome-Wide Data-Mining of Candidate Human Splice Translational Efficiency Polymorphisms (STEPs) and an Online Database
title_fullStr Genome-Wide Data-Mining of Candidate Human Splice Translational Efficiency Polymorphisms (STEPs) and an Online Database
title_full_unstemmed Genome-Wide Data-Mining of Candidate Human Splice Translational Efficiency Polymorphisms (STEPs) and an Online Database
title_short Genome-Wide Data-Mining of Candidate Human Splice Translational Efficiency Polymorphisms (STEPs) and an Online Database
title_sort genome-wide data-mining of candidate human splice translational efficiency polymorphisms (steps) and an online database
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2952627/
https://www.ncbi.nlm.nih.gov/pubmed/20948966
http://dx.doi.org/10.1371/journal.pone.0013340
work_keys_str_mv AT raistrickchristophera genomewidedataminingofcandidatehumansplicetranslationalefficiencypolymorphismsstepsandanonlinedatabase
AT dayiannm genomewidedataminingofcandidatehumansplicetranslationalefficiencypolymorphismsstepsandanonlinedatabase
AT gaunttomr genomewidedataminingofcandidatehumansplicetranslationalefficiencypolymorphismsstepsandanonlinedatabase