Cargando…
Large-scale analysis of branchpoint usage across species and cell lines
The coding sequence of each human pre-mRNA is interrupted, on average, by 11 introns that must be spliced out for proper gene expression. Each intron contains three obligate signals: a 5′ splice site, a branch site, and a 3′ splice site. Splice site usage has been mapped exhaustively across differen...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5378181/ https://www.ncbi.nlm.nih.gov/pubmed/28119336 http://dx.doi.org/10.1101/gr.202820.115 |
_version_ | 1782519406902902784 |
---|---|
author | Taggart, Allison J. Lin, Chien-Ling Shrestha, Barsha Heintzelman, Claire Kim, Seongwon Fairbrother, William G. |
author_facet | Taggart, Allison J. Lin, Chien-Ling Shrestha, Barsha Heintzelman, Claire Kim, Seongwon Fairbrother, William G. |
author_sort | Taggart, Allison J. |
collection | PubMed |
description | The coding sequence of each human pre-mRNA is interrupted, on average, by 11 introns that must be spliced out for proper gene expression. Each intron contains three obligate signals: a 5′ splice site, a branch site, and a 3′ splice site. Splice site usage has been mapped exhaustively across different species, cell types, and cellular states. In contrast, only a small fraction of branch sites have been identified even once. The few reported annotations of branch site are imprecise as reverse transcriptase skips several nucleotides while traversing a 2–5 linkage. Here, we report large-scale mapping of the branchpoints from deep sequencing data in three different species and in the SF3B1 K700E oncogenic mutant background. We have developed a novel method whereby raw lariat reads are refined by U2snRNP/pre-mRNA base-pairing models to return the largest current data set of branchpoint sequences with quality metrics. This analysis discovers novel modes of U2snRNA:pre-mRNA base-pairing conserved in yeast and provides insight into the biogenesis of intron circles. Finally, matching branch site usage with isoform selection across the extensive panel of ENCODE RNA-seq data sets offers insight into the mechanisms by which branchpoint usage drives alternative splicing. |
format | Online Article Text |
id | pubmed-5378181 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-53781812017-10-01 Large-scale analysis of branchpoint usage across species and cell lines Taggart, Allison J. Lin, Chien-Ling Shrestha, Barsha Heintzelman, Claire Kim, Seongwon Fairbrother, William G. Genome Res Resource The coding sequence of each human pre-mRNA is interrupted, on average, by 11 introns that must be spliced out for proper gene expression. Each intron contains three obligate signals: a 5′ splice site, a branch site, and a 3′ splice site. Splice site usage has been mapped exhaustively across different species, cell types, and cellular states. In contrast, only a small fraction of branch sites have been identified even once. The few reported annotations of branch site are imprecise as reverse transcriptase skips several nucleotides while traversing a 2–5 linkage. Here, we report large-scale mapping of the branchpoints from deep sequencing data in three different species and in the SF3B1 K700E oncogenic mutant background. We have developed a novel method whereby raw lariat reads are refined by U2snRNP/pre-mRNA base-pairing models to return the largest current data set of branchpoint sequences with quality metrics. This analysis discovers novel modes of U2snRNA:pre-mRNA base-pairing conserved in yeast and provides insight into the biogenesis of intron circles. Finally, matching branch site usage with isoform selection across the extensive panel of ENCODE RNA-seq data sets offers insight into the mechanisms by which branchpoint usage drives alternative splicing. Cold Spring Harbor Laboratory Press 2017-04 /pmc/articles/PMC5378181/ /pubmed/28119336 http://dx.doi.org/10.1101/gr.202820.115 Text en © 2017 Taggart et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Resource Taggart, Allison J. Lin, Chien-Ling Shrestha, Barsha Heintzelman, Claire Kim, Seongwon Fairbrother, William G. Large-scale analysis of branchpoint usage across species and cell lines |
title | Large-scale analysis of branchpoint usage across species and cell lines |
title_full | Large-scale analysis of branchpoint usage across species and cell lines |
title_fullStr | Large-scale analysis of branchpoint usage across species and cell lines |
title_full_unstemmed | Large-scale analysis of branchpoint usage across species and cell lines |
title_short | Large-scale analysis of branchpoint usage across species and cell lines |
title_sort | large-scale analysis of branchpoint usage across species and cell lines |
topic | Resource |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5378181/ https://www.ncbi.nlm.nih.gov/pubmed/28119336 http://dx.doi.org/10.1101/gr.202820.115 |
work_keys_str_mv | AT taggartallisonj largescaleanalysisofbranchpointusageacrossspeciesandcelllines AT linchienling largescaleanalysisofbranchpointusageacrossspeciesandcelllines AT shresthabarsha largescaleanalysisofbranchpointusageacrossspeciesandcelllines AT heintzelmanclaire largescaleanalysisofbranchpointusageacrossspeciesandcelllines AT kimseongwon largescaleanalysisofbranchpointusageacrossspeciesandcelllines AT fairbrotherwilliamg largescaleanalysisofbranchpointusageacrossspeciesandcelllines |