Cargando…

Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation

Motivation: Pre-mRNA cleavage and polyadenylation are essential steps for 3′-end maturation and subsequent stability and degradation of mRNAs. This process is highly controlled by cis-regulatory elements surrounding the cleavage/polyadenylation sites (polyA sites), which are frequently constrained b...

Descripción completa

Detalles Bibliográficos
Autores principales: Hafez, Dina, Ni, Ting, Mukherjee, Sayan, Zhu, Jun, Ohler, Uwe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694680/
https://www.ncbi.nlm.nih.gov/pubmed/23812974
http://dx.doi.org/10.1093/bioinformatics/btt233
_version_ 1782274887921958912
author Hafez, Dina
Ni, Ting
Mukherjee, Sayan
Zhu, Jun
Ohler, Uwe
author_facet Hafez, Dina
Ni, Ting
Mukherjee, Sayan
Zhu, Jun
Ohler, Uwe
author_sort Hafez, Dina
collection PubMed
description Motivation: Pre-mRNA cleavage and polyadenylation are essential steps for 3′-end maturation and subsequent stability and degradation of mRNAs. This process is highly controlled by cis-regulatory elements surrounding the cleavage/polyadenylation sites (polyA sites), which are frequently constrained by sequence content and position. More than 50% of human transcripts have multiple functional polyA sites, and the specific use of alternative polyA sites (APA) results in isoforms with variable 3′-untranslated regions, thus potentially affecting gene regulation. Elucidating the regulatory mechanisms underlying differential polyA preferences in multiple cell types has been hindered both by the lack of suitable data on the precise location of cleavage sites, as well as of appropriate tests for determining APAs with significant differences across multiple libraries. Results: We applied a tailored paired-end RNA-seq protocol to specifically probe the position of polyA sites in three human adult tissue types. We specified a linear-effects regression model to identify tissue-specific biases indicating regulated APA; the significance of differences between tissue types was assessed by an appropriately designed permutation test. This combination allowed to identify highly specific subsets of APA events in the individual tissue types. Predictive models successfully classified constitutive polyA sites from a biologically relevant background (auROC = 99.6%), as well as tissue-specific regulated sets from each other. We found that the main cis-regulatory elements described for polyadenylation are a strong, and highly informative, hallmark for constitutive sites only. Tissue-specific regulated sites were found to contain other regulatory motifs, with the canonical polyadenylation signal being nearly absent at brain-specific polyA sites. Together, our results contribute to the understanding of the diversity of post-transcriptional gene regulation. Availability: Raw data are deposited on SRA, accession numbers: brain SRX208132, kidney SRX208087 and liver SRX208134. Processed datasets as well as model code are published on our website: http://www.genome.duke.edu/labs/ohler/research/UTR/ Contact: uwe.ohler@duke.edu
format Online
Article
Text
id pubmed-3694680
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36946802013-06-27 Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation Hafez, Dina Ni, Ting Mukherjee, Sayan Zhu, Jun Ohler, Uwe Bioinformatics Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Motivation: Pre-mRNA cleavage and polyadenylation are essential steps for 3′-end maturation and subsequent stability and degradation of mRNAs. This process is highly controlled by cis-regulatory elements surrounding the cleavage/polyadenylation sites (polyA sites), which are frequently constrained by sequence content and position. More than 50% of human transcripts have multiple functional polyA sites, and the specific use of alternative polyA sites (APA) results in isoforms with variable 3′-untranslated regions, thus potentially affecting gene regulation. Elucidating the regulatory mechanisms underlying differential polyA preferences in multiple cell types has been hindered both by the lack of suitable data on the precise location of cleavage sites, as well as of appropriate tests for determining APAs with significant differences across multiple libraries. Results: We applied a tailored paired-end RNA-seq protocol to specifically probe the position of polyA sites in three human adult tissue types. We specified a linear-effects regression model to identify tissue-specific biases indicating regulated APA; the significance of differences between tissue types was assessed by an appropriately designed permutation test. This combination allowed to identify highly specific subsets of APA events in the individual tissue types. Predictive models successfully classified constitutive polyA sites from a biologically relevant background (auROC = 99.6%), as well as tissue-specific regulated sets from each other. We found that the main cis-regulatory elements described for polyadenylation are a strong, and highly informative, hallmark for constitutive sites only. Tissue-specific regulated sites were found to contain other regulatory motifs, with the canonical polyadenylation signal being nearly absent at brain-specific polyA sites. Together, our results contribute to the understanding of the diversity of post-transcriptional gene regulation. Availability: Raw data are deposited on SRA, accession numbers: brain SRX208132, kidney SRX208087 and liver SRX208134. Processed datasets as well as model code are published on our website: http://www.genome.duke.edu/labs/ohler/research/UTR/ Contact: uwe.ohler@duke.edu Oxford University Press 2013-07-01 2013-06-19 /pmc/articles/PMC3694680/ /pubmed/23812974 http://dx.doi.org/10.1093/bioinformatics/btt233 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
Hafez, Dina
Ni, Ting
Mukherjee, Sayan
Zhu, Jun
Ohler, Uwe
Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation
title Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation
title_full Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation
title_fullStr Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation
title_full_unstemmed Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation
title_short Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation
title_sort genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation
topic Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694680/
https://www.ncbi.nlm.nih.gov/pubmed/23812974
http://dx.doi.org/10.1093/bioinformatics/btt233
work_keys_str_mv AT hafezdina genomewideidentificationandpredictivemodelingoftissuespecificalternativepolyadenylation
AT niting genomewideidentificationandpredictivemodelingoftissuespecificalternativepolyadenylation
AT mukherjeesayan genomewideidentificationandpredictivemodelingoftissuespecificalternativepolyadenylation
AT zhujun genomewideidentificationandpredictivemodelingoftissuespecificalternativepolyadenylation
AT ohleruwe genomewideidentificationandpredictivemodelingoftissuespecificalternativepolyadenylation