Cargando…

Prediction of clustered RNA-binding protein motif sites in the mammalian genome

Sequence-specific interactions of RNA-binding proteins (RBPs) with their target transcripts are essential for post-transcriptional gene expression regulation in mammals. However, accurate prediction of RBP motif sites has been difficult because many RBPs recognize short and degenerate sequences. Her...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Chaolin, Lee, Kuang-Yung, Swanson, Maurice S., Darnell, Robert B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3737533/
https://www.ncbi.nlm.nih.gov/pubmed/23685613
http://dx.doi.org/10.1093/nar/gkt421
_version_ 1782279874326560768
author Zhang, Chaolin
Lee, Kuang-Yung
Swanson, Maurice S.
Darnell, Robert B.
author_facet Zhang, Chaolin
Lee, Kuang-Yung
Swanson, Maurice S.
Darnell, Robert B.
author_sort Zhang, Chaolin
collection PubMed
description Sequence-specific interactions of RNA-binding proteins (RBPs) with their target transcripts are essential for post-transcriptional gene expression regulation in mammals. However, accurate prediction of RBP motif sites has been difficult because many RBPs recognize short and degenerate sequences. Here we describe a hidden Markov model (HMM)-based algorithm mCarts to predict clustered functional RBP-binding sites by effectively integrating the number and spacing of individual motif sites, their accessibility in local RNA secondary structures and cross-species conservation. This algorithm learns and quantifies rules of these features, taking advantage of a large number of in vivo RBP-binding sites obtained from cross-linking and immunoprecipitation data. We applied this algorithm to study two representative RBP families, Nova and Mbnl, which regulate tissue-specific alternative splicing through interacting with clustered YCAY and YGCY elements, respectively, and predicted their binding sites in the mouse transcriptome. Despite the low information content in individual motif elements, our algorithm made specific predictions for successful experimental validation. Analysis of predicted sites also revealed cases of extensive and distal RBP-binding sites important for splicing regulation. This algorithm can be readily applied to other RBPs to infer their RNA-regulatory networks. The software is freely available at http://zhanglab.c2b2.columbia.edu/index.php/MCarts.
format Online
Article
Text
id pubmed-3737533
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-37375332013-08-08 Prediction of clustered RNA-binding protein motif sites in the mammalian genome Zhang, Chaolin Lee, Kuang-Yung Swanson, Maurice S. Darnell, Robert B. Nucleic Acids Res Computational Biology Sequence-specific interactions of RNA-binding proteins (RBPs) with their target transcripts are essential for post-transcriptional gene expression regulation in mammals. However, accurate prediction of RBP motif sites has been difficult because many RBPs recognize short and degenerate sequences. Here we describe a hidden Markov model (HMM)-based algorithm mCarts to predict clustered functional RBP-binding sites by effectively integrating the number and spacing of individual motif sites, their accessibility in local RNA secondary structures and cross-species conservation. This algorithm learns and quantifies rules of these features, taking advantage of a large number of in vivo RBP-binding sites obtained from cross-linking and immunoprecipitation data. We applied this algorithm to study two representative RBP families, Nova and Mbnl, which regulate tissue-specific alternative splicing through interacting with clustered YCAY and YGCY elements, respectively, and predicted their binding sites in the mouse transcriptome. Despite the low information content in individual motif elements, our algorithm made specific predictions for successful experimental validation. Analysis of predicted sites also revealed cases of extensive and distal RBP-binding sites important for splicing regulation. This algorithm can be readily applied to other RBPs to infer their RNA-regulatory networks. The software is freely available at http://zhanglab.c2b2.columbia.edu/index.php/MCarts. Oxford University Press 2013-08 2013-05-18 /pmc/articles/PMC3737533/ /pubmed/23685613 http://dx.doi.org/10.1093/nar/gkt421 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Zhang, Chaolin
Lee, Kuang-Yung
Swanson, Maurice S.
Darnell, Robert B.
Prediction of clustered RNA-binding protein motif sites in the mammalian genome
title Prediction of clustered RNA-binding protein motif sites in the mammalian genome
title_full Prediction of clustered RNA-binding protein motif sites in the mammalian genome
title_fullStr Prediction of clustered RNA-binding protein motif sites in the mammalian genome
title_full_unstemmed Prediction of clustered RNA-binding protein motif sites in the mammalian genome
title_short Prediction of clustered RNA-binding protein motif sites in the mammalian genome
title_sort prediction of clustered rna-binding protein motif sites in the mammalian genome
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3737533/
https://www.ncbi.nlm.nih.gov/pubmed/23685613
http://dx.doi.org/10.1093/nar/gkt421
work_keys_str_mv AT zhangchaolin predictionofclusteredrnabindingproteinmotifsitesinthemammaliangenome
AT leekuangyung predictionofclusteredrnabindingproteinmotifsitesinthemammaliangenome
AT swansonmaurices predictionofclusteredrnabindingproteinmotifsitesinthemammaliangenome
AT darnellrobertb predictionofclusteredrnabindingproteinmotifsitesinthemammaliangenome