Cargando…

Leveraging cross-link modification events in CLIP-seq for motif discovery

High-throughput protein–RNA interaction data generated by CLIP-seq has provided an unprecedented depth of access to the activities of RNA-binding proteins (RBPs), the key players in co- and post-transcriptional regulation of gene expression. Motif discovery forms part of the necessary follow-up data...

Descripción completa

Detalles Bibliográficos
Autores principales: Bahrami-Samani, Emad, Penalva, Luiz O.F., Smith, Andrew D., Uren, Philip J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288180/
https://www.ncbi.nlm.nih.gov/pubmed/25505146
http://dx.doi.org/10.1093/nar/gku1288
_version_ 1782351924263124992
author Bahrami-Samani, Emad
Penalva, Luiz O.F.
Smith, Andrew D.
Uren, Philip J.
author_facet Bahrami-Samani, Emad
Penalva, Luiz O.F.
Smith, Andrew D.
Uren, Philip J.
author_sort Bahrami-Samani, Emad
collection PubMed
description High-throughput protein–RNA interaction data generated by CLIP-seq has provided an unprecedented depth of access to the activities of RNA-binding proteins (RBPs), the key players in co- and post-transcriptional regulation of gene expression. Motif discovery forms part of the necessary follow-up data analysis for CLIP-seq, both to refine the exact locations of RBP binding sites, and to characterize them. The specific properties of RBP binding sites, and the CLIP-seq methods, provide additional information not usually present in the classic motif discovery problem: the binding site structure, and cross-linking induced events in reads. We show that CLIP-seq data contains clear secondary structure signals, as well as technology- and RBP-specific cross-link signals. We introduce Zagros, a motif discovery algorithm specifically designed to leverage this information and explore its impact on the quality of recovered motifs. Our results indicate that using both secondary structure and cross-link modifications can greatly improve motif discovery on CLIP-seq data. Further, the motifs we recover provide insight into the balance between sequence- and structure-specificity struck by RBP binding.
format Online
Article
Text
id pubmed-4288180
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-42881802015-02-19 Leveraging cross-link modification events in CLIP-seq for motif discovery Bahrami-Samani, Emad Penalva, Luiz O.F. Smith, Andrew D. Uren, Philip J. Nucleic Acids Res Computational Biology High-throughput protein–RNA interaction data generated by CLIP-seq has provided an unprecedented depth of access to the activities of RNA-binding proteins (RBPs), the key players in co- and post-transcriptional regulation of gene expression. Motif discovery forms part of the necessary follow-up data analysis for CLIP-seq, both to refine the exact locations of RBP binding sites, and to characterize them. The specific properties of RBP binding sites, and the CLIP-seq methods, provide additional information not usually present in the classic motif discovery problem: the binding site structure, and cross-linking induced events in reads. We show that CLIP-seq data contains clear secondary structure signals, as well as technology- and RBP-specific cross-link signals. We introduce Zagros, a motif discovery algorithm specifically designed to leverage this information and explore its impact on the quality of recovered motifs. Our results indicate that using both secondary structure and cross-link modifications can greatly improve motif discovery on CLIP-seq data. Further, the motifs we recover provide insight into the balance between sequence- and structure-specificity struck by RBP binding. Oxford University Press 2015-01-09 2014-12-10 /pmc/articles/PMC4288180/ /pubmed/25505146 http://dx.doi.org/10.1093/nar/gku1288 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Bahrami-Samani, Emad
Penalva, Luiz O.F.
Smith, Andrew D.
Uren, Philip J.
Leveraging cross-link modification events in CLIP-seq for motif discovery
title Leveraging cross-link modification events in CLIP-seq for motif discovery
title_full Leveraging cross-link modification events in CLIP-seq for motif discovery
title_fullStr Leveraging cross-link modification events in CLIP-seq for motif discovery
title_full_unstemmed Leveraging cross-link modification events in CLIP-seq for motif discovery
title_short Leveraging cross-link modification events in CLIP-seq for motif discovery
title_sort leveraging cross-link modification events in clip-seq for motif discovery
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4288180/
https://www.ncbi.nlm.nih.gov/pubmed/25505146
http://dx.doi.org/10.1093/nar/gku1288
work_keys_str_mv AT bahramisamaniemad leveragingcrosslinkmodificationeventsinclipseqformotifdiscovery
AT penalvaluizof leveragingcrosslinkmodificationeventsinclipseqformotifdiscovery
AT smithandrewd leveragingcrosslinkmodificationeventsinclipseqformotifdiscovery
AT urenphilipj leveragingcrosslinkmodificationeventsinclipseqformotifdiscovery