Cargando…

Improving CLIP-seq data analysis by incorporating transcript information

BACKGROUND: Current peak callers for identifying RNA-binding protein (RBP) binding sites from CLIP-seq data take into account genomic read profiles, but they ignore the underlying transcript information, that is information regarding splicing events. So far, there are no studies available that close...

Descripción completa

Detalles Bibliográficos
Autores principales: Uhl, Michael, Tran, Van Dinh, Backofen, Rolf
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7745353/
https://www.ncbi.nlm.nih.gov/pubmed/33334306
http://dx.doi.org/10.1186/s12864-020-07297-0
Descripción
Sumario:BACKGROUND: Current peak callers for identifying RNA-binding protein (RBP) binding sites from CLIP-seq data take into account genomic read profiles, but they ignore the underlying transcript information, that is information regarding splicing events. So far, there are no studies available that closer observe this issue. RESULTS: Here we show that current peak callers are susceptible to false peak calling near exon borders. We quantify its extent in publicly available datasets, which turns out to be substantial. By providing a tool called CLIPcontext for automatic transcript and genomic context sequence extraction, we further demonstrate that context choice affects the performances of RBP binding site prediction tools. Moreover, we show that known motifs of exon-binding RBPs are often enriched in transcript context sites, which should enable the recovery of more authentic binding sites. Finally, we discuss possible strategies on how to integrate transcript information into future workflows. CONCLUSIONS: Our results demonstrate the importance of incorporating transcript information in CLIP-seq data analysis. Taking advantage of the underlying transcript information should therefore become an integral part of future peak calling and downstream analysis tools. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (doi:10.1186/s12864-020-07297-0).