Cargando…

A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data

Characterizing the binding behaviors of RNA-binding proteins (RBPs) is important for understanding their functional roles in gene expression regulation. However, current high-throughput experimental methods for identifying RBP targets, such as CLIP-seq and RNAcompete, usually suffer from the false n...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Shuya, Dong, Fanghong, Wu, Yuexin, Zhang, Sai, Zhang, Chen, Liu, Xiao, Jiang, Tao, Zeng, Jianyang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737578/
https://www.ncbi.nlm.nih.gov/pubmed/28575488
http://dx.doi.org/10.1093/nar/gkx492
_version_ 1783287543428546560
author Li, Shuya
Dong, Fanghong
Wu, Yuexin
Zhang, Sai
Zhang, Chen
Liu, Xiao
Jiang, Tao
Zeng, Jianyang
author_facet Li, Shuya
Dong, Fanghong
Wu, Yuexin
Zhang, Sai
Zhang, Chen
Liu, Xiao
Jiang, Tao
Zeng, Jianyang
author_sort Li, Shuya
collection PubMed
description Characterizing the binding behaviors of RNA-binding proteins (RBPs) is important for understanding their functional roles in gene expression regulation. However, current high-throughput experimental methods for identifying RBP targets, such as CLIP-seq and RNAcompete, usually suffer from the false negative issue. Here, we develop a deep boosting based machine learning approach, called DeBooster, to accurately model the binding sequence preferences and identify the corresponding binding targets of RBPs from CLIP-seq data. Comprehensive validation tests have shown that DeBooster can outperform other state-of-the-art approaches in RBP target prediction. In addition, we have demonstrated that DeBooster may provide new insights into understanding the regulatory functions of RBPs, including the binding effects of the RNA helicase MOV10 on mRNA degradation, the potentially different ADAR1 binding behaviors related to its editing activity, as well as the antagonizing effect of RBP binding on miRNA repression. Moreover, DeBooster may provide an effective index to investigate the effect of pathogenic mutations in RBP binding sites, especially those related to splicing events. We expect that DeBooster will be widely applied to analyze large-scale CLIP-seq experimental data and can provide a practically useful tool for novel biological discoveries in understanding the regulatory mechanisms of RBPs. The source code of DeBooster can be downloaded from http://github.com/dongfanghong/deepboost.
format Online
Article
Text
id pubmed-5737578
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57375782018-01-04 A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data Li, Shuya Dong, Fanghong Wu, Yuexin Zhang, Sai Zhang, Chen Liu, Xiao Jiang, Tao Zeng, Jianyang Nucleic Acids Res Methods Online Characterizing the binding behaviors of RNA-binding proteins (RBPs) is important for understanding their functional roles in gene expression regulation. However, current high-throughput experimental methods for identifying RBP targets, such as CLIP-seq and RNAcompete, usually suffer from the false negative issue. Here, we develop a deep boosting based machine learning approach, called DeBooster, to accurately model the binding sequence preferences and identify the corresponding binding targets of RBPs from CLIP-seq data. Comprehensive validation tests have shown that DeBooster can outperform other state-of-the-art approaches in RBP target prediction. In addition, we have demonstrated that DeBooster may provide new insights into understanding the regulatory functions of RBPs, including the binding effects of the RNA helicase MOV10 on mRNA degradation, the potentially different ADAR1 binding behaviors related to its editing activity, as well as the antagonizing effect of RBP binding on miRNA repression. Moreover, DeBooster may provide an effective index to investigate the effect of pathogenic mutations in RBP binding sites, especially those related to splicing events. We expect that DeBooster will be widely applied to analyze large-scale CLIP-seq experimental data and can provide a practically useful tool for novel biological discoveries in understanding the regulatory mechanisms of RBPs. The source code of DeBooster can be downloaded from http://github.com/dongfanghong/deepboost. Oxford University Press 2017-08-21 2017-05-30 /pmc/articles/PMC5737578/ /pubmed/28575488 http://dx.doi.org/10.1093/nar/gkx492 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Li, Shuya
Dong, Fanghong
Wu, Yuexin
Zhang, Sai
Zhang, Chen
Liu, Xiao
Jiang, Tao
Zeng, Jianyang
A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data
title A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data
title_full A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data
title_fullStr A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data
title_full_unstemmed A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data
title_short A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data
title_sort deep boosting based approach for capturing the sequence binding preferences of rna-binding proteins from high-throughput clip-seq data
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737578/
https://www.ncbi.nlm.nih.gov/pubmed/28575488
http://dx.doi.org/10.1093/nar/gkx492
work_keys_str_mv AT lishuya adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT dongfanghong adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT wuyuexin adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT zhangsai adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT zhangchen adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT liuxiao adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT jiangtao adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT zengjianyang adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT lishuya deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT dongfanghong deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT wuyuexin deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT zhangsai deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT zhangchen deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT liuxiao deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT jiangtao deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata
AT zengjianyang deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata