Cargando…
A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data
Characterizing the binding behaviors of RNA-binding proteins (RBPs) is important for understanding their functional roles in gene expression regulation. However, current high-throughput experimental methods for identifying RBP targets, such as CLIP-seq and RNAcompete, usually suffer from the false n...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737578/ https://www.ncbi.nlm.nih.gov/pubmed/28575488 http://dx.doi.org/10.1093/nar/gkx492 |
_version_ | 1783287543428546560 |
---|---|
author | Li, Shuya Dong, Fanghong Wu, Yuexin Zhang, Sai Zhang, Chen Liu, Xiao Jiang, Tao Zeng, Jianyang |
author_facet | Li, Shuya Dong, Fanghong Wu, Yuexin Zhang, Sai Zhang, Chen Liu, Xiao Jiang, Tao Zeng, Jianyang |
author_sort | Li, Shuya |
collection | PubMed |
description | Characterizing the binding behaviors of RNA-binding proteins (RBPs) is important for understanding their functional roles in gene expression regulation. However, current high-throughput experimental methods for identifying RBP targets, such as CLIP-seq and RNAcompete, usually suffer from the false negative issue. Here, we develop a deep boosting based machine learning approach, called DeBooster, to accurately model the binding sequence preferences and identify the corresponding binding targets of RBPs from CLIP-seq data. Comprehensive validation tests have shown that DeBooster can outperform other state-of-the-art approaches in RBP target prediction. In addition, we have demonstrated that DeBooster may provide new insights into understanding the regulatory functions of RBPs, including the binding effects of the RNA helicase MOV10 on mRNA degradation, the potentially different ADAR1 binding behaviors related to its editing activity, as well as the antagonizing effect of RBP binding on miRNA repression. Moreover, DeBooster may provide an effective index to investigate the effect of pathogenic mutations in RBP binding sites, especially those related to splicing events. We expect that DeBooster will be widely applied to analyze large-scale CLIP-seq experimental data and can provide a practically useful tool for novel biological discoveries in understanding the regulatory mechanisms of RBPs. The source code of DeBooster can be downloaded from http://github.com/dongfanghong/deepboost. |
format | Online Article Text |
id | pubmed-5737578 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-57375782018-01-04 A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data Li, Shuya Dong, Fanghong Wu, Yuexin Zhang, Sai Zhang, Chen Liu, Xiao Jiang, Tao Zeng, Jianyang Nucleic Acids Res Methods Online Characterizing the binding behaviors of RNA-binding proteins (RBPs) is important for understanding their functional roles in gene expression regulation. However, current high-throughput experimental methods for identifying RBP targets, such as CLIP-seq and RNAcompete, usually suffer from the false negative issue. Here, we develop a deep boosting based machine learning approach, called DeBooster, to accurately model the binding sequence preferences and identify the corresponding binding targets of RBPs from CLIP-seq data. Comprehensive validation tests have shown that DeBooster can outperform other state-of-the-art approaches in RBP target prediction. In addition, we have demonstrated that DeBooster may provide new insights into understanding the regulatory functions of RBPs, including the binding effects of the RNA helicase MOV10 on mRNA degradation, the potentially different ADAR1 binding behaviors related to its editing activity, as well as the antagonizing effect of RBP binding on miRNA repression. Moreover, DeBooster may provide an effective index to investigate the effect of pathogenic mutations in RBP binding sites, especially those related to splicing events. We expect that DeBooster will be widely applied to analyze large-scale CLIP-seq experimental data and can provide a practically useful tool for novel biological discoveries in understanding the regulatory mechanisms of RBPs. The source code of DeBooster can be downloaded from http://github.com/dongfanghong/deepboost. Oxford University Press 2017-08-21 2017-05-30 /pmc/articles/PMC5737578/ /pubmed/28575488 http://dx.doi.org/10.1093/nar/gkx492 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Li, Shuya Dong, Fanghong Wu, Yuexin Zhang, Sai Zhang, Chen Liu, Xiao Jiang, Tao Zeng, Jianyang A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data |
title | A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data |
title_full | A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data |
title_fullStr | A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data |
title_full_unstemmed | A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data |
title_short | A deep boosting based approach for capturing the sequence binding preferences of RNA-binding proteins from high-throughput CLIP-seq data |
title_sort | deep boosting based approach for capturing the sequence binding preferences of rna-binding proteins from high-throughput clip-seq data |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737578/ https://www.ncbi.nlm.nih.gov/pubmed/28575488 http://dx.doi.org/10.1093/nar/gkx492 |
work_keys_str_mv | AT lishuya adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT dongfanghong adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT wuyuexin adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT zhangsai adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT zhangchen adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT liuxiao adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT jiangtao adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT zengjianyang adeepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT lishuya deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT dongfanghong deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT wuyuexin deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT zhangsai deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT zhangchen deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT liuxiao deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT jiangtao deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata AT zengjianyang deepboostingbasedapproachforcapturingthesequencebindingpreferencesofrnabindingproteinsfromhighthroughputclipseqdata |