Cargando…

Inferring RNA-binding protein target preferences using adversarial domain adaptation

Precise identification of target sites of RNA-binding proteins (RBP) is important to understand their biochemical and cellular functions. A large amount of experimental data is generated by in vivo and in vitro approaches. The binding preferences determined from these platforms share similar pattern...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Ying, Li, Ruihui, Luo, Jiawei, Zhang, Zhaolei
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870515/ https://www.ncbi.nlm.nih.gov/pubmed/35202389 http://dx.doi.org/10.1371/journal.pcbi.1009863

_version_	1784656776212250624
author	Liu, Ying Li, Ruihui Luo, Jiawei Zhang, Zhaolei
author_facet	Liu, Ying Li, Ruihui Luo, Jiawei Zhang, Zhaolei
author_sort	Liu, Ying
collection	PubMed
description	Precise identification of target sites of RNA-binding proteins (RBP) is important to understand their biochemical and cellular functions. A large amount of experimental data is generated by in vivo and in vitro approaches. The binding preferences determined from these platforms share similar patterns but there are discernable differences between these datasets. Computational methods trained on one dataset do not always work well on another dataset. To address this problem which resembles the classic “domain shift” in deep learning, we adopted the adversarial domain adaptation (ADDA) technique and developed a framework (RBP-ADDA) that can extract RBP binding preferences from an integration of in vivo and vitro datasets. Compared with conventional methods, ADDA has the advantage of working with two input datasets, as it trains the initial neural network for each dataset individually, projects the two datasets onto a feature space, and uses an adversarial framework to derive an optimal network that achieves an optimal discriminative predictive power. In the first step, for each RBP, we include only the in vitro data to pre-train a source network and a task predictor. Next, for the same RBP, we initiate the target network by using the source network and use adversarial domain adaptation to update the target network using both in vitro and in vivo data. These two steps help leverage the in vitro data to improve the prediction on in vivo data, which is typically challenging with a lower signal-to-noise ratio. Finally, to further take the advantage of the fused source and target data, we fine-tune the task predictor using both data. We showed that RBP-ADDA achieved better performance in modeling in vivo RBP binding data than other existing methods as judged by Pearson correlations. It also improved predictive performance on in vitro datasets. We further applied augmentation operations on RBPs with less in vivo data to expand the input data and showed that it can improve prediction performances. Lastly, we explored the predictive interpretability of RBP-ADDA, where we quantified the contribution of the input features by Integrated Gradients and identified nucleotide positions that are important for RBP recognition.
format	Online Article Text
id	pubmed-8870515
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-88705152022-02-25 Inferring RNA-binding protein target preferences using adversarial domain adaptation Liu, Ying Li, Ruihui Luo, Jiawei Zhang, Zhaolei PLoS Comput Biol Research Article Precise identification of target sites of RNA-binding proteins (RBP) is important to understand their biochemical and cellular functions. A large amount of experimental data is generated by in vivo and in vitro approaches. The binding preferences determined from these platforms share similar patterns but there are discernable differences between these datasets. Computational methods trained on one dataset do not always work well on another dataset. To address this problem which resembles the classic “domain shift” in deep learning, we adopted the adversarial domain adaptation (ADDA) technique and developed a framework (RBP-ADDA) that can extract RBP binding preferences from an integration of in vivo and vitro datasets. Compared with conventional methods, ADDA has the advantage of working with two input datasets, as it trains the initial neural network for each dataset individually, projects the two datasets onto a feature space, and uses an adversarial framework to derive an optimal network that achieves an optimal discriminative predictive power. In the first step, for each RBP, we include only the in vitro data to pre-train a source network and a task predictor. Next, for the same RBP, we initiate the target network by using the source network and use adversarial domain adaptation to update the target network using both in vitro and in vivo data. These two steps help leverage the in vitro data to improve the prediction on in vivo data, which is typically challenging with a lower signal-to-noise ratio. Finally, to further take the advantage of the fused source and target data, we fine-tune the task predictor using both data. We showed that RBP-ADDA achieved better performance in modeling in vivo RBP binding data than other existing methods as judged by Pearson correlations. It also improved predictive performance on in vitro datasets. We further applied augmentation operations on RBPs with less in vivo data to expand the input data and showed that it can improve prediction performances. Lastly, we explored the predictive interpretability of RBP-ADDA, where we quantified the contribution of the input features by Integrated Gradients and identified nucleotide positions that are important for RBP recognition. Public Library of Science 2022-02-24 /pmc/articles/PMC8870515/ /pubmed/35202389 http://dx.doi.org/10.1371/journal.pcbi.1009863 Text en © 2022 Liu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Liu, Ying Li, Ruihui Luo, Jiawei Zhang, Zhaolei Inferring RNA-binding protein target preferences using adversarial domain adaptation
title	Inferring RNA-binding protein target preferences using adversarial domain adaptation
title_full	Inferring RNA-binding protein target preferences using adversarial domain adaptation
title_fullStr	Inferring RNA-binding protein target preferences using adversarial domain adaptation
title_full_unstemmed	Inferring RNA-binding protein target preferences using adversarial domain adaptation
title_short	Inferring RNA-binding protein target preferences using adversarial domain adaptation
title_sort	inferring rna-binding protein target preferences using adversarial domain adaptation
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870515/ https://www.ncbi.nlm.nih.gov/pubmed/35202389 http://dx.doi.org/10.1371/journal.pcbi.1009863
work_keys_str_mv	AT liuying inferringrnabindingproteintargetpreferencesusingadversarialdomainadaptation AT liruihui inferringrnabindingproteintargetpreferencesusingadversarialdomainadaptation AT luojiawei inferringrnabindingproteintargetpreferencesusingadversarialdomainadaptation AT zhangzhaolei inferringrnabindingproteintargetpreferencesusingadversarialdomainadaptation

Inferring RNA-binding protein target preferences using adversarial domain adaptation

Ejemplares similares