Cargando…

DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms

BACKGROUND: Long non-coding RNAs (lncRNAs) regulate diverse biological processes via interactions with proteins. Since the experimental methods to identify these interactions are expensive and time-consuming, many computational methods have been proposed. Although these computational methods have ac...

Descripción completa

Detalles Bibliográficos
Autores principales: Shaw, Dipan, Chen, Hao, Xie, Minzhu, Jiang, Tao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7814738/
https://www.ncbi.nlm.nih.gov/pubmed/33461501
http://dx.doi.org/10.1186/s12859-020-03914-7
_version_ 1783638115258204160
author Shaw, Dipan
Chen, Hao
Xie, Minzhu
Jiang, Tao
author_facet Shaw, Dipan
Chen, Hao
Xie, Minzhu
Jiang, Tao
author_sort Shaw, Dipan
collection PubMed
description BACKGROUND: Long non-coding RNAs (lncRNAs) regulate diverse biological processes via interactions with proteins. Since the experimental methods to identify these interactions are expensive and time-consuming, many computational methods have been proposed. Although these computational methods have achieved promising prediction performance, they neglect the fact that a gene may encode multiple protein isoforms and different isoforms of the same gene may interact differently with the same lncRNA. RESULTS: In this study, we propose a novel method, DeepLPI, for predicting the interactions between lncRNAs and protein isoforms. Our method uses sequence and structure data to extract intrinsic features and expression data to extract topological features. To combine these different data, we adopt a hybrid framework by integrating a multimodal deep learning neural network and a conditional random field. To overcome the lack of known interactions between lncRNAs and protein isoforms, we apply a multiple instance learning (MIL) approach. In our experiment concerning the human lncRNA-protein interactions in the NPInter v3.0 database, DeepLPI improved the prediction performance by 4.7% in term of AUC and 5.9% in term of AUPRC over the state-of-the-art methods. Our further correlation analyses between interactive lncRNAs and protein isoforms also illustrated that their co-expression information helped predict the interactions. Finally, we give some examples where DeepLPI was able to outperform the other methods in predicting mouse lncRNA-protein interactions and novel human lncRNA-protein interactions. CONCLUSION: Our results demonstrated that the use of isoforms and MIL contributed significantly to the improvement of performance in predicting lncRNA and protein interactions. We believe that such an approach would find more applications in predicting other functional roles of RNAs and proteins.
format Online
Article
Text
id pubmed-7814738
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-78147382021-01-21 DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms Shaw, Dipan Chen, Hao Xie, Minzhu Jiang, Tao BMC Bioinformatics Methodology Article BACKGROUND: Long non-coding RNAs (lncRNAs) regulate diverse biological processes via interactions with proteins. Since the experimental methods to identify these interactions are expensive and time-consuming, many computational methods have been proposed. Although these computational methods have achieved promising prediction performance, they neglect the fact that a gene may encode multiple protein isoforms and different isoforms of the same gene may interact differently with the same lncRNA. RESULTS: In this study, we propose a novel method, DeepLPI, for predicting the interactions between lncRNAs and protein isoforms. Our method uses sequence and structure data to extract intrinsic features and expression data to extract topological features. To combine these different data, we adopt a hybrid framework by integrating a multimodal deep learning neural network and a conditional random field. To overcome the lack of known interactions between lncRNAs and protein isoforms, we apply a multiple instance learning (MIL) approach. In our experiment concerning the human lncRNA-protein interactions in the NPInter v3.0 database, DeepLPI improved the prediction performance by 4.7% in term of AUC and 5.9% in term of AUPRC over the state-of-the-art methods. Our further correlation analyses between interactive lncRNAs and protein isoforms also illustrated that their co-expression information helped predict the interactions. Finally, we give some examples where DeepLPI was able to outperform the other methods in predicting mouse lncRNA-protein interactions and novel human lncRNA-protein interactions. CONCLUSION: Our results demonstrated that the use of isoforms and MIL contributed significantly to the improvement of performance in predicting lncRNA and protein interactions. We believe that such an approach would find more applications in predicting other functional roles of RNAs and proteins. BioMed Central 2021-01-18 /pmc/articles/PMC7814738/ /pubmed/33461501 http://dx.doi.org/10.1186/s12859-020-03914-7 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Shaw, Dipan
Chen, Hao
Xie, Minzhu
Jiang, Tao
DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms
title DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms
title_full DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms
title_fullStr DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms
title_full_unstemmed DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms
title_short DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms
title_sort deeplpi: a multimodal deep learning method for predicting the interactions between lncrnas and protein isoforms
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7814738/
https://www.ncbi.nlm.nih.gov/pubmed/33461501
http://dx.doi.org/10.1186/s12859-020-03914-7
work_keys_str_mv AT shawdipan deeplpiamultimodaldeeplearningmethodforpredictingtheinteractionsbetweenlncrnasandproteinisoforms
AT chenhao deeplpiamultimodaldeeplearningmethodforpredictingtheinteractionsbetweenlncrnasandproteinisoforms
AT xieminzhu deeplpiamultimodaldeeplearningmethodforpredictingtheinteractionsbetweenlncrnasandproteinisoforms
AT jiangtao deeplpiamultimodaldeeplearningmethodforpredictingtheinteractionsbetweenlncrnasandproteinisoforms