Cargando…

GBDTL2E: Predicting lncRNA-EF Associations Using Diffusion and HeteSim Features Based on a Heterogeneous Network

Interactions between genetic factors and environmental factors (EFs) play an important role in many diseases. Many diseases result from the interaction between genetics and EFs. The long non-coding RNA (lncRNA) is an important non-coding RNA that regulates life processes. The ability to predict the...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jiaqi, Kuang, Zhufang, Ma, Zhihao, Han, Genwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7174746/
https://www.ncbi.nlm.nih.gov/pubmed/32351537
http://dx.doi.org/10.3389/fgene.2020.00272
_version_ 1783524691059671040
author Wang, Jiaqi
Kuang, Zhufang
Ma, Zhihao
Han, Genwei
author_facet Wang, Jiaqi
Kuang, Zhufang
Ma, Zhihao
Han, Genwei
author_sort Wang, Jiaqi
collection PubMed
description Interactions between genetic factors and environmental factors (EFs) play an important role in many diseases. Many diseases result from the interaction between genetics and EFs. The long non-coding RNA (lncRNA) is an important non-coding RNA that regulates life processes. The ability to predict the associations between lncRNAs and EFs is of important practical significance. However, the recent methods for predicting lncRNA-EF associations rarely use the topological information of heterogenous biological networks or simply treat all objects as the same type without considering the different and subtle semantic meanings of various paths in the heterogeneous network. In order to address this issue, a method based on the Gradient Boosting Decision Tree (GBDT) to predict the association between lncRNAs and EFs (GBDTL2E) is proposed in this paper. The innovation of the GBDTL2E integrates the structural information and heterogenous networks, combines the Hetesim features and the diffusion features based on multi-feature fusion, and uses the machine learning algorithm GBDT to predict the association between lncRNAs and EFs based on heterogeneous networks. The experimental results demonstrate that the proposed algorithm achieves a high performance.
format Online
Article
Text
id pubmed-7174746
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-71747462020-04-29 GBDTL2E: Predicting lncRNA-EF Associations Using Diffusion and HeteSim Features Based on a Heterogeneous Network Wang, Jiaqi Kuang, Zhufang Ma, Zhihao Han, Genwei Front Genet Genetics Interactions between genetic factors and environmental factors (EFs) play an important role in many diseases. Many diseases result from the interaction between genetics and EFs. The long non-coding RNA (lncRNA) is an important non-coding RNA that regulates life processes. The ability to predict the associations between lncRNAs and EFs is of important practical significance. However, the recent methods for predicting lncRNA-EF associations rarely use the topological information of heterogenous biological networks or simply treat all objects as the same type without considering the different and subtle semantic meanings of various paths in the heterogeneous network. In order to address this issue, a method based on the Gradient Boosting Decision Tree (GBDT) to predict the association between lncRNAs and EFs (GBDTL2E) is proposed in this paper. The innovation of the GBDTL2E integrates the structural information and heterogenous networks, combines the Hetesim features and the diffusion features based on multi-feature fusion, and uses the machine learning algorithm GBDT to predict the association between lncRNAs and EFs based on heterogeneous networks. The experimental results demonstrate that the proposed algorithm achieves a high performance. Frontiers Media S.A. 2020-04-15 /pmc/articles/PMC7174746/ /pubmed/32351537 http://dx.doi.org/10.3389/fgene.2020.00272 Text en Copyright © 2020 Wang, Kuang, Ma and Han. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wang, Jiaqi
Kuang, Zhufang
Ma, Zhihao
Han, Genwei
GBDTL2E: Predicting lncRNA-EF Associations Using Diffusion and HeteSim Features Based on a Heterogeneous Network
title GBDTL2E: Predicting lncRNA-EF Associations Using Diffusion and HeteSim Features Based on a Heterogeneous Network
title_full GBDTL2E: Predicting lncRNA-EF Associations Using Diffusion and HeteSim Features Based on a Heterogeneous Network
title_fullStr GBDTL2E: Predicting lncRNA-EF Associations Using Diffusion and HeteSim Features Based on a Heterogeneous Network
title_full_unstemmed GBDTL2E: Predicting lncRNA-EF Associations Using Diffusion and HeteSim Features Based on a Heterogeneous Network
title_short GBDTL2E: Predicting lncRNA-EF Associations Using Diffusion and HeteSim Features Based on a Heterogeneous Network
title_sort gbdtl2e: predicting lncrna-ef associations using diffusion and hetesim features based on a heterogeneous network
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7174746/
https://www.ncbi.nlm.nih.gov/pubmed/32351537
http://dx.doi.org/10.3389/fgene.2020.00272
work_keys_str_mv AT wangjiaqi gbdtl2epredictinglncrnaefassociationsusingdiffusionandhetesimfeaturesbasedonaheterogeneousnetwork
AT kuangzhufang gbdtl2epredictinglncrnaefassociationsusingdiffusionandhetesimfeaturesbasedonaheterogeneousnetwork
AT mazhihao gbdtl2epredictinglncrnaefassociationsusingdiffusionandhetesimfeaturesbasedonaheterogeneousnetwork
AT hangenwei gbdtl2epredictinglncrnaefassociationsusingdiffusionandhetesimfeaturesbasedonaheterogeneousnetwork