Cargando…

Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities

BACKGROUND: The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Zhen-Hao, You, Zhu-Hong, Wang, Yan-Bin, Huang, De-Shuang, Yi, Hai-Cheng, Chen, Zhan-Heng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7293023/
https://www.ncbi.nlm.nih.gov/pubmed/32533701
http://dx.doi.org/10.1093/gigascience/giaa032
_version_ 1783546216467922944
author Guo, Zhen-Hao
You, Zhu-Hong
Wang, Yan-Bin
Huang, De-Shuang
Yi, Hai-Cheng
Chen, Zhan-Heng
author_facet Guo, Zhen-Hao
You, Zhu-Hong
Wang, Yan-Bin
Huang, De-Shuang
Yi, Hai-Cheng
Chen, Zhan-Heng
author_sort Guo, Zhen-Hao
collection PubMed
description BACKGROUND: The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical and functional landscape of biological systems. RESULTS: We constructed a molecular association network, which contains 18 edges (relationships) between 8 nodes (bioentities). Based on this, we propose Bioentity2vec, a new method for representing bioentities, which integrates information about the attributes and behaviors of a bioentity. Applying the random forest classifier, we achieved promising performance on 18 relationships, with an area under the curve of 0.9608 and an area under the precision-recall curve of 0.9572. CONCLUSIONS: Our study shows that constructing a network with rich topological and biological information is important for systematic understanding of the biological landscape at the molecular level. Our results show that Bioentity2vec can effectively represent biological entities and provides easily distinguishable information about classification tasks. Our method is also able to simultaneously predict relationships between single types and multiple types, which will accelerate progress in biological experimental research and industrial product development.
format Online
Article
Text
id pubmed-7293023
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-72930232020-06-17 Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities Guo, Zhen-Hao You, Zhu-Hong Wang, Yan-Bin Huang, De-Shuang Yi, Hai-Cheng Chen, Zhan-Heng Gigascience Research BACKGROUND: The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical and functional landscape of biological systems. RESULTS: We constructed a molecular association network, which contains 18 edges (relationships) between 8 nodes (bioentities). Based on this, we propose Bioentity2vec, a new method for representing bioentities, which integrates information about the attributes and behaviors of a bioentity. Applying the random forest classifier, we achieved promising performance on 18 relationships, with an area under the curve of 0.9608 and an area under the precision-recall curve of 0.9572. CONCLUSIONS: Our study shows that constructing a network with rich topological and biological information is important for systematic understanding of the biological landscape at the molecular level. Our results show that Bioentity2vec can effectively represent biological entities and provides easily distinguishable information about classification tasks. Our method is also able to simultaneously predict relationships between single types and multiple types, which will accelerate progress in biological experimental research and industrial product development. Oxford University Press 2020-06-13 /pmc/articles/PMC7293023/ /pubmed/32533701 http://dx.doi.org/10.1093/gigascience/giaa032 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Guo, Zhen-Hao
You, Zhu-Hong
Wang, Yan-Bin
Huang, De-Shuang
Yi, Hai-Cheng
Chen, Zhan-Heng
Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities
title Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities
title_full Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities
title_fullStr Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities
title_full_unstemmed Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities
title_short Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities
title_sort bioentity2vec: attribute- and behavior-driven representation for predicting multi-type relationships between bioentities
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7293023/
https://www.ncbi.nlm.nih.gov/pubmed/32533701
http://dx.doi.org/10.1093/gigascience/giaa032
work_keys_str_mv AT guozhenhao bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities
AT youzhuhong bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities
AT wangyanbin bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities
AT huangdeshuang bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities
AT yihaicheng bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities
AT chenzhanheng bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities