Cargando…
Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities
BACKGROUND: The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7293023/ https://www.ncbi.nlm.nih.gov/pubmed/32533701 http://dx.doi.org/10.1093/gigascience/giaa032 |
_version_ | 1783546216467922944 |
---|---|
author | Guo, Zhen-Hao You, Zhu-Hong Wang, Yan-Bin Huang, De-Shuang Yi, Hai-Cheng Chen, Zhan-Heng |
author_facet | Guo, Zhen-Hao You, Zhu-Hong Wang, Yan-Bin Huang, De-Shuang Yi, Hai-Cheng Chen, Zhan-Heng |
author_sort | Guo, Zhen-Hao |
collection | PubMed |
description | BACKGROUND: The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical and functional landscape of biological systems. RESULTS: We constructed a molecular association network, which contains 18 edges (relationships) between 8 nodes (bioentities). Based on this, we propose Bioentity2vec, a new method for representing bioentities, which integrates information about the attributes and behaviors of a bioentity. Applying the random forest classifier, we achieved promising performance on 18 relationships, with an area under the curve of 0.9608 and an area under the precision-recall curve of 0.9572. CONCLUSIONS: Our study shows that constructing a network with rich topological and biological information is important for systematic understanding of the biological landscape at the molecular level. Our results show that Bioentity2vec can effectively represent biological entities and provides easily distinguishable information about classification tasks. Our method is also able to simultaneously predict relationships between single types and multiple types, which will accelerate progress in biological experimental research and industrial product development. |
format | Online Article Text |
id | pubmed-7293023 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-72930232020-06-17 Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities Guo, Zhen-Hao You, Zhu-Hong Wang, Yan-Bin Huang, De-Shuang Yi, Hai-Cheng Chen, Zhan-Heng Gigascience Research BACKGROUND: The explosive growth of genomic, chemical, and pathological data provides new opportunities and challenges for humans to thoroughly understand life activities in cells. However, there exist few computational models that aggregate various bioentities to comprehensively reveal the physical and functional landscape of biological systems. RESULTS: We constructed a molecular association network, which contains 18 edges (relationships) between 8 nodes (bioentities). Based on this, we propose Bioentity2vec, a new method for representing bioentities, which integrates information about the attributes and behaviors of a bioentity. Applying the random forest classifier, we achieved promising performance on 18 relationships, with an area under the curve of 0.9608 and an area under the precision-recall curve of 0.9572. CONCLUSIONS: Our study shows that constructing a network with rich topological and biological information is important for systematic understanding of the biological landscape at the molecular level. Our results show that Bioentity2vec can effectively represent biological entities and provides easily distinguishable information about classification tasks. Our method is also able to simultaneously predict relationships between single types and multiple types, which will accelerate progress in biological experimental research and industrial product development. Oxford University Press 2020-06-13 /pmc/articles/PMC7293023/ /pubmed/32533701 http://dx.doi.org/10.1093/gigascience/giaa032 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Guo, Zhen-Hao You, Zhu-Hong Wang, Yan-Bin Huang, De-Shuang Yi, Hai-Cheng Chen, Zhan-Heng Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities |
title | Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities |
title_full | Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities |
title_fullStr | Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities |
title_full_unstemmed | Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities |
title_short | Bioentity2vec: Attribute- and behavior-driven representation for predicting multi-type relationships between bioentities |
title_sort | bioentity2vec: attribute- and behavior-driven representation for predicting multi-type relationships between bioentities |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7293023/ https://www.ncbi.nlm.nih.gov/pubmed/32533701 http://dx.doi.org/10.1093/gigascience/giaa032 |
work_keys_str_mv | AT guozhenhao bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities AT youzhuhong bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities AT wangyanbin bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities AT huangdeshuang bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities AT yihaicheng bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities AT chenzhanheng bioentity2vecattributeandbehaviordrivenrepresentationforpredictingmultityperelationshipsbetweenbioentities |