Cargando…

A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets

BACKGROUND: Semantic web technology has been applied widely in the biomedical informatics field. Large numbers of biomedical datasets are available online in the resource description framework (RDF) format. Semantic relationship mining among genes, disorders, and drugs is widely used in, for example...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Li, Hu, Jiamei, Xu, Qianzhi, Li, Fang, Rao, Guozheng, Tao, Cui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7734713/
https://www.ncbi.nlm.nih.gov/pubmed/33317518
http://dx.doi.org/10.1186/s12911-020-01274-z
_version_ 1783622522000900096
author Zhang, Li
Hu, Jiamei
Xu, Qianzhi
Li, Fang
Rao, Guozheng
Tao, Cui
author_facet Zhang, Li
Hu, Jiamei
Xu, Qianzhi
Li, Fang
Rao, Guozheng
Tao, Cui
author_sort Zhang, Li
collection PubMed
description BACKGROUND: Semantic web technology has been applied widely in the biomedical informatics field. Large numbers of biomedical datasets are available online in the resource description framework (RDF) format. Semantic relationship mining among genes, disorders, and drugs is widely used in, for example, precision medicine and drug repositioning. However, most of the existing studies focused on a single dataset. It is not easy to find the most current relationships among disorder-gene-drug relationships since the relationships are distributed in heterogeneous datasets. How to mine their semantic relationships from different biomedical datasets is an important issue. METHODS: First, a variety of biomedical datasets were converted into RDF triple data; then, multisource biomedical datasets were integrated into a storage system using a data integration algorithm. Second, nine query patterns among genes, disorders, and drugs from different biomedical datasets were designed. Third, the gene-disorder-drug semantic relationship mining algorithm is presented. This algorithm can query the relationships among various entities from different datasets. RESULTS AND CONCLUSIONS: We focused on mining the putative and the most current disorder-gene-drug relationships about Parkinson’s disease (PD). The results demonstrate that our method has significant advantages in mining and integrating multisource heterogeneous biomedical datasets. Twenty-five new relationships among the genes, disorders, and drugs were mined from four different datasets. The query results showed that most of them came from different datasets. The precision of the method increased by 2.51% compared to that of the multisource linked open data fusion method presented in the 4th International Workshop on Semantics-Powered Data Mining and Analytics (SEPDA 2019). Moreover, the number of query results increased by 7.7%, and the number of correct queries increased by 9.5%.
format Online
Article
Text
id pubmed-7734713
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77347132020-12-15 A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets Zhang, Li Hu, Jiamei Xu, Qianzhi Li, Fang Rao, Guozheng Tao, Cui BMC Med Inform Decis Mak Research BACKGROUND: Semantic web technology has been applied widely in the biomedical informatics field. Large numbers of biomedical datasets are available online in the resource description framework (RDF) format. Semantic relationship mining among genes, disorders, and drugs is widely used in, for example, precision medicine and drug repositioning. However, most of the existing studies focused on a single dataset. It is not easy to find the most current relationships among disorder-gene-drug relationships since the relationships are distributed in heterogeneous datasets. How to mine their semantic relationships from different biomedical datasets is an important issue. METHODS: First, a variety of biomedical datasets were converted into RDF triple data; then, multisource biomedical datasets were integrated into a storage system using a data integration algorithm. Second, nine query patterns among genes, disorders, and drugs from different biomedical datasets were designed. Third, the gene-disorder-drug semantic relationship mining algorithm is presented. This algorithm can query the relationships among various entities from different datasets. RESULTS AND CONCLUSIONS: We focused on mining the putative and the most current disorder-gene-drug relationships about Parkinson’s disease (PD). The results demonstrate that our method has significant advantages in mining and integrating multisource heterogeneous biomedical datasets. Twenty-five new relationships among the genes, disorders, and drugs were mined from four different datasets. The query results showed that most of them came from different datasets. The precision of the method increased by 2.51% compared to that of the multisource linked open data fusion method presented in the 4th International Workshop on Semantics-Powered Data Mining and Analytics (SEPDA 2019). Moreover, the number of query results increased by 7.7%, and the number of correct queries increased by 9.5%. BioMed Central 2020-12-14 /pmc/articles/PMC7734713/ /pubmed/33317518 http://dx.doi.org/10.1186/s12911-020-01274-z Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhang, Li
Hu, Jiamei
Xu, Qianzhi
Li, Fang
Rao, Guozheng
Tao, Cui
A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets
title A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets
title_full A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets
title_fullStr A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets
title_full_unstemmed A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets
title_short A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets
title_sort semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7734713/
https://www.ncbi.nlm.nih.gov/pubmed/33317518
http://dx.doi.org/10.1186/s12911-020-01274-z
work_keys_str_mv AT zhangli asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT hujiamei asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT xuqianzhi asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT lifang asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT raoguozheng asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT taocui asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT zhangli semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT hujiamei semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT xuqianzhi semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT lifang semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT raoguozheng semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets
AT taocui semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets