Cargando…
A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets
BACKGROUND: Semantic web technology has been applied widely in the biomedical informatics field. Large numbers of biomedical datasets are available online in the resource description framework (RDF) format. Semantic relationship mining among genes, disorders, and drugs is widely used in, for example...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7734713/ https://www.ncbi.nlm.nih.gov/pubmed/33317518 http://dx.doi.org/10.1186/s12911-020-01274-z |
_version_ | 1783622522000900096 |
---|---|
author | Zhang, Li Hu, Jiamei Xu, Qianzhi Li, Fang Rao, Guozheng Tao, Cui |
author_facet | Zhang, Li Hu, Jiamei Xu, Qianzhi Li, Fang Rao, Guozheng Tao, Cui |
author_sort | Zhang, Li |
collection | PubMed |
description | BACKGROUND: Semantic web technology has been applied widely in the biomedical informatics field. Large numbers of biomedical datasets are available online in the resource description framework (RDF) format. Semantic relationship mining among genes, disorders, and drugs is widely used in, for example, precision medicine and drug repositioning. However, most of the existing studies focused on a single dataset. It is not easy to find the most current relationships among disorder-gene-drug relationships since the relationships are distributed in heterogeneous datasets. How to mine their semantic relationships from different biomedical datasets is an important issue. METHODS: First, a variety of biomedical datasets were converted into RDF triple data; then, multisource biomedical datasets were integrated into a storage system using a data integration algorithm. Second, nine query patterns among genes, disorders, and drugs from different biomedical datasets were designed. Third, the gene-disorder-drug semantic relationship mining algorithm is presented. This algorithm can query the relationships among various entities from different datasets. RESULTS AND CONCLUSIONS: We focused on mining the putative and the most current disorder-gene-drug relationships about Parkinson’s disease (PD). The results demonstrate that our method has significant advantages in mining and integrating multisource heterogeneous biomedical datasets. Twenty-five new relationships among the genes, disorders, and drugs were mined from four different datasets. The query results showed that most of them came from different datasets. The precision of the method increased by 2.51% compared to that of the multisource linked open data fusion method presented in the 4th International Workshop on Semantics-Powered Data Mining and Analytics (SEPDA 2019). Moreover, the number of query results increased by 7.7%, and the number of correct queries increased by 9.5%. |
format | Online Article Text |
id | pubmed-7734713 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-77347132020-12-15 A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets Zhang, Li Hu, Jiamei Xu, Qianzhi Li, Fang Rao, Guozheng Tao, Cui BMC Med Inform Decis Mak Research BACKGROUND: Semantic web technology has been applied widely in the biomedical informatics field. Large numbers of biomedical datasets are available online in the resource description framework (RDF) format. Semantic relationship mining among genes, disorders, and drugs is widely used in, for example, precision medicine and drug repositioning. However, most of the existing studies focused on a single dataset. It is not easy to find the most current relationships among disorder-gene-drug relationships since the relationships are distributed in heterogeneous datasets. How to mine their semantic relationships from different biomedical datasets is an important issue. METHODS: First, a variety of biomedical datasets were converted into RDF triple data; then, multisource biomedical datasets were integrated into a storage system using a data integration algorithm. Second, nine query patterns among genes, disorders, and drugs from different biomedical datasets were designed. Third, the gene-disorder-drug semantic relationship mining algorithm is presented. This algorithm can query the relationships among various entities from different datasets. RESULTS AND CONCLUSIONS: We focused on mining the putative and the most current disorder-gene-drug relationships about Parkinson’s disease (PD). The results demonstrate that our method has significant advantages in mining and integrating multisource heterogeneous biomedical datasets. Twenty-five new relationships among the genes, disorders, and drugs were mined from four different datasets. The query results showed that most of them came from different datasets. The precision of the method increased by 2.51% compared to that of the multisource linked open data fusion method presented in the 4th International Workshop on Semantics-Powered Data Mining and Analytics (SEPDA 2019). Moreover, the number of query results increased by 7.7%, and the number of correct queries increased by 9.5%. BioMed Central 2020-12-14 /pmc/articles/PMC7734713/ /pubmed/33317518 http://dx.doi.org/10.1186/s12911-020-01274-z Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Zhang, Li Hu, Jiamei Xu, Qianzhi Li, Fang Rao, Guozheng Tao, Cui A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets |
title | A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets |
title_full | A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets |
title_fullStr | A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets |
title_full_unstemmed | A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets |
title_short | A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets |
title_sort | semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7734713/ https://www.ncbi.nlm.nih.gov/pubmed/33317518 http://dx.doi.org/10.1186/s12911-020-01274-z |
work_keys_str_mv | AT zhangli asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT hujiamei asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT xuqianzhi asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT lifang asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT raoguozheng asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT taocui asemanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT zhangli semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT hujiamei semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT xuqianzhi semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT lifang semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT raoguozheng semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets AT taocui semanticrelationshipminingmethodamongdisordersgenesanddrugsfromdifferentbiomedicaldatasets |