Cargando…

Predicate Oriented Pattern Analysis for Biomedical Knowledge Discovery

In the current biomedical data movement, numerous efforts have been made to convert and normalize a large number of traditional structured and unstructured data (e.g., EHRs, reports) to semi-structured data (e.g., RDF, OWL). With the increasing number of semi-structured data coming into the biomedic...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Feichen, Liu, Hongfang, Sohn, Sunghwan, Larson, David W., Lee, Yugyung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5626454/
https://www.ncbi.nlm.nih.gov/pubmed/28983419
http://dx.doi.org/10.4236/iim.2016.83006
_version_ 1783268548169170944
author Shen, Feichen
Liu, Hongfang
Sohn, Sunghwan
Larson, David W.
Lee, Yugyung
author_facet Shen, Feichen
Liu, Hongfang
Sohn, Sunghwan
Larson, David W.
Lee, Yugyung
author_sort Shen, Feichen
collection PubMed
description In the current biomedical data movement, numerous efforts have been made to convert and normalize a large number of traditional structured and unstructured data (e.g., EHRs, reports) to semi-structured data (e.g., RDF, OWL). With the increasing number of semi-structured data coming into the biomedical community, data integration and knowledge discovery from heterogeneous domains become important research problem. In the application level, detection of related concepts among medical ontologies is an important goal of life science research. It is more crucial to figure out how different concepts are related within a single ontology or across multiple ontologies by analysing predicates in different knowledge bases. However, the world today is one of information explosion, and it is extremely difficult for biomedical researchers to find existing or potential predicates to perform linking among cross domain concepts without any support from schema pattern analysis. Therefore, there is a need for a mechanism to do predicate oriented pattern analysis to partition heterogeneous ontologies into closer small topics and do query generation to discover cross domain knowledge from each topic. In this paper, we present such a model that predicates oriented pattern analysis based on their close relationship and generates a similarity matrix. Based on this similarity matrix, we apply an innovated unsupervised learning algorithm to partition large data sets into smaller and closer topics and generate meaningful queries to fully discover knowledge over a set of interlinked data sources. We have implemented a prototype system named BmQGen and evaluate the proposed model with colorectal surgical cohort from the Mayo Clinic.
format Online
Article
Text
id pubmed-5626454
institution National Center for Biotechnology Information
language English
publishDate 2016
record_format MEDLINE/PubMed
spelling pubmed-56264542017-10-03 Predicate Oriented Pattern Analysis for Biomedical Knowledge Discovery Shen, Feichen Liu, Hongfang Sohn, Sunghwan Larson, David W. Lee, Yugyung Intell Inf Manag Article In the current biomedical data movement, numerous efforts have been made to convert and normalize a large number of traditional structured and unstructured data (e.g., EHRs, reports) to semi-structured data (e.g., RDF, OWL). With the increasing number of semi-structured data coming into the biomedical community, data integration and knowledge discovery from heterogeneous domains become important research problem. In the application level, detection of related concepts among medical ontologies is an important goal of life science research. It is more crucial to figure out how different concepts are related within a single ontology or across multiple ontologies by analysing predicates in different knowledge bases. However, the world today is one of information explosion, and it is extremely difficult for biomedical researchers to find existing or potential predicates to perform linking among cross domain concepts without any support from schema pattern analysis. Therefore, there is a need for a mechanism to do predicate oriented pattern analysis to partition heterogeneous ontologies into closer small topics and do query generation to discover cross domain knowledge from each topic. In this paper, we present such a model that predicates oriented pattern analysis based on their close relationship and generates a similarity matrix. Based on this similarity matrix, we apply an innovated unsupervised learning algorithm to partition large data sets into smaller and closer topics and generate meaningful queries to fully discover knowledge over a set of interlinked data sources. We have implemented a prototype system named BmQGen and evaluate the proposed model with colorectal surgical cohort from the Mayo Clinic. 2016-05 /pmc/articles/PMC5626454/ /pubmed/28983419 http://dx.doi.org/10.4236/iim.2016.83006 Text en http://creativecommons.org/licenses/by/4.0/ This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Shen, Feichen
Liu, Hongfang
Sohn, Sunghwan
Larson, David W.
Lee, Yugyung
Predicate Oriented Pattern Analysis for Biomedical Knowledge Discovery
title Predicate Oriented Pattern Analysis for Biomedical Knowledge Discovery
title_full Predicate Oriented Pattern Analysis for Biomedical Knowledge Discovery
title_fullStr Predicate Oriented Pattern Analysis for Biomedical Knowledge Discovery
title_full_unstemmed Predicate Oriented Pattern Analysis for Biomedical Knowledge Discovery
title_short Predicate Oriented Pattern Analysis for Biomedical Knowledge Discovery
title_sort predicate oriented pattern analysis for biomedical knowledge discovery
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5626454/
https://www.ncbi.nlm.nih.gov/pubmed/28983419
http://dx.doi.org/10.4236/iim.2016.83006
work_keys_str_mv AT shenfeichen predicateorientedpatternanalysisforbiomedicalknowledgediscovery
AT liuhongfang predicateorientedpatternanalysisforbiomedicalknowledgediscovery
AT sohnsunghwan predicateorientedpatternanalysisforbiomedicalknowledgediscovery
AT larsondavidw predicateorientedpatternanalysisforbiomedicalknowledgediscovery
AT leeyugyung predicateorientedpatternanalysisforbiomedicalknowledgediscovery