Cargando…
Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes
An approach towards heterogeneous neuroscience dataset integration is proposed that uses Natural Language Processing (NLP) and a knowledgebased phenotype organizer system (PhenOS) to link ontology-anchored terms to underlying data from each database, and then maps these terms based on a computable m...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
American Medical Informatics Association
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041585/ https://www.ncbi.nlm.nih.gov/pubmed/21347176 |
_version_ | 1782198453224341504 |
---|---|
author | Pantazatos, Spiro P. Li, Jianrong Pavlidis, Paul Lussier, Yves A. |
author_facet | Pantazatos, Spiro P. Li, Jianrong Pavlidis, Paul Lussier, Yves A. |
author_sort | Pantazatos, Spiro P. |
collection | PubMed |
description | An approach towards heterogeneous neuroscience dataset integration is proposed that uses Natural Language Processing (NLP) and a knowledgebased phenotype organizer system (PhenOS) to link ontology-anchored terms to underlying data from each database, and then maps these terms based on a computable model of disease (SNOMED CT®). The approach was implemented using sample datasets from fMRIDC, GEO and Neuronames and allowed for complex queries such as “List all disorders with a finding site of brain region X, and then find the semantically related references in all participating databases based on the ontological model of the disease or its anatomical and morphological attributes”. Precision of the NLP-derived coding of the unstructured phenotypes in each datasets was 88% (n=50), and precision of the semantic mapping between these terms across datasets was 98% (n=100). To our knowledge, this is the first example of the use of both semantic decomposition of disease relationships and hierarchical information found in ontologies to integrate heterogeneous phenotypes across clinical and molecular datasets. |
format | Text |
id | pubmed-3041585 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | American Medical Informatics Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-30415852011-02-23 Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes Pantazatos, Spiro P. Li, Jianrong Pavlidis, Paul Lussier, Yves A. Summit on Translat Bioinforma Articles An approach towards heterogeneous neuroscience dataset integration is proposed that uses Natural Language Processing (NLP) and a knowledgebased phenotype organizer system (PhenOS) to link ontology-anchored terms to underlying data from each database, and then maps these terms based on a computable model of disease (SNOMED CT®). The approach was implemented using sample datasets from fMRIDC, GEO and Neuronames and allowed for complex queries such as “List all disorders with a finding site of brain region X, and then find the semantically related references in all participating databases based on the ontological model of the disease or its anatomical and morphological attributes”. Precision of the NLP-derived coding of the unstructured phenotypes in each datasets was 88% (n=50), and precision of the semantic mapping between these terms across datasets was 98% (n=100). To our knowledge, this is the first example of the use of both semantic decomposition of disease relationships and hierarchical information found in ontologies to integrate heterogeneous phenotypes across clinical and molecular datasets. American Medical Informatics Association 2009-03-01 /pmc/articles/PMC3041585/ /pubmed/21347176 Text en ©2009 AMIA - All rights reserved. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose |
spellingShingle | Articles Pantazatos, Spiro P. Li, Jianrong Pavlidis, Paul Lussier, Yves A. Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes |
title | Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes |
title_full | Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes |
title_fullStr | Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes |
title_full_unstemmed | Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes |
title_short | Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes |
title_sort | integration of neuroimaging and microarray datasets through mapping and model-theoretic semantic decomposition of unstructured phenotypes |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041585/ https://www.ncbi.nlm.nih.gov/pubmed/21347176 |
work_keys_str_mv | AT pantazatosspirop integrationofneuroimagingandmicroarraydatasetsthroughmappingandmodeltheoreticsemanticdecompositionofunstructuredphenotypes AT lijianrong integrationofneuroimagingandmicroarraydatasetsthroughmappingandmodeltheoreticsemanticdecompositionofunstructuredphenotypes AT pavlidispaul integrationofneuroimagingandmicroarraydatasetsthroughmappingandmodeltheoreticsemanticdecompositionofunstructuredphenotypes AT lussieryvesa integrationofneuroimagingandmicroarraydatasetsthroughmappingandmodeltheoreticsemanticdecompositionofunstructuredphenotypes |