Cargando…

Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes

An approach towards heterogeneous neuroscience dataset integration is proposed that uses Natural Language Processing (NLP) and a knowledge-based phenotype organizer system (PhenOS) to link ontology-anchored terms to underlying data from each database, and then maps these terms based on a computable...

Descripción completa

Detalles Bibliográficos
Autores principales: Pantazatos, Spiro P., Li, Jianrong, Pavlidis, Paul, Lussier, Yves A.
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2874327/
https://www.ncbi.nlm.nih.gov/pubmed/20495688
_version_ 1782181457832181760
author Pantazatos, Spiro P.
Li, Jianrong
Pavlidis, Paul
Lussier, Yves A.
author_facet Pantazatos, Spiro P.
Li, Jianrong
Pavlidis, Paul
Lussier, Yves A.
author_sort Pantazatos, Spiro P.
collection PubMed
description An approach towards heterogeneous neuroscience dataset integration is proposed that uses Natural Language Processing (NLP) and a knowledge-based phenotype organizer system (PhenOS) to link ontology-anchored terms to underlying data from each database, and then maps these terms based on a computable model of disease (SNOMED CT(®)). The approach was implemented using sample datasets from fMRIDC, GEO, The Whole Brain Atlas and Neuronames, and allowed for complex queries such as “List all disorders with a finding site of brain region X, and then find the semantically related references in all participating databases based on the ontological model of the disease or its anatomical and morphological attributes”. Precision of the NLP-derived coding of the unstructured phenotypes in each dataset was 88% (n = 50), and precision of the semantic mapping between these terms across datasets was 98% (n = 100). To our knowledge, this is the first example of the use of both semantic decomposition of disease relationships and hierarchical information found in ontologies to integrate heterogeneous phenotypes across clinical and molecular datasets.
format Text
id pubmed-2874327
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-28743272010-05-21 Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes Pantazatos, Spiro P. Li, Jianrong Pavlidis, Paul Lussier, Yves A. Cancer Inform Original Research - Special Issue An approach towards heterogeneous neuroscience dataset integration is proposed that uses Natural Language Processing (NLP) and a knowledge-based phenotype organizer system (PhenOS) to link ontology-anchored terms to underlying data from each database, and then maps these terms based on a computable model of disease (SNOMED CT(®)). The approach was implemented using sample datasets from fMRIDC, GEO, The Whole Brain Atlas and Neuronames, and allowed for complex queries such as “List all disorders with a finding site of brain region X, and then find the semantically related references in all participating databases based on the ontological model of the disease or its anatomical and morphological attributes”. Precision of the NLP-derived coding of the unstructured phenotypes in each dataset was 88% (n = 50), and precision of the semantic mapping between these terms across datasets was 98% (n = 100). To our knowledge, this is the first example of the use of both semantic decomposition of disease relationships and hierarchical information found in ontologies to integrate heterogeneous phenotypes across clinical and molecular datasets. Libertas Academica 2009-06-08 /pmc/articles/PMC2874327/ /pubmed/20495688 Text en © the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.
spellingShingle Original Research - Special Issue
Pantazatos, Spiro P.
Li, Jianrong
Pavlidis, Paul
Lussier, Yves A.
Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes
title Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes
title_full Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes
title_fullStr Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes
title_full_unstemmed Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes
title_short Integration of Neuroimaging and Microarray Datasets through Mapping and Model-Theoretic Semantic Decomposition of Unstructured Phenotypes
title_sort integration of neuroimaging and microarray datasets through mapping and model-theoretic semantic decomposition of unstructured phenotypes
topic Original Research - Special Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2874327/
https://www.ncbi.nlm.nih.gov/pubmed/20495688
work_keys_str_mv AT pantazatosspirop integrationofneuroimagingandmicroarraydatasetsthroughmappingandmodeltheoreticsemanticdecompositionofunstructuredphenotypes
AT lijianrong integrationofneuroimagingandmicroarraydatasetsthroughmappingandmodeltheoreticsemanticdecompositionofunstructuredphenotypes
AT pavlidispaul integrationofneuroimagingandmicroarraydatasetsthroughmappingandmodeltheoreticsemanticdecompositionofunstructuredphenotypes
AT lussieryvesa integrationofneuroimagingandmicroarraydatasetsthroughmappingandmodeltheoreticsemanticdecompositionofunstructuredphenotypes