Cargando…

An ontology-based approach for harmonization and cross-cohort query of Alzheimer’s disease data resources

BACKGROUND: In the United States, the National Alzheimer’s Coordinating Center (NACC) and the Alzheimer’s Disease Neuroimaging Initiative (ADNI) are two major data sharing resources for Alzheimer’s Disease (AD) research. NACC and ADNI strive to make their data more FAIR (findable, interoperable, acc...

Descripción completa

Detalles Bibliográficos
Autores principales: Hao, Xubing, Li, Xiaojin, Zhang, Guo-Qiang, Tao, Cui, Schulz, Paul E., Cui, Licong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10401730/
https://www.ncbi.nlm.nih.gov/pubmed/37542312
http://dx.doi.org/10.1186/s12911-023-02250-z
_version_ 1785084725265694720
author Hao, Xubing
Li, Xiaojin
Zhang, Guo-Qiang
Tao, Cui
Schulz, Paul E.
Cui, Licong
author_facet Hao, Xubing
Li, Xiaojin
Zhang, Guo-Qiang
Tao, Cui
Schulz, Paul E.
Cui, Licong
author_sort Hao, Xubing
collection PubMed
description BACKGROUND: In the United States, the National Alzheimer’s Coordinating Center (NACC) and the Alzheimer’s Disease Neuroimaging Initiative (ADNI) are two major data sharing resources for Alzheimer’s Disease (AD) research. NACC and ADNI strive to make their data more FAIR (findable, interoperable, accessible and reusable) for the broader research community. However, there is limited work harmonizing and supporting cross-cohort interoperability of the two resources. METHOD: In this paper, we leverage an ontology-based approach to harmonize data elements in the two resources and develop a web-based query system to search patient cohorts across the two resources. We first mapped data elements across NACC and ADNI, and performed value harmonization for the mapped data elements with inconsistent permissible values. Then we built an Alzheimer’s Disease Data Element Ontology (ADEO) to model the mapped data elements in NACC and ADNI. We further developed a prototype cross-cohort query system to search patient cohorts across NACC and ADNI. RESULTS: After manual review, we found 172 mappings between NACC and ADNI. These 172 mappings were further used to construct common concepts in ADEO. Our data element mapping and harmonization resulted in five files storing common concepts, variables in NACC and ADNI, mappings between variables and common concepts, permissible values of categorical type data elements, and coding inconsistency harmonization, respectively. Our cross-cohort query system consists of three core architectural elements: a web-based interface, an advanced query engine, and a backend MongoDB database. CONCLUSIONS: In this work, ADEO has been specifically designed to facilitate data harmonization and cross-cohort query of NACC and ADNI data resources. Although our prototype cross-cohort query system was developed for exploring NACC and ADNI, its backend and frontend framework has been designed and implemented to be generally applicable to other domains for querying patient cohorts from multiple heterogeneous data sources.
format Online
Article
Text
id pubmed-10401730
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-104017302023-08-05 An ontology-based approach for harmonization and cross-cohort query of Alzheimer’s disease data resources Hao, Xubing Li, Xiaojin Zhang, Guo-Qiang Tao, Cui Schulz, Paul E. Cui, Licong BMC Med Inform Decis Mak Research BACKGROUND: In the United States, the National Alzheimer’s Coordinating Center (NACC) and the Alzheimer’s Disease Neuroimaging Initiative (ADNI) are two major data sharing resources for Alzheimer’s Disease (AD) research. NACC and ADNI strive to make their data more FAIR (findable, interoperable, accessible and reusable) for the broader research community. However, there is limited work harmonizing and supporting cross-cohort interoperability of the two resources. METHOD: In this paper, we leverage an ontology-based approach to harmonize data elements in the two resources and develop a web-based query system to search patient cohorts across the two resources. We first mapped data elements across NACC and ADNI, and performed value harmonization for the mapped data elements with inconsistent permissible values. Then we built an Alzheimer’s Disease Data Element Ontology (ADEO) to model the mapped data elements in NACC and ADNI. We further developed a prototype cross-cohort query system to search patient cohorts across NACC and ADNI. RESULTS: After manual review, we found 172 mappings between NACC and ADNI. These 172 mappings were further used to construct common concepts in ADEO. Our data element mapping and harmonization resulted in five files storing common concepts, variables in NACC and ADNI, mappings between variables and common concepts, permissible values of categorical type data elements, and coding inconsistency harmonization, respectively. Our cross-cohort query system consists of three core architectural elements: a web-based interface, an advanced query engine, and a backend MongoDB database. CONCLUSIONS: In this work, ADEO has been specifically designed to facilitate data harmonization and cross-cohort query of NACC and ADNI data resources. Although our prototype cross-cohort query system was developed for exploring NACC and ADNI, its backend and frontend framework has been designed and implemented to be generally applicable to other domains for querying patient cohorts from multiple heterogeneous data sources. BioMed Central 2023-08-04 /pmc/articles/PMC10401730/ /pubmed/37542312 http://dx.doi.org/10.1186/s12911-023-02250-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Hao, Xubing
Li, Xiaojin
Zhang, Guo-Qiang
Tao, Cui
Schulz, Paul E.
Cui, Licong
An ontology-based approach for harmonization and cross-cohort query of Alzheimer’s disease data resources
title An ontology-based approach for harmonization and cross-cohort query of Alzheimer’s disease data resources
title_full An ontology-based approach for harmonization and cross-cohort query of Alzheimer’s disease data resources
title_fullStr An ontology-based approach for harmonization and cross-cohort query of Alzheimer’s disease data resources
title_full_unstemmed An ontology-based approach for harmonization and cross-cohort query of Alzheimer’s disease data resources
title_short An ontology-based approach for harmonization and cross-cohort query of Alzheimer’s disease data resources
title_sort ontology-based approach for harmonization and cross-cohort query of alzheimer’s disease data resources
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10401730/
https://www.ncbi.nlm.nih.gov/pubmed/37542312
http://dx.doi.org/10.1186/s12911-023-02250-z
work_keys_str_mv AT haoxubing anontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT lixiaojin anontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT zhangguoqiang anontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT taocui anontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT schulzpaule anontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT anontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT cuilicong anontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT haoxubing ontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT lixiaojin ontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT zhangguoqiang ontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT taocui ontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT schulzpaule ontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT ontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources
AT cuilicong ontologybasedapproachforharmonizationandcrosscohortqueryofalzheimersdiseasedataresources