Cargando…

X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource

BACKGROUND: The National Sleep Research Resource (NSRR) is a large-scale, openly shared, data repository of de-identified, highly curated clinical sleep data from multiple NIH-funded epidemiological studies. Although many data repositories allow users to browse their content, few support fine-graine...

Descripción completa

Detalles Bibliográficos
Autores principales: Cui, Licong, Zeng, Ningzhou, Kim, Matthew, Mueller, Remo, Hankosky, Emily R., Redline, Susan, Zhang, Guo-Qiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234631/
https://www.ncbi.nlm.nih.gov/pubmed/30424756
http://dx.doi.org/10.1186/s12911-018-0682-y
_version_ 1783370735190802432
author Cui, Licong
Zeng, Ningzhou
Kim, Matthew
Mueller, Remo
Hankosky, Emily R.
Redline, Susan
Zhang, Guo-Qiang
author_facet Cui, Licong
Zeng, Ningzhou
Kim, Matthew
Mueller, Remo
Hankosky, Emily R.
Redline, Susan
Zhang, Guo-Qiang
author_sort Cui, Licong
collection PubMed
description BACKGROUND: The National Sleep Research Resource (NSRR) is a large-scale, openly shared, data repository of de-identified, highly curated clinical sleep data from multiple NIH-funded epidemiological studies. Although many data repositories allow users to browse their content, few support fine-grained, cross-cohort query and exploration at study-subject level. We introduce a cross-cohort query and exploration system, called X-search, to enable researchers to query patient cohort counts across a growing number of completed, NIH-funded studies in NSRR and explore the feasibility or likelihood of reusing the data for research studies. METHODS: X-search has been designed as a general framework with two loosely-coupled components: semantically annotated data repository and cross-cohort exploration engine. The semantically annotated data repository is comprised of a canonical data dictionary, data sources with a data dictionary, and mappings between each individual data dictionary and the canonical data dictionary. The cross-cohort exploration engine consists of five modules: query builder, graphical exploration, case-control exploration, query translation, and query execution. The canonical data dictionary serves as the unified metadata to drive the visual exploration interfaces and facilitate query translation through the mappings. RESULTS: X-search is publicly available at https://www.x-search.net/with nine NSRR datasets consisting of over 26,000 unique subjects. The canonical data dictionary contains over 900 common data elements across the datasets. X-search has received over 1800 cross-cohort queries by users from 16 countries. CONCLUSIONS: X-search provides a powerful cross-cohort exploration interface for querying and exploring heterogeneous datasets in the NSRR data repository, so as to enable researchers to evaluate the feasibility of potential research studies and generate potential hypotheses using the NSRR data.
format Online
Article
Text
id pubmed-6234631
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62346312018-11-23 X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource Cui, Licong Zeng, Ningzhou Kim, Matthew Mueller, Remo Hankosky, Emily R. Redline, Susan Zhang, Guo-Qiang BMC Med Inform Decis Mak Research Article BACKGROUND: The National Sleep Research Resource (NSRR) is a large-scale, openly shared, data repository of de-identified, highly curated clinical sleep data from multiple NIH-funded epidemiological studies. Although many data repositories allow users to browse their content, few support fine-grained, cross-cohort query and exploration at study-subject level. We introduce a cross-cohort query and exploration system, called X-search, to enable researchers to query patient cohort counts across a growing number of completed, NIH-funded studies in NSRR and explore the feasibility or likelihood of reusing the data for research studies. METHODS: X-search has been designed as a general framework with two loosely-coupled components: semantically annotated data repository and cross-cohort exploration engine. The semantically annotated data repository is comprised of a canonical data dictionary, data sources with a data dictionary, and mappings between each individual data dictionary and the canonical data dictionary. The cross-cohort exploration engine consists of five modules: query builder, graphical exploration, case-control exploration, query translation, and query execution. The canonical data dictionary serves as the unified metadata to drive the visual exploration interfaces and facilitate query translation through the mappings. RESULTS: X-search is publicly available at https://www.x-search.net/with nine NSRR datasets consisting of over 26,000 unique subjects. The canonical data dictionary contains over 900 common data elements across the datasets. X-search has received over 1800 cross-cohort queries by users from 16 countries. CONCLUSIONS: X-search provides a powerful cross-cohort exploration interface for querying and exploring heterogeneous datasets in the NSRR data repository, so as to enable researchers to evaluate the feasibility of potential research studies and generate potential hypotheses using the NSRR data. BioMed Central 2018-11-13 /pmc/articles/PMC6234631/ /pubmed/30424756 http://dx.doi.org/10.1186/s12911-018-0682-y Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Cui, Licong
Zeng, Ningzhou
Kim, Matthew
Mueller, Remo
Hankosky, Emily R.
Redline, Susan
Zhang, Guo-Qiang
X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource
title X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource
title_full X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource
title_fullStr X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource
title_full_unstemmed X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource
title_short X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource
title_sort x-search: an open access interface for cross-cohort exploration of the national sleep research resource
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234631/
https://www.ncbi.nlm.nih.gov/pubmed/30424756
http://dx.doi.org/10.1186/s12911-018-0682-y
work_keys_str_mv AT cuilicong xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource
AT zengningzhou xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource
AT kimmatthew xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource
AT muellerremo xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource
AT hankoskyemilyr xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource
AT redlinesusan xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource
AT zhangguoqiang xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource