X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource
BACKGROUND: The National Sleep Research Resource (NSRR) is a large-scale, openly shared, data repository of de-identified, highly curated clinical sleep data from multiple NIH-funded epidemiological studies. Although many data repositories allow users to browse their content, few support fine-graine...
Main Authors: | Cui, Licong, Zeng, Ningzhou, Kim, Matthew, Mueller, Remo, Hankosky, Emily R., Redline, Susan, Zhang, Guo-Qiang |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2018 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234631/ https://www.ncbi.nlm.nih.gov/pubmed/30424756 http://dx.doi.org/10.1186/s12911-018-0682-y |
_version_ | 1783370735190802432 |
---|---|
author | Cui, Licong Zeng, Ningzhou Kim, Matthew Mueller, Remo Hankosky, Emily R. Redline, Susan Zhang, Guo-Qiang |
author_facet | Cui, Licong Zeng, Ningzhou Kim, Matthew Mueller, Remo Hankosky, Emily R. Redline, Susan Zhang, Guo-Qiang |
author_sort | Cui, Licong |
collection | PubMed |
description | BACKGROUND: The National Sleep Research Resource (NSRR) is a large-scale, openly shared, data repository of de-identified, highly curated clinical sleep data from multiple NIH-funded epidemiological studies. Although many data repositories allow users to browse their content, few support fine-grained, cross-cohort query and exploration at study-subject level. We introduce a cross-cohort query and exploration system, called X-search, to enable researchers to query patient cohort counts across a growing number of completed, NIH-funded studies in NSRR and explore the feasibility or likelihood of reusing the data for research studies. METHODS: X-search has been designed as a general framework with two loosely-coupled components: semantically annotated data repository and cross-cohort exploration engine. The semantically annotated data repository is comprised of a canonical data dictionary, data sources with a data dictionary, and mappings between each individual data dictionary and the canonical data dictionary. The cross-cohort exploration engine consists of five modules: query builder, graphical exploration, case-control exploration, query translation, and query execution. The canonical data dictionary serves as the unified metadata to drive the visual exploration interfaces and facilitate query translation through the mappings. RESULTS: X-search is publicly available at https://www.x-search.net/ with nine NSRR datasets consisting of over 26,000 unique subjects. The canonical data dictionary contains over 900 common data elements across the datasets. X-search has received over 1800 cross-cohort queries by users from 16 countries. CONCLUSIONS: X-search provides a powerful cross-cohort exploration interface for querying and exploring heterogeneous datasets in the NSRR data repository, so as to enable researchers to evaluate the feasibility of potential research studies and generate potential hypotheses using the NSRR data. |
format | Online Article Text |
id | pubmed-6234631 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-6234631 2018-11-23 X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource Cui, Licong Zeng, Ningzhou Kim, Matthew Mueller, Remo Hankosky, Emily R. Redline, Susan Zhang, Guo-Qiang BMC Med Inform Decis Mak Research Article BACKGROUND: The National Sleep Research Resource (NSRR) is a large-scale, openly shared, data repository of de-identified, highly curated clinical sleep data from multiple NIH-funded epidemiological studies. Although many data repositories allow users to browse their content, few support fine-grained, cross-cohort query and exploration at study-subject level. We introduce a cross-cohort query and exploration system, called X-search, to enable researchers to query patient cohort counts across a growing number of completed, NIH-funded studies in NSRR and explore the feasibility or likelihood of reusing the data for research studies. METHODS: X-search has been designed as a general framework with two loosely-coupled components: semantically annotated data repository and cross-cohort exploration engine. The semantically annotated data repository is comprised of a canonical data dictionary, data sources with a data dictionary, and mappings between each individual data dictionary and the canonical data dictionary. The cross-cohort exploration engine consists of five modules: query builder, graphical exploration, case-control exploration, query translation, and query execution. The canonical data dictionary serves as the unified metadata to drive the visual exploration interfaces and facilitate query translation through the mappings. RESULTS: X-search is publicly available at https://www.x-search.net/ with nine NSRR datasets consisting of over 26,000 unique subjects. The canonical data dictionary contains over 900 common data elements across the datasets. X-search has received over 1800 cross-cohort queries by users from 16 countries. CONCLUSIONS: X-search provides a powerful cross-cohort exploration interface for querying and exploring heterogeneous datasets in the NSRR data repository, so as to enable researchers to evaluate the feasibility of potential research studies and generate potential hypotheses using the NSRR data. BioMed Central 2018-11-13 /pmc/articles/PMC6234631/ /pubmed/30424756 http://dx.doi.org/10.1186/s12911-018-0682-y Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Cui, Licong Zeng, Ningzhou Kim, Matthew Mueller, Remo Hankosky, Emily R. Redline, Susan Zhang, Guo-Qiang X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource |
title | X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource |
title_full | X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource |
title_fullStr | X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource |
title_full_unstemmed | X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource |
title_short | X-search: an open access interface for cross-cohort exploration of the National Sleep Research Resource |
title_sort | x-search: an open access interface for cross-cohort exploration of the national sleep research resource |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6234631/ https://www.ncbi.nlm.nih.gov/pubmed/30424756 http://dx.doi.org/10.1186/s12911-018-0682-y |
work_keys_str_mv | AT cuilicong xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource AT zengningzhou xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource AT kimmatthew xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource AT muellerremo xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource AT hankoskyemilyr xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource AT redlinesusan xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource AT zhangguoqiang xsearchanopenaccessinterfaceforcrosscohortexplorationofthenationalsleepresearchresource |