Cargando…

Learning important common data elements from shared study data: The All of Us program analysis

There are many initiatives attempting to harmonize data collection across human clinical studies using common data elements (CDEs). The increased use of CDEs in large prior studies can guide researchers planning new studies. For that purpose, we analyzed the All of Us (AoU) program, an ongoing US st...

Descripción completa

Detalles Bibliográficos
Autores principales: Mayer, Craig S., Huser, Vojtech
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10328251/
https://www.ncbi.nlm.nih.gov/pubmed/37418391
http://dx.doi.org/10.1371/journal.pone.0283601
_version_ 1785069754819543040
author Mayer, Craig S.
Huser, Vojtech
author_facet Mayer, Craig S.
Huser, Vojtech
author_sort Mayer, Craig S.
collection PubMed
description There are many initiatives attempting to harmonize data collection across human clinical studies using common data elements (CDEs). The increased use of CDEs in large prior studies can guide researchers planning new studies. For that purpose, we analyzed the All of Us (AoU) program, an ongoing US study intending to enroll one million participants and serve as a platform for numerous observational analyses. AoU adopted the OMOP Common Data Model to standardize both research (Case Report Form [CRF]) and real-world (imported from Electronic Health Records [EHRs]) data. AoU standardized specific data elements and values by including CDEs from terminologies such as LOINC and SNOMED CT. For this study, we defined all elements from established terminologies as CDEs and all custom concepts created in the Participant Provided Information (PPI) terminology as unique data elements (UDEs). We found 1 033 research elements, 4 592 element-value combinations and 932 distinct values. Most elements were UDEs (869, 84.1%), while most CDEs were from LOINC (103 elements, 10.0%) or SNOMED CT (60, 5.8%). Of the LOINC CDEs, 87 (53.1% of 164 CDEs) originated from previous data collection initiatives, such as PhenX (17 CDEs) and PROMIS (15 CDEs). On a CRF level, The Basics (12 of 21 elements, 57.1%) and Lifestyle (10 of 14, 71.4%) were the only CRFs with multiple CDEs. On a value level, 61.7% of distinct values are from an established terminology. AoU demonstrates the use of the OMOP model for integrating research and routine healthcare data (64 elements in both contexts), which allows for monitoring lifestyle and health changes outside the research setting. The increased inclusion of CDEs in large studies (like AoU) is important in facilitating the use of existing tools and improving the ease of understanding and analyzing the data collected, which is more challenging when using study specific formats.
format Online
Article
Text
id pubmed-10328251
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-103282512023-07-08 Learning important common data elements from shared study data: The All of Us program analysis Mayer, Craig S. Huser, Vojtech PLoS One Research Article There are many initiatives attempting to harmonize data collection across human clinical studies using common data elements (CDEs). The increased use of CDEs in large prior studies can guide researchers planning new studies. For that purpose, we analyzed the All of Us (AoU) program, an ongoing US study intending to enroll one million participants and serve as a platform for numerous observational analyses. AoU adopted the OMOP Common Data Model to standardize both research (Case Report Form [CRF]) and real-world (imported from Electronic Health Records [EHRs]) data. AoU standardized specific data elements and values by including CDEs from terminologies such as LOINC and SNOMED CT. For this study, we defined all elements from established terminologies as CDEs and all custom concepts created in the Participant Provided Information (PPI) terminology as unique data elements (UDEs). We found 1 033 research elements, 4 592 element-value combinations and 932 distinct values. Most elements were UDEs (869, 84.1%), while most CDEs were from LOINC (103 elements, 10.0%) or SNOMED CT (60, 5.8%). Of the LOINC CDEs, 87 (53.1% of 164 CDEs) originated from previous data collection initiatives, such as PhenX (17 CDEs) and PROMIS (15 CDEs). On a CRF level, The Basics (12 of 21 elements, 57.1%) and Lifestyle (10 of 14, 71.4%) were the only CRFs with multiple CDEs. On a value level, 61.7% of distinct values are from an established terminology. AoU demonstrates the use of the OMOP model for integrating research and routine healthcare data (64 elements in both contexts), which allows for monitoring lifestyle and health changes outside the research setting. The increased inclusion of CDEs in large studies (like AoU) is important in facilitating the use of existing tools and improving the ease of understanding and analyzing the data collected, which is more challenging when using study specific formats. Public Library of Science 2023-07-07 /pmc/articles/PMC10328251/ /pubmed/37418391 http://dx.doi.org/10.1371/journal.pone.0283601 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Mayer, Craig S.
Huser, Vojtech
Learning important common data elements from shared study data: The All of Us program analysis
title Learning important common data elements from shared study data: The All of Us program analysis
title_full Learning important common data elements from shared study data: The All of Us program analysis
title_fullStr Learning important common data elements from shared study data: The All of Us program analysis
title_full_unstemmed Learning important common data elements from shared study data: The All of Us program analysis
title_short Learning important common data elements from shared study data: The All of Us program analysis
title_sort learning important common data elements from shared study data: the all of us program analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10328251/
https://www.ncbi.nlm.nih.gov/pubmed/37418391
http://dx.doi.org/10.1371/journal.pone.0283601
work_keys_str_mv AT mayercraigs learningimportantcommondataelementsfromsharedstudydatatheallofusprogramanalysis
AT huservojtech learningimportantcommondataelementsfromsharedstudydatatheallofusprogramanalysis