An atomic approach to the design and implementation of a research data warehouse

OBJECTIVE: As a long-standing Clinical and Translational Science Awards (CTSA) Program hub, the University of Pittsburgh and the University of Pittsburgh Medical Center (UPMC) developed and implemented a modern research data warehouse (RDW) to efficiently provision electronic patient data for clinic...

Bibliographic Details
Main Authors: Visweswaran, Shyam, McLay, Brian, Cappella, Nickie, Morris, Michele, Milnes, John T, Reis, Steven E, Silverstein, Jonathan C, Becich, Michael J
Format: Online Article Text
Language: English
Published: Oxford University Press 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922189/
https://www.ncbi.nlm.nih.gov/pubmed/34613409
http://dx.doi.org/10.1093/jamia/ocab204
author Visweswaran, Shyam
McLay, Brian
Cappella, Nickie
Morris, Michele
Milnes, John T
Reis, Steven E
Silverstein, Jonathan C
Becich, Michael J
author_sort Visweswaran, Shyam
collection PubMed
description OBJECTIVE: As a long-standing Clinical and Translational Science Awards (CTSA) Program hub, the University of Pittsburgh and the University of Pittsburgh Medical Center (UPMC) developed and implemented a modern research data warehouse (RDW) to efficiently provision electronic patient data for clinical and translational research. MATERIALS AND METHODS: We designed and implemented an RDW named Neptune to serve the specific needs of our CTSA. Neptune uses an atomic design where data are stored at a high level of granularity as represented in source systems. Neptune contains robust patient identity management tailored for research; integrates patient data from multiple sources, including electronic health records (EHRs), health plans, and research studies; and includes knowledge for mapping to standard terminologies. RESULTS: Neptune contains data for more than 5 million patients longitudinally organized as Health Insurance Portability and Accountability Act (HIPAA) Limited Data with dates and includes structured EHR data, clinical documents, health insurance claims, and research data. Neptune is used as a source for patient data for hundreds of institutional review board-approved research projects by local investigators and for national projects. DISCUSSION: The design of Neptune was heavily influenced by the large size of UPMC, the varied data sources, and the rich partnership between the University and the healthcare system. It includes several unique aspects, including the physical warehouse straddling the University and UPMC networks and management under an HIPAA Business Associates Agreement. CONCLUSION: We describe the design and implementation of an RDW at a large academic healthcare system that uses a distinctive atomic design where data are stored at a high level of granularity.
format Online
Article
Text
id pubmed-8922189
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-8922189 2022-03-15 J Am Med Inform Assoc Research and Applications Oxford University Press 2021-10-06 /pmc/articles/PMC8922189/ /pubmed/34613409 http://dx.doi.org/10.1093/jamia/ocab204 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
title An atomic approach to the design and implementation of a research data warehouse
topic Research and Applications