Cargando…

The Stanford Medicine data science ecosystem for clinical and translational research

OBJECTIVE: To describe the infrastructure, tools, and services developed at Stanford Medicine to maintain its data science ecosystem and research patient data repository for clinical and translational research. MATERIALS AND METHODS: The data science ecosystem, dubbed the Stanford Data Science Resou...

Descripción completa

Detalles Bibliográficos
Autores principales: Callahan, Alison, Ashley, Euan, Datta, Somalee, Desai, Priyamvada, Ferris, Todd A, Fries, Jason A, Halaas, Michael, Langlotz, Curtis P, Mackey, Sean, Posada, José D, Pfeffer, Michael A, Shah, Nigam H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10397535/
https://www.ncbi.nlm.nih.gov/pubmed/37545984
http://dx.doi.org/10.1093/jamiaopen/ooad054
_version_ 1785083933856104448
author Callahan, Alison
Ashley, Euan
Datta, Somalee
Desai, Priyamvada
Ferris, Todd A
Fries, Jason A
Halaas, Michael
Langlotz, Curtis P
Mackey, Sean
Posada, José D
Pfeffer, Michael A
Shah, Nigam H
author_facet Callahan, Alison
Ashley, Euan
Datta, Somalee
Desai, Priyamvada
Ferris, Todd A
Fries, Jason A
Halaas, Michael
Langlotz, Curtis P
Mackey, Sean
Posada, José D
Pfeffer, Michael A
Shah, Nigam H
author_sort Callahan, Alison
collection PubMed
description OBJECTIVE: To describe the infrastructure, tools, and services developed at Stanford Medicine to maintain its data science ecosystem and research patient data repository for clinical and translational research. MATERIALS AND METHODS: The data science ecosystem, dubbed the Stanford Data Science Resources (SDSR), includes infrastructure and tools to create, search, retrieve, and analyze patient data, as well as services for data deidentification, linkage, and processing to extract high-value information from healthcare IT systems. Data are made available via self-service and concierge access, on HIPAA compliant secure computing infrastructure supported by in-depth user training. RESULTS: The Stanford Medicine Research Data Repository (STARR) functions as the SDSR data integration point, and includes electronic medical records, clinical images, text, bedside monitoring data and HL7 messages. SDSR tools include tools for electronic phenotyping, cohort building, and a search engine for patient timelines. The SDSR supports patient data collection, reproducible research, and teaching using healthcare data, and facilitates industry collaborations and large-scale observational studies. DISCUSSION: Research patient data repositories and their underlying data science infrastructure are essential to realizing a learning health system and advancing the mission of academic medical centers. Challenges to maintaining the SDSR include ensuring sufficient financial support while providing researchers and clinicians with maximal access to data and digital infrastructure, balancing tool development with user training, and supporting the diverse needs of users. CONCLUSION: Our experience maintaining the SDSR offers a case study for academic medical centers developing data science and research informatics infrastructure.
format Online
Article
Text
id pubmed-10397535
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103975352023-08-04 The Stanford Medicine data science ecosystem for clinical and translational research Callahan, Alison Ashley, Euan Datta, Somalee Desai, Priyamvada Ferris, Todd A Fries, Jason A Halaas, Michael Langlotz, Curtis P Mackey, Sean Posada, José D Pfeffer, Michael A Shah, Nigam H JAMIA Open Research and Applications OBJECTIVE: To describe the infrastructure, tools, and services developed at Stanford Medicine to maintain its data science ecosystem and research patient data repository for clinical and translational research. MATERIALS AND METHODS: The data science ecosystem, dubbed the Stanford Data Science Resources (SDSR), includes infrastructure and tools to create, search, retrieve, and analyze patient data, as well as services for data deidentification, linkage, and processing to extract high-value information from healthcare IT systems. Data are made available via self-service and concierge access, on HIPAA compliant secure computing infrastructure supported by in-depth user training. RESULTS: The Stanford Medicine Research Data Repository (STARR) functions as the SDSR data integration point, and includes electronic medical records, clinical images, text, bedside monitoring data and HL7 messages. SDSR tools include tools for electronic phenotyping, cohort building, and a search engine for patient timelines. The SDSR supports patient data collection, reproducible research, and teaching using healthcare data, and facilitates industry collaborations and large-scale observational studies. DISCUSSION: Research patient data repositories and their underlying data science infrastructure are essential to realizing a learning health system and advancing the mission of academic medical centers. Challenges to maintaining the SDSR include ensuring sufficient financial support while providing researchers and clinicians with maximal access to data and digital infrastructure, balancing tool development with user training, and supporting the diverse needs of users. CONCLUSION: Our experience maintaining the SDSR offers a case study for academic medical centers developing data science and research informatics infrastructure. Oxford University Press 2023-08-02 /pmc/articles/PMC10397535/ /pubmed/37545984 http://dx.doi.org/10.1093/jamiaopen/ooad054 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Callahan, Alison
Ashley, Euan
Datta, Somalee
Desai, Priyamvada
Ferris, Todd A
Fries, Jason A
Halaas, Michael
Langlotz, Curtis P
Mackey, Sean
Posada, José D
Pfeffer, Michael A
Shah, Nigam H
The Stanford Medicine data science ecosystem for clinical and translational research
title The Stanford Medicine data science ecosystem for clinical and translational research
title_full The Stanford Medicine data science ecosystem for clinical and translational research
title_fullStr The Stanford Medicine data science ecosystem for clinical and translational research
title_full_unstemmed The Stanford Medicine data science ecosystem for clinical and translational research
title_short The Stanford Medicine data science ecosystem for clinical and translational research
title_sort stanford medicine data science ecosystem for clinical and translational research
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10397535/
https://www.ncbi.nlm.nih.gov/pubmed/37545984
http://dx.doi.org/10.1093/jamiaopen/ooad054
work_keys_str_mv AT callahanalison thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT ashleyeuan thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT dattasomalee thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT desaipriyamvada thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT ferristodda thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT friesjasona thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT halaasmichael thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT langlotzcurtisp thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT mackeysean thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT posadajosed thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT pfeffermichaela thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT shahnigamh thestanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT callahanalison stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT ashleyeuan stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT dattasomalee stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT desaipriyamvada stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT ferristodda stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT friesjasona stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT halaasmichael stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT langlotzcurtisp stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT mackeysean stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT posadajosed stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT pfeffermichaela stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch
AT shahnigamh stanfordmedicinedatascienceecosystemforclinicalandtranslationalresearch