A system to build distributed multivariate models and manage disparate data sharing policies: implementation in the scalable national network for effectiveness research

Background Centralized and federated models for sharing data in research networks currently exist. To build multivariate data analysis for centralized networks, transfer of patient-level data to a central computation resource is necessary. The authors implemented distributed multivariate models for...

Bibliographic Details
Main authors: Meeker, Daniella, Jiang, Xiaoqian, Matheny, Michael E, Farcas, Claudiu, D’Arcy, Michel, Pearlman, Laura, Nookala, Lavanya, Day, Michele E, Kim, Katherine K, Kim, Hyeoneui, Boxwala, Aziz, El-Kareh, Robert, Kuo, Grace M, Resnic, Frederic S, Kesselman, Carl, Ohno-Machado, Lucila
Format: Online Article Text
Language: English
Published: Oxford University Press 2015
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4639714/
https://www.ncbi.nlm.nih.gov/pubmed/26142423
http://dx.doi.org/10.1093/jamia/ocv017
_version_ 1782399969277247488
author Meeker, Daniella
Jiang, Xiaoqian
Matheny, Michael E
Farcas, Claudiu
D’Arcy, Michel
Pearlman, Laura
Nookala, Lavanya
Day, Michele E
Kim, Katherine K
Kim, Hyeoneui
Boxwala, Aziz
El-Kareh, Robert
Kuo, Grace M
Resnic, Frederic S
Kesselman, Carl
Ohno-Machado, Lucila
author_facet Meeker, Daniella
Jiang, Xiaoqian
Matheny, Michael E
Farcas, Claudiu
D’Arcy, Michel
Pearlman, Laura
Nookala, Lavanya
Day, Michele E
Kim, Katherine K
Kim, Hyeoneui
Boxwala, Aziz
El-Kareh, Robert
Kuo, Grace M
Resnic, Frederic S
Kesselman, Carl
Ohno-Machado, Lucila
author_sort Meeker, Daniella
collection PubMed
description Background: Centralized and federated models for sharing data in research networks currently exist. To build multivariate data analysis for centralized networks, transfer of patient-level data to a central computation resource is necessary. The authors implemented distributed multivariate models for federated networks in which patient-level data is kept at each site and data exchange policies are managed in a study-centric manner.
Objective: The objective was to implement infrastructure that supports the functionality of some existing research networks (e.g., cohort discovery, workflow management, and estimation of multivariate analytic models on centralized data) while adding additional important new features, such as algorithms for distributed iterative multivariate models, a graphical interface for multivariate model specification, synchronous and asynchronous response to network queries, investigator-initiated studies, and study-based control of staff, protocols, and data sharing policies.
Materials and Methods: Based on the requirements gathered from statisticians, administrators, and investigators from multiple institutions, the authors developed infrastructure and tools to support multisite comparative effectiveness studies using web services for multivariate statistical estimation in the SCANNER federated network.
Results: The authors implemented massively parallel (map-reduce) computation methods and a new policy management system to enable each study initiated by network participants to define the ways in which data may be processed, managed, queried, and shared. The authors illustrated the use of these systems among institutions with highly different policies and operating under different state laws.
Discussion and Conclusion: Federated research networks need not limit distributed query functionality to count queries, cohort discovery, or independently estimated analytic models. Multivariate analyses can be efficiently and securely conducted without patient-level data transport, allowing institutions with strict local data storage requirements to participate in sophisticated analyses based on federated research networks.
format Online
Article
Text
id pubmed-4639714
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-46397142016-11-01 A system to build distributed multivariate models and manage disparate data sharing policies: implementation in the scalable national network for effectiveness research Meeker, Daniella Jiang, Xiaoqian Matheny, Michael E Farcas, Claudiu D’Arcy, Michel Pearlman, Laura Nookala, Lavanya Day, Michele E Kim, Katherine K Kim, Hyeoneui Boxwala, Aziz El-Kareh, Robert Kuo, Grace M Resnic, Frederic S Kesselman, Carl Ohno-Machado, Lucila J Am Med Inform Assoc Research and Applications Background Centralized and federated models for sharing data in research networks currently exist. To build multivariate data analysis for centralized networks, transfer of patient-level data to a central computation resource is necessary. The authors implemented distributed multivariate models for federated networks in which patient-level data is kept at each site and data exchange policies are managed in a study-centric manner. Objective The objective was to implement infrastructure that supports the functionality of some existing research networks (e.g., cohort discovery, workflow management, and estimation of multivariate analytic models on centralized data) while adding additional important new features, such as algorithms for distributed iterative multivariate models, a graphical interface for multivariate model specification, synchronous and asynchronous response to network queries, investigator-initiated studies, and study-based control of staff, protocols, and data sharing policies. Materials and Methods Based on the requirements gathered from statisticians, administrators, and investigators from multiple institutions, the authors developed infrastructure and tools to support multisite comparative effectiveness studies using web services for multivariate statistical estimation in the SCANNER federated network. 
Results The authors implemented massively parallel (map-reduce) computation methods and a new policy management system to enable each study initiated by network participants to define the ways in which data may be processed, managed, queried, and shared. The authors illustrated the use of these systems among institutions with highly different policies and operating under different state laws. Discussion and Conclusion Federated research networks need not limit distributed query functionality to count queries, cohort discovery, or independently estimated analytic models. Multivariate analyses can be efficiently and securely conducted without patient-level data transport, allowing institutions with strict local data storage requirements to participate in sophisticated analyses based on federated research networks. Oxford University Press 2015-11 2015-07-03 /pmc/articles/PMC4639714/ /pubmed/26142423 http://dx.doi.org/10.1093/jamia/ocv017 Text en © The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research and Applications
Meeker, Daniella
Jiang, Xiaoqian
Matheny, Michael E
Farcas, Claudiu
D’Arcy, Michel
Pearlman, Laura
Nookala, Lavanya
Day, Michele E
Kim, Katherine K
Kim, Hyeoneui
Boxwala, Aziz
El-Kareh, Robert
Kuo, Grace M
Resnic, Frederic S
Kesselman, Carl
Ohno-Machado, Lucila
A system to build distributed multivariate models and manage disparate data sharing policies: implementation in the scalable national network for effectiveness research
title A system to build distributed multivariate models and manage disparate data sharing policies: implementation in the scalable national network for effectiveness research
title_sort system to build distributed multivariate models and manage disparate data sharing policies: implementation in the scalable national network for effectiveness research
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4639714/
https://www.ncbi.nlm.nih.gov/pubmed/26142423
http://dx.doi.org/10.1093/jamia/ocv017