Cargando…

Migrating a research data warehouse to a public cloud: challenges and opportunities

OBJECTIVE: Clinical research data warehouses (RDWs) linked to genomic pipelines and open data archives are being created to support innovative, complex data-driven discoveries. The computing and storage needs of these research environments may quickly exceed the capacity of on-premises systems. New...

Descripción completa

Detalles Bibliográficos
Autores principales: Kahn, Michael G, Mui, Joyce Y, Ames, Michael J, Yamsani, Anoop K, Pozdeyev, Nikita, Rafaels, Nicholas, Brooks, Ian M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922165/
https://www.ncbi.nlm.nih.gov/pubmed/34919694
http://dx.doi.org/10.1093/jamia/ocab278
_version_ 1784669474019868672
author Kahn, Michael G
Mui, Joyce Y
Ames, Michael J
Yamsani, Anoop K
Pozdeyev, Nikita
Rafaels, Nicholas
Brooks, Ian M
author_facet Kahn, Michael G
Mui, Joyce Y
Ames, Michael J
Yamsani, Anoop K
Pozdeyev, Nikita
Rafaels, Nicholas
Brooks, Ian M
author_sort Kahn, Michael G
collection PubMed
description OBJECTIVE: Clinical research data warehouses (RDWs) linked to genomic pipelines and open data archives are being created to support innovative, complex data-driven discoveries. The computing and storage needs of these research environments may quickly exceed the capacity of on-premises systems. New RDWs are migrating to cloud platforms for the scalability and flexibility needed to meet these challenges. We describe our experience in migrating a multi-institutional RDW to a public cloud. MATERIALS AND METHODS: This study is descriptive. Primary materials included internal and public presentations before and after the transition, analysis documents, and actual billing records. Findings were aggregated into topical categories. RESULTS: Eight categories of migration issues were identified. Unanticipated challenges included legacy system limitations; network, computing, and storage architectures that realize performance and cost benefits in the face of hyper-innovation, complex security reviews and approvals, and limited cloud consulting expertise. DISCUSSION: Cloud architectures enable previously unavailable capabilities, but numerous pitfalls can impede realizing the full benefits of a cloud environment. Rapid changes in cloud capabilities can quickly obsolete existing architectures and associated institutional policies. Touchpoints with on-premise networks and systems can add unforeseen complexity. Governance, resource management, and cost oversight are critical to allow rapid innovation while minimizing wasted resources and unnecessary costs. CONCLUSIONS: Migrating our RDW to the cloud has enabled capabilities and innovations that would not have been possible with an on-premises environment. Notwithstanding the challenges of managing cloud resources, the resulting RDW capabilities have been highly positive to our institution, research community, and partners.
format Online
Article
Text
id pubmed-8922165
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-89221652022-03-15 Migrating a research data warehouse to a public cloud: challenges and opportunities Kahn, Michael G Mui, Joyce Y Ames, Michael J Yamsani, Anoop K Pozdeyev, Nikita Rafaels, Nicholas Brooks, Ian M J Am Med Inform Assoc Research and Applications OBJECTIVE: Clinical research data warehouses (RDWs) linked to genomic pipelines and open data archives are being created to support innovative, complex data-driven discoveries. The computing and storage needs of these research environments may quickly exceed the capacity of on-premises systems. New RDWs are migrating to cloud platforms for the scalability and flexibility needed to meet these challenges. We describe our experience in migrating a multi-institutional RDW to a public cloud. MATERIALS AND METHODS: This study is descriptive. Primary materials included internal and public presentations before and after the transition, analysis documents, and actual billing records. Findings were aggregated into topical categories. RESULTS: Eight categories of migration issues were identified. Unanticipated challenges included legacy system limitations; network, computing, and storage architectures that realize performance and cost benefits in the face of hyper-innovation, complex security reviews and approvals, and limited cloud consulting expertise. DISCUSSION: Cloud architectures enable previously unavailable capabilities, but numerous pitfalls can impede realizing the full benefits of a cloud environment. Rapid changes in cloud capabilities can quickly obsolete existing architectures and associated institutional policies. Touchpoints with on-premise networks and systems can add unforeseen complexity. Governance, resource management, and cost oversight are critical to allow rapid innovation while minimizing wasted resources and unnecessary costs. CONCLUSIONS: Migrating our RDW to the cloud has enabled capabilities and innovations that would not have been possible with an on-premises environment. Notwithstanding the challenges of managing cloud resources, the resulting RDW capabilities have been highly positive to our institution, research community, and partners. Oxford University Press 2021-12-17 /pmc/articles/PMC8922165/ /pubmed/34919694 http://dx.doi.org/10.1093/jamia/ocab278 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Kahn, Michael G
Mui, Joyce Y
Ames, Michael J
Yamsani, Anoop K
Pozdeyev, Nikita
Rafaels, Nicholas
Brooks, Ian M
Migrating a research data warehouse to a public cloud: challenges and opportunities
title Migrating a research data warehouse to a public cloud: challenges and opportunities
title_full Migrating a research data warehouse to a public cloud: challenges and opportunities
title_fullStr Migrating a research data warehouse to a public cloud: challenges and opportunities
title_full_unstemmed Migrating a research data warehouse to a public cloud: challenges and opportunities
title_short Migrating a research data warehouse to a public cloud: challenges and opportunities
title_sort migrating a research data warehouse to a public cloud: challenges and opportunities
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922165/
https://www.ncbi.nlm.nih.gov/pubmed/34919694
http://dx.doi.org/10.1093/jamia/ocab278
work_keys_str_mv AT kahnmichaelg migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities
AT muijoycey migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities
AT amesmichaelj migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities
AT yamsanianoopk migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities
AT pozdeyevnikita migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities
AT rafaelsnicholas migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities
AT brooksianm migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities