Cargando…
Migrating a research data warehouse to a public cloud: challenges and opportunities
OBJECTIVE: Clinical research data warehouses (RDWs) linked to genomic pipelines and open data archives are being created to support innovative, complex data-driven discoveries. The computing and storage needs of these research environments may quickly exceed the capacity of on-premises systems. New...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922165/ https://www.ncbi.nlm.nih.gov/pubmed/34919694 http://dx.doi.org/10.1093/jamia/ocab278 |
_version_ | 1784669474019868672 |
---|---|
author | Kahn, Michael G Mui, Joyce Y Ames, Michael J Yamsani, Anoop K Pozdeyev, Nikita Rafaels, Nicholas Brooks, Ian M |
author_facet | Kahn, Michael G Mui, Joyce Y Ames, Michael J Yamsani, Anoop K Pozdeyev, Nikita Rafaels, Nicholas Brooks, Ian M |
author_sort | Kahn, Michael G |
collection | PubMed |
description | OBJECTIVE: Clinical research data warehouses (RDWs) linked to genomic pipelines and open data archives are being created to support innovative, complex data-driven discoveries. The computing and storage needs of these research environments may quickly exceed the capacity of on-premises systems. New RDWs are migrating to cloud platforms for the scalability and flexibility needed to meet these challenges. We describe our experience in migrating a multi-institutional RDW to a public cloud. MATERIALS AND METHODS: This study is descriptive. Primary materials included internal and public presentations before and after the transition, analysis documents, and actual billing records. Findings were aggregated into topical categories. RESULTS: Eight categories of migration issues were identified. Unanticipated challenges included legacy system limitations; network, computing, and storage architectures that realize performance and cost benefits in the face of hyper-innovation, complex security reviews and approvals, and limited cloud consulting expertise. DISCUSSION: Cloud architectures enable previously unavailable capabilities, but numerous pitfalls can impede realizing the full benefits of a cloud environment. Rapid changes in cloud capabilities can quickly obsolete existing architectures and associated institutional policies. Touchpoints with on-premise networks and systems can add unforeseen complexity. Governance, resource management, and cost oversight are critical to allow rapid innovation while minimizing wasted resources and unnecessary costs. CONCLUSIONS: Migrating our RDW to the cloud has enabled capabilities and innovations that would not have been possible with an on-premises environment. Notwithstanding the challenges of managing cloud resources, the resulting RDW capabilities have been highly positive to our institution, research community, and partners. |
format | Online Article Text |
id | pubmed-8922165 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-89221652022-03-15 Migrating a research data warehouse to a public cloud: challenges and opportunities Kahn, Michael G Mui, Joyce Y Ames, Michael J Yamsani, Anoop K Pozdeyev, Nikita Rafaels, Nicholas Brooks, Ian M J Am Med Inform Assoc Research and Applications OBJECTIVE: Clinical research data warehouses (RDWs) linked to genomic pipelines and open data archives are being created to support innovative, complex data-driven discoveries. The computing and storage needs of these research environments may quickly exceed the capacity of on-premises systems. New RDWs are migrating to cloud platforms for the scalability and flexibility needed to meet these challenges. We describe our experience in migrating a multi-institutional RDW to a public cloud. MATERIALS AND METHODS: This study is descriptive. Primary materials included internal and public presentations before and after the transition, analysis documents, and actual billing records. Findings were aggregated into topical categories. RESULTS: Eight categories of migration issues were identified. Unanticipated challenges included legacy system limitations; network, computing, and storage architectures that realize performance and cost benefits in the face of hyper-innovation, complex security reviews and approvals, and limited cloud consulting expertise. DISCUSSION: Cloud architectures enable previously unavailable capabilities, but numerous pitfalls can impede realizing the full benefits of a cloud environment. Rapid changes in cloud capabilities can quickly obsolete existing architectures and associated institutional policies. Touchpoints with on-premise networks and systems can add unforeseen complexity. Governance, resource management, and cost oversight are critical to allow rapid innovation while minimizing wasted resources and unnecessary costs. CONCLUSIONS: Migrating our RDW to the cloud has enabled capabilities and innovations that would not have been possible with an on-premises environment. Notwithstanding the challenges of managing cloud resources, the resulting RDW capabilities have been highly positive to our institution, research community, and partners. Oxford University Press 2021-12-17 /pmc/articles/PMC8922165/ /pubmed/34919694 http://dx.doi.org/10.1093/jamia/ocab278 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research and Applications Kahn, Michael G Mui, Joyce Y Ames, Michael J Yamsani, Anoop K Pozdeyev, Nikita Rafaels, Nicholas Brooks, Ian M Migrating a research data warehouse to a public cloud: challenges and opportunities |
title | Migrating a research data warehouse to a public cloud: challenges and opportunities |
title_full | Migrating a research data warehouse to a public cloud: challenges and opportunities |
title_fullStr | Migrating a research data warehouse to a public cloud: challenges and opportunities |
title_full_unstemmed | Migrating a research data warehouse to a public cloud: challenges and opportunities |
title_short | Migrating a research data warehouse to a public cloud: challenges and opportunities |
title_sort | migrating a research data warehouse to a public cloud: challenges and opportunities |
topic | Research and Applications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8922165/ https://www.ncbi.nlm.nih.gov/pubmed/34919694 http://dx.doi.org/10.1093/jamia/ocab278 |
work_keys_str_mv | AT kahnmichaelg migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities AT muijoycey migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities AT amesmichaelj migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities AT yamsanianoopk migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities AT pozdeyevnikita migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities AT rafaelsnicholas migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities AT brooksianm migratingaresearchdatawarehousetoapubliccloudchallengesandopportunities |