Cargando…

Practical estimation of cloud storage costs for clinical genomic data

BACKGROUND: Laboratories performing clinical high-throughput sequencing for oncology and germline testing are increasingly migrating their data storage to cloud-based solutions. Cloud-based storage has several advantages, such as low per-GB prices, scalability, and minimal fixed costs; however, whil...

Descripción completa

Detalles Bibliográficos
Autores principales: Krumm, Niklas, Hoffman, Noah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7276491/
https://www.ncbi.nlm.nih.gov/pubmed/32529017
http://dx.doi.org/10.1016/j.plabm.2020.e00168
_version_ 1783542961171070976
author Krumm, Niklas
Hoffman, Noah
author_facet Krumm, Niklas
Hoffman, Noah
author_sort Krumm, Niklas
collection PubMed
description BACKGROUND: Laboratories performing clinical high-throughput sequencing for oncology and germline testing are increasingly migrating their data storage to cloud-based solutions. Cloud-based storage has several advantages, such as low per-GB prices, scalability, and minimal fixed costs; however, while these solutions tout ostensibly simple usage-based pricing plans, practical cost analysis of cloud storage for NGS data storage is not straightforward. METHODS: We developed an easy-to-use tool designed specifically for cost and usage estimation for laboratories performing clinical NGS testing (https://ngscosts.info). Our tool enables quick exploration of dozens of storage options across three major cloud providers, and provides complex cost and usage forecasts over 1–20 year timeframes. Parameters include current test volumes, growth rate, data compression, data retention policies, and case re-access rates. Outputs include an easy-to-visualize chart of total data stored, yearly and lifetime costs, and a “cost per test” estimate. RESULTS: Two factors were found to markedly decrease the average cost per test: 1) reducing total file size, including through the use of compression, 2) rapid transfer to “cold” or archival storage. In contrast, re-access of data from archival storage tiers was not found to dramatically increase the cost of storage per test. CONCLUSIONS: Steady declines in cloud storage pricing, as well as new options for storage and retrieval, make storing clinical NGS data on the cloud economical and friendly to laboratory workflows. Our web-based tool makes it possible to explore and compare cloud storage solutions and provide forecasts specifically for clinical NGS laboratories.
format Online
Article
Text
id pubmed-7276491
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-72764912020-06-10 Practical estimation of cloud storage costs for clinical genomic data Krumm, Niklas Hoffman, Noah Pract Lab Med Article BACKGROUND: Laboratories performing clinical high-throughput sequencing for oncology and germline testing are increasingly migrating their data storage to cloud-based solutions. Cloud-based storage has several advantages, such as low per-GB prices, scalability, and minimal fixed costs; however, while these solutions tout ostensibly simple usage-based pricing plans, practical cost analysis of cloud storage for NGS data storage is not straightforward. METHODS: We developed an easy-to-use tool designed specifically for cost and usage estimation for laboratories performing clinical NGS testing (https://ngscosts.info). Our tool enables quick exploration of dozens of storage options across three major cloud providers, and provides complex cost and usage forecasts over 1–20 year timeframes. Parameters include current test volumes, growth rate, data compression, data retention policies, and case re-access rates. Outputs include an easy-to-visualize chart of total data stored, yearly and lifetime costs, and a “cost per test” estimate. RESULTS: Two factors were found to markedly decrease the average cost per test: 1) reducing total file size, including through the use of compression, 2) rapid transfer to “cold” or archival storage. In contrast, re-access of data from archival storage tiers was not found to dramatically increase the cost of storage per test. CONCLUSIONS: Steady declines in cloud storage pricing, as well as new options for storage and retrieval, make storing clinical NGS data on the cloud economical and friendly to laboratory workflows. Our web-based tool makes it possible to explore and compare cloud storage solutions and provide forecasts specifically for clinical NGS laboratories. Elsevier 2020-05-15 /pmc/articles/PMC7276491/ /pubmed/32529017 http://dx.doi.org/10.1016/j.plabm.2020.e00168 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Krumm, Niklas
Hoffman, Noah
Practical estimation of cloud storage costs for clinical genomic data
title Practical estimation of cloud storage costs for clinical genomic data
title_full Practical estimation of cloud storage costs for clinical genomic data
title_fullStr Practical estimation of cloud storage costs for clinical genomic data
title_full_unstemmed Practical estimation of cloud storage costs for clinical genomic data
title_short Practical estimation of cloud storage costs for clinical genomic data
title_sort practical estimation of cloud storage costs for clinical genomic data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7276491/
https://www.ncbi.nlm.nih.gov/pubmed/32529017
http://dx.doi.org/10.1016/j.plabm.2020.e00168
work_keys_str_mv AT krummniklas practicalestimationofcloudstoragecostsforclinicalgenomicdata
AT hoffmannoah practicalestimationofcloudstoragecostsforclinicalgenomicdata