Cargando…

Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup

BACKGROUND: Comparative genomics resources, such as ortholog detection tools and repositories are rapidly increasing in scale and complexity. Cloud computing is an emerging technological paradigm that enables researchers to dynamically build a dedicated virtual cluster and may represent a valuable a...

Descripción completa

Detalles Bibliográficos
Autores principales: Kudtarkar, Parul, DeLuca, Todd F., Fusaro, Vincent A., Tonellato, Peter J., Wall, Dennis P.
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3023304/
https://www.ncbi.nlm.nih.gov/pubmed/21258651
http://dx.doi.org/10.4137/EBO.S6259
_version_ 1782196656957030400
author Kudtarkar, Parul
DeLuca, Todd F.
Fusaro, Vincent A.
Tonellato, Peter J.
Wall, Dennis P.
author_facet Kudtarkar, Parul
DeLuca, Todd F.
Fusaro, Vincent A.
Tonellato, Peter J.
Wall, Dennis P.
author_sort Kudtarkar, Parul
collection PubMed
description BACKGROUND: Comparative genomics resources, such as ortholog detection tools and repositories are rapidly increasing in scale and complexity. Cloud computing is an emerging technological paradigm that enables researchers to dynamically build a dedicated virtual cluster and may represent a valuable alternative for large computational tools in bioinformatics. In the present manuscript, we optimize the computation of a large-scale comparative genomics resource—Roundup—using cloud computing, describe the proper operating principles required to achieve computational efficiency on the cloud, and detail important procedures for improving cost-effectiveness to ensure maximal computation at minimal costs. METHODS: Utilizing the comparative genomics tool, Roundup, as a case study, we computed orthologs among 902 fully sequenced genomes on Amazon’s Elastic Compute Cloud. For managing the ortholog processes, we designed a strategy to deploy the web service, Elastic MapReduce, and maximize the use of the cloud while simultaneously minimizing costs. Specifically, we created a model to estimate cloud runtime based on the size and complexity of the genomes being compared that determines in advance the optimal order of the jobs to be submitted. RESULTS: We computed orthologous relationships for 245,323 genome-to-genome comparisons on Amazon’s computing cloud, a computation that required just over 200 hours and cost $8,000 USD, at least 40% less than expected under a strategy in which genome comparisons were submitted to the cloud randomly with respect to runtime. Our cost savings projections were based on a model that not only demonstrates the optimal strategy for deploying RSD to the cloud, but also finds the optimal cluster size to minimize waste and maximize usage. Our cost-reduction model is readily adaptable for other comparative genomics tools and potentially of significant benefit to labs seeking to take advantage of the cloud as an alternative to local computing infrastructure.
format Text
id pubmed-3023304
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-30233042011-01-21 Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup Kudtarkar, Parul DeLuca, Todd F. Fusaro, Vincent A. Tonellato, Peter J. Wall, Dennis P. Evol Bioinform Online Short Report BACKGROUND: Comparative genomics resources, such as ortholog detection tools and repositories are rapidly increasing in scale and complexity. Cloud computing is an emerging technological paradigm that enables researchers to dynamically build a dedicated virtual cluster and may represent a valuable alternative for large computational tools in bioinformatics. In the present manuscript, we optimize the computation of a large-scale comparative genomics resource—Roundup—using cloud computing, describe the proper operating principles required to achieve computational efficiency on the cloud, and detail important procedures for improving cost-effectiveness to ensure maximal computation at minimal costs. METHODS: Utilizing the comparative genomics tool, Roundup, as a case study, we computed orthologs among 902 fully sequenced genomes on Amazon’s Elastic Compute Cloud. For managing the ortholog processes, we designed a strategy to deploy the web service, Elastic MapReduce, and maximize the use of the cloud while simultaneously minimizing costs. Specifically, we created a model to estimate cloud runtime based on the size and complexity of the genomes being compared that determines in advance the optimal order of the jobs to be submitted. RESULTS: We computed orthologous relationships for 245,323 genome-to-genome comparisons on Amazon’s computing cloud, a computation that required just over 200 hours and cost $8,000 USD, at least 40% less than expected under a strategy in which genome comparisons were submitted to the cloud randomly with respect to runtime. Our cost savings projections were based on a model that not only demonstrates the optimal strategy for deploying RSD to the cloud, but also finds the optimal cluster size to minimize waste and maximize usage. Our cost-reduction model is readily adaptable for other comparative genomics tools and potentially of significant benefit to labs seeking to take advantage of the cloud as an alternative to local computing infrastructure. Libertas Academica 2010-12-22 /pmc/articles/PMC3023304/ /pubmed/21258651 http://dx.doi.org/10.4137/EBO.S6259 Text en © 2010 the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.
spellingShingle Short Report
Kudtarkar, Parul
DeLuca, Todd F.
Fusaro, Vincent A.
Tonellato, Peter J.
Wall, Dennis P.
Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup
title Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup
title_full Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup
title_fullStr Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup
title_full_unstemmed Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup
title_short Cost-Effective Cloud Computing: A Case Study Using the Comparative Genomics Tool, Roundup
title_sort cost-effective cloud computing: a case study using the comparative genomics tool, roundup
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3023304/
https://www.ncbi.nlm.nih.gov/pubmed/21258651
http://dx.doi.org/10.4137/EBO.S6259
work_keys_str_mv AT kudtarkarparul costeffectivecloudcomputingacasestudyusingthecomparativegenomicstoolroundup
AT delucatoddf costeffectivecloudcomputingacasestudyusingthecomparativegenomicstoolroundup
AT fusarovincenta costeffectivecloudcomputingacasestudyusingthecomparativegenomicstoolroundup
AT tonellatopeterj costeffectivecloudcomputingacasestudyusingthecomparativegenomicstoolroundup
AT walldennisp costeffectivecloudcomputingacasestudyusingthecomparativegenomicstoolroundup