Cargando…

Atlas2 Cloud: a framework for personal genome analysis in the cloud

BACKGROUND: Until recently, sequencing has primarily been carried out in large genome centers which have invested heavily in developing the computational infrastructure that enables genomic sequence analysis. The recent advancements in next generation sequencing (NGS) have led to a wide disseminatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Evani, Uday S, Challis, Danny, Yu, Jin, Jackson, Andrew R, Paithankar, Sameer, Bainbridge, Matthew N, Jakkamsetti, Adinarayana, Pham, Peter, Coarfa, Cristian, Milosavljevic, Aleksandar, Yu, Fuli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3481437/
https://www.ncbi.nlm.nih.gov/pubmed/23134663
http://dx.doi.org/10.1186/1471-2164-13-S6-S19
_version_ 1782247738220478464
author Evani, Uday S
Challis, Danny
Yu, Jin
Jackson, Andrew R
Paithankar, Sameer
Bainbridge, Matthew N
Jakkamsetti, Adinarayana
Pham, Peter
Coarfa, Cristian
Milosavljevic, Aleksandar
Yu, Fuli
author_facet Evani, Uday S
Challis, Danny
Yu, Jin
Jackson, Andrew R
Paithankar, Sameer
Bainbridge, Matthew N
Jakkamsetti, Adinarayana
Pham, Peter
Coarfa, Cristian
Milosavljevic, Aleksandar
Yu, Fuli
author_sort Evani, Uday S
collection PubMed
description BACKGROUND: Until recently, sequencing has primarily been carried out in large genome centers which have invested heavily in developing the computational infrastructure that enables genomic sequence analysis. The recent advancements in next generation sequencing (NGS) have led to a wide dissemination of sequencing technologies and data, to highly diverse research groups. It is expected that clinical sequencing will become part of diagnostic routines shortly. However, limited accessibility to computational infrastructure and high quality bioinformatic tools, and the demand for personnel skilled in data analysis and interpretation remains a serious bottleneck. To this end, the cloud computing and Software-as-a-Service (SaaS) technologies can help address these issues. RESULTS: We successfully enabled the Atlas2 Cloud pipeline for personal genome analysis on two different cloud service platforms: a community cloud via the Genboree Workbench, and a commercial cloud via the Amazon Web Services using Software-as-a-Service model. We report a case study of personal genome analysis using our Atlas2 Genboree pipeline. We also outline a detailed cost structure for running Atlas2 Amazon on whole exome capture data, providing cost projections in terms of storage, compute and I/O when running Atlas2 Amazon on a large data set. CONCLUSIONS: We find that providing a web interface and an optimized pipeline clearly facilitates usage of cloud computing for personal genome analysis, but for it to be routinely used for large scale projects there needs to be a paradigm shift in the way we develop tools, in standard operating procedures, and in funding mechanisms.
format Online
Article
Text
id pubmed-3481437
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34814372012-11-02 Atlas2 Cloud: a framework for personal genome analysis in the cloud Evani, Uday S Challis, Danny Yu, Jin Jackson, Andrew R Paithankar, Sameer Bainbridge, Matthew N Jakkamsetti, Adinarayana Pham, Peter Coarfa, Cristian Milosavljevic, Aleksandar Yu, Fuli BMC Genomics Research BACKGROUND: Until recently, sequencing has primarily been carried out in large genome centers which have invested heavily in developing the computational infrastructure that enables genomic sequence analysis. The recent advancements in next generation sequencing (NGS) have led to a wide dissemination of sequencing technologies and data, to highly diverse research groups. It is expected that clinical sequencing will become part of diagnostic routines shortly. However, limited accessibility to computational infrastructure and high quality bioinformatic tools, and the demand for personnel skilled in data analysis and interpretation remains a serious bottleneck. To this end, the cloud computing and Software-as-a-Service (SaaS) technologies can help address these issues. RESULTS: We successfully enabled the Atlas2 Cloud pipeline for personal genome analysis on two different cloud service platforms: a community cloud via the Genboree Workbench, and a commercial cloud via the Amazon Web Services using Software-as-a-Service model. We report a case study of personal genome analysis using our Atlas2 Genboree pipeline. We also outline a detailed cost structure for running Atlas2 Amazon on whole exome capture data, providing cost projections in terms of storage, compute and I/O when running Atlas2 Amazon on a large data set. CONCLUSIONS: We find that providing a web interface and an optimized pipeline clearly facilitates usage of cloud computing for personal genome analysis, but for it to be routinely used for large scale projects there needs to be a paradigm shift in the way we develop tools, in standard operating procedures, and in funding mechanisms. BioMed Central 2012-10-26 /pmc/articles/PMC3481437/ /pubmed/23134663 http://dx.doi.org/10.1186/1471-2164-13-S6-S19 Text en Copyright ©2012 Evani et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Evani, Uday S
Challis, Danny
Yu, Jin
Jackson, Andrew R
Paithankar, Sameer
Bainbridge, Matthew N
Jakkamsetti, Adinarayana
Pham, Peter
Coarfa, Cristian
Milosavljevic, Aleksandar
Yu, Fuli
Atlas2 Cloud: a framework for personal genome analysis in the cloud
title Atlas2 Cloud: a framework for personal genome analysis in the cloud
title_full Atlas2 Cloud: a framework for personal genome analysis in the cloud
title_fullStr Atlas2 Cloud: a framework for personal genome analysis in the cloud
title_full_unstemmed Atlas2 Cloud: a framework for personal genome analysis in the cloud
title_short Atlas2 Cloud: a framework for personal genome analysis in the cloud
title_sort atlas2 cloud: a framework for personal genome analysis in the cloud
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3481437/
https://www.ncbi.nlm.nih.gov/pubmed/23134663
http://dx.doi.org/10.1186/1471-2164-13-S6-S19
work_keys_str_mv AT evaniudays atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud
AT challisdanny atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud
AT yujin atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud
AT jacksonandrewr atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud
AT paithankarsameer atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud
AT bainbridgematthewn atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud
AT jakkamsettiadinarayana atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud
AT phampeter atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud
AT coarfacristian atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud
AT milosavljevicaleksandar atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud
AT yufuli atlas2cloudaframeworkforpersonalgenomeanalysisinthecloud