Cargando…

Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community

BACKGROUND: A steep drop in the cost of next-generation sequencing during recent years has made the technology affordable to the majority of researchers, but downstream bioinformatic analysis still poses a resource bottleneck for smaller laboratories and institutes that do not have access to substan...

Descripción completa

Detalles Bibliográficos
Autores principales: Krampis, Konstantinos, Booth, Tim, Chapman, Brad, Tiwari, Bela, Bicak, Mesude, Field, Dawn, Nelson, Karen E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3372431/
https://www.ncbi.nlm.nih.gov/pubmed/22429538
http://dx.doi.org/10.1186/1471-2105-13-42
_version_ 1782235343673622528
author Krampis, Konstantinos
Booth, Tim
Chapman, Brad
Tiwari, Bela
Bicak, Mesude
Field, Dawn
Nelson, Karen E
author_facet Krampis, Konstantinos
Booth, Tim
Chapman, Brad
Tiwari, Bela
Bicak, Mesude
Field, Dawn
Nelson, Karen E
author_sort Krampis, Konstantinos
collection PubMed
description BACKGROUND: A steep drop in the cost of next-generation sequencing during recent years has made the technology affordable to the majority of researchers, but downstream bioinformatic analysis still poses a resource bottleneck for smaller laboratories and institutes that do not have access to substantial computational resources. Sequencing instruments are typically bundled with only the minimal processing and storage capacity required for data capture during sequencing runs. Given the scale of sequence datasets, scientific value cannot be obtained from acquiring a sequencer unless it is accompanied by an equal investment in informatics infrastructure. RESULTS: Cloud BioLinux is a publicly accessible Virtual Machine (VM) that enables scientists to quickly provision on-demand infrastructures for high-performance bioinformatics computing using cloud platforms. Users have instant access to a range of pre-configured command line and graphical software applications, including a full-featured desktop interface, documentation and over 135 bioinformatics packages for applications including sequence alignment, clustering, assembly, display, editing, and phylogeny. Each tool's functionality is fully described in the documentation directly accessible from the graphical interface of the VM. Besides the Amazon EC2 cloud, we have started instances of Cloud BioLinux on a private Eucalyptus cloud installed at the J. Craig Venter Institute, and demonstrated access to the bioinformatic tools interface through a remote connection to EC2 instances from a local desktop computer. Documentation for using Cloud BioLinux on EC2 is available from our project website, while a Eucalyptus cloud image and VirtualBox Appliance is also publicly available for download and use by researchers with access to private clouds. CONCLUSIONS: Cloud BioLinux provides a platform for developing bioinformatics infrastructures on the cloud. An automated and configurable process builds Virtual Machines, allowing the development of highly customized versions from a shared code base. This shared community toolkit enables application specific analysis platforms on the cloud by minimizing the effort required to prepare and maintain them.
format Online
Article
Text
id pubmed-3372431
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33724312012-06-12 Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community Krampis, Konstantinos Booth, Tim Chapman, Brad Tiwari, Bela Bicak, Mesude Field, Dawn Nelson, Karen E BMC Bioinformatics Software BACKGROUND: A steep drop in the cost of next-generation sequencing during recent years has made the technology affordable to the majority of researchers, but downstream bioinformatic analysis still poses a resource bottleneck for smaller laboratories and institutes that do not have access to substantial computational resources. Sequencing instruments are typically bundled with only the minimal processing and storage capacity required for data capture during sequencing runs. Given the scale of sequence datasets, scientific value cannot be obtained from acquiring a sequencer unless it is accompanied by an equal investment in informatics infrastructure. RESULTS: Cloud BioLinux is a publicly accessible Virtual Machine (VM) that enables scientists to quickly provision on-demand infrastructures for high-performance bioinformatics computing using cloud platforms. Users have instant access to a range of pre-configured command line and graphical software applications, including a full-featured desktop interface, documentation and over 135 bioinformatics packages for applications including sequence alignment, clustering, assembly, display, editing, and phylogeny. Each tool's functionality is fully described in the documentation directly accessible from the graphical interface of the VM. Besides the Amazon EC2 cloud, we have started instances of Cloud BioLinux on a private Eucalyptus cloud installed at the J. Craig Venter Institute, and demonstrated access to the bioinformatic tools interface through a remote connection to EC2 instances from a local desktop computer. Documentation for using Cloud BioLinux on EC2 is available from our project website, while a Eucalyptus cloud image and VirtualBox Appliance is also publicly available for download and use by researchers with access to private clouds. CONCLUSIONS: Cloud BioLinux provides a platform for developing bioinformatics infrastructures on the cloud. An automated and configurable process builds Virtual Machines, allowing the development of highly customized versions from a shared code base. This shared community toolkit enables application specific analysis platforms on the cloud by minimizing the effort required to prepare and maintain them. BioMed Central 2012-03-19 /pmc/articles/PMC3372431/ /pubmed/22429538 http://dx.doi.org/10.1186/1471-2105-13-42 Text en Copyright ©2012 Krampis et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Krampis, Konstantinos
Booth, Tim
Chapman, Brad
Tiwari, Bela
Bicak, Mesude
Field, Dawn
Nelson, Karen E
Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community
title Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community
title_full Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community
title_fullStr Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community
title_full_unstemmed Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community
title_short Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community
title_sort cloud biolinux: pre-configured and on-demand bioinformatics computing for the genomics community
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3372431/
https://www.ncbi.nlm.nih.gov/pubmed/22429538
http://dx.doi.org/10.1186/1471-2105-13-42
work_keys_str_mv AT krampiskonstantinos cloudbiolinuxpreconfiguredandondemandbioinformaticscomputingforthegenomicscommunity
AT boothtim cloudbiolinuxpreconfiguredandondemandbioinformaticscomputingforthegenomicscommunity
AT chapmanbrad cloudbiolinuxpreconfiguredandondemandbioinformaticscomputingforthegenomicscommunity
AT tiwaribela cloudbiolinuxpreconfiguredandondemandbioinformaticscomputingforthegenomicscommunity
AT bicakmesude cloudbiolinuxpreconfiguredandondemandbioinformaticscomputingforthegenomicscommunity
AT fielddawn cloudbiolinuxpreconfiguredandondemandbioinformaticscomputingforthegenomicscommunity
AT nelsonkarene cloudbiolinuxpreconfiguredandondemandbioinformaticscomputingforthegenomicscommunity