Cargando…

ElasticBLAST: accelerating sequence search via cloud computing

BACKGROUND: Biomedical researchers use alignments produced by BLAST (Basic Local Alignment Search Tool) to categorize their query sequences. Producing such alignments is an essential bioinformatics task that is well suited for the cloud. The cloud can perform many calculations quickly as well as sto...

Descripción completa

Detalles Bibliográficos
Autores principales: Camacho, Christiam, Boratyn, Grzegorz M., Joukov, Victor, Vera Alvarez, Roberto, Madden, Thomas L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040096/
https://www.ncbi.nlm.nih.gov/pubmed/36967390
http://dx.doi.org/10.1186/s12859-023-05245-9
_version_ 1784912408454627328
author Camacho, Christiam
Boratyn, Grzegorz M.
Joukov, Victor
Vera Alvarez, Roberto
Madden, Thomas L.
author_facet Camacho, Christiam
Boratyn, Grzegorz M.
Joukov, Victor
Vera Alvarez, Roberto
Madden, Thomas L.
author_sort Camacho, Christiam
collection PubMed
description BACKGROUND: Biomedical researchers use alignments produced by BLAST (Basic Local Alignment Search Tool) to categorize their query sequences. Producing such alignments is an essential bioinformatics task that is well suited for the cloud. The cloud can perform many calculations quickly as well as store and access large volumes of data. Bioinformaticians can also use it to collaborate with other researchers, sharing their results, datasets and even their pipelines on a common platform. RESULTS: We present ElasticBLAST, a cloud native application to perform BLAST alignments in the cloud. ElasticBLAST can handle anywhere from a few to many thousands of queries and run the searches on thousands of virtual CPUs (if desired), deleting resources when it is done. It uses cloud native tools for orchestration and can request discounted instances, lowering cloud costs for users. It is supported on Amazon Web Services and Google Cloud Platform. It can search BLAST databases that are user provided or from the National Center for Biotechnology Information. CONCLUSION: We show that ElasticBLAST is a useful application that can efficiently perform BLAST searches for the user in the cloud, demonstrating that with two examples. At the same time, it hides much of the complexity of working in the cloud, lowering the threshold to move work to the cloud.
format Online
Article
Text
id pubmed-10040096
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-100400962023-03-27 ElasticBLAST: accelerating sequence search via cloud computing Camacho, Christiam Boratyn, Grzegorz M. Joukov, Victor Vera Alvarez, Roberto Madden, Thomas L. BMC Bioinformatics Software BACKGROUND: Biomedical researchers use alignments produced by BLAST (Basic Local Alignment Search Tool) to categorize their query sequences. Producing such alignments is an essential bioinformatics task that is well suited for the cloud. The cloud can perform many calculations quickly as well as store and access large volumes of data. Bioinformaticians can also use it to collaborate with other researchers, sharing their results, datasets and even their pipelines on a common platform. RESULTS: We present ElasticBLAST, a cloud native application to perform BLAST alignments in the cloud. ElasticBLAST can handle anywhere from a few to many thousands of queries and run the searches on thousands of virtual CPUs (if desired), deleting resources when it is done. It uses cloud native tools for orchestration and can request discounted instances, lowering cloud costs for users. It is supported on Amazon Web Services and Google Cloud Platform. It can search BLAST databases that are user provided or from the National Center for Biotechnology Information. CONCLUSION: We show that ElasticBLAST is a useful application that can efficiently perform BLAST searches for the user in the cloud, demonstrating that with two examples. At the same time, it hides much of the complexity of working in the cloud, lowering the threshold to move work to the cloud. BioMed Central 2023-03-26 /pmc/articles/PMC10040096/ /pubmed/36967390 http://dx.doi.org/10.1186/s12859-023-05245-9 Text en © This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Camacho, Christiam
Boratyn, Grzegorz M.
Joukov, Victor
Vera Alvarez, Roberto
Madden, Thomas L.
ElasticBLAST: accelerating sequence search via cloud computing
title ElasticBLAST: accelerating sequence search via cloud computing
title_full ElasticBLAST: accelerating sequence search via cloud computing
title_fullStr ElasticBLAST: accelerating sequence search via cloud computing
title_full_unstemmed ElasticBLAST: accelerating sequence search via cloud computing
title_short ElasticBLAST: accelerating sequence search via cloud computing
title_sort elasticblast: accelerating sequence search via cloud computing
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040096/
https://www.ncbi.nlm.nih.gov/pubmed/36967390
http://dx.doi.org/10.1186/s12859-023-05245-9
work_keys_str_mv AT camachochristiam elasticblastacceleratingsequencesearchviacloudcomputing
AT boratyngrzegorzm elasticblastacceleratingsequencesearchviacloudcomputing
AT joukovvictor elasticblastacceleratingsequencesearchviacloudcomputing
AT veraalvarezroberto elasticblastacceleratingsequencesearchviacloudcomputing
AT maddenthomasl elasticblastacceleratingsequencesearchviacloudcomputing