Swarm: A federated cloud framework for large-scale variant analysis
Genomic data analysis across multiple cloud platforms is an ongoing challenge, especially when large amounts of data are involved. Here, we present Swarm, a framework for federated computation that promotes minimal data motion and facilitates crosstalk between genomic datasets stored on various clou...
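Main Authors: | Bahmani, Amir; Ferriter, Kyle; Krishnan, Vandhana; Alavi, Arash; Alavi, Amir; Tsao, Philip S.; Snyder, Michael P.; Pan, Cuiping |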
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science 2021 |
Subjects: | Research Article |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8143397/ https://www.ncbi.nlm.nih.gov/pubmed/33979321 http://dx.doi.org/10.1371/journal.pcbi.1008977 |
_version_ | 1783696744593227776 |
---|---|
author | Bahmani, Amir Ferriter, Kyle Krishnan, Vandhana Alavi, Arash Alavi, Amir Tsao, Philip S. Snyder, Michael P. Pan, Cuiping |
author_facet | Bahmani, Amir Ferriter, Kyle Krishnan, Vandhana Alavi, Arash Alavi, Amir Tsao, Philip S. Snyder, Michael P. Pan, Cuiping |
author_sort | Bahmani, Amir |
collection | PubMed |
description | Genomic data analysis across multiple cloud platforms is an ongoing challenge, especially when large amounts of data are involved. Here, we present Swarm, a framework for federated computation that promotes minimal data motion and facilitates crosstalk between genomic datasets stored on various cloud platforms. We demonstrate its utility via common inquiries of genomic variants across BigQuery in the Google Cloud Platform (GCP), Athena in the Amazon Web Services (AWS), Apache Presto and MySQL. Compared to single-cloud platforms, the Swarm framework significantly reduced computational costs, run-time delays and risks of security breach and privacy violation. |
format | Online Article Text |
id | pubmed-8143397 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-8143397 2021-06-07 Swarm: A federated cloud framework for large-scale variant analysis Bahmani, Amir Ferriter, Kyle Krishnan, Vandhana Alavi, Arash Alavi, Amir Tsao, Philip S. Snyder, Michael P. Pan, Cuiping PLoS Comput Biol Research Article Genomic data analysis across multiple cloud platforms is an ongoing challenge, especially when large amounts of data are involved. Here, we present Swarm, a framework for federated computation that promotes minimal data motion and facilitates crosstalk between genomic datasets stored on various cloud platforms. We demonstrate its utility via common inquiries of genomic variants across BigQuery in the Google Cloud Platform (GCP), Athena in the Amazon Web Services (AWS), Apache Presto and MySQL. Compared to single-cloud platforms, the Swarm framework significantly reduced computational costs, run-time delays and risks of security breach and privacy violation. Public Library of Science 2021-05-12 /pmc/articles/PMC8143397/ /pubmed/33979321 http://dx.doi.org/10.1371/journal.pcbi.1008977 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Bahmani, Amir Ferriter, Kyle Krishnan, Vandhana Alavi, Arash Alavi, Amir Tsao, Philip S. Snyder, Michael P. Pan, Cuiping Swarm: A federated cloud framework for large-scale variant analysis |
title | Swarm: A federated cloud framework for large-scale variant analysis |
title_full | Swarm: A federated cloud framework for large-scale variant analysis |
title_fullStr | Swarm: A federated cloud framework for large-scale variant analysis |
title_full_unstemmed | Swarm: A federated cloud framework for large-scale variant analysis |
title_short | Swarm: A federated cloud framework for large-scale variant analysis |
title_sort | swarm: a federated cloud framework for large-scale variant analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8143397/ https://www.ncbi.nlm.nih.gov/pubmed/33979321 http://dx.doi.org/10.1371/journal.pcbi.1008977 |