Towards an Internet of Science
Big data and complex analysis workflows (pipelines) are common issues in data driven science such as bioinformatics. Large amounts of computational tools are available for data analysis. Additionally, many workflow management systems to piece together such tools into data analysis pipelines have bee...
Main author: | Allmer, Jens |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | De Gruyter 2019 |
Subjects: | Opinion Papers |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798852/ https://www.ncbi.nlm.nih.gov/pubmed/31145694 http://dx.doi.org/10.1515/jib-2019-0024 |
_version_ | 1783460148730134528 |
---|---|
author | Allmer, Jens |
author_facet | Allmer, Jens |
author_sort | Allmer, Jens |
collection | PubMed |
description | Big data and complex analysis workflows (pipelines) are common issues in data driven science such as bioinformatics. Large amounts of computational tools are available for data analysis. Additionally, many workflow management systems to piece together such tools into data analysis pipelines have been developed. For example, more than 50 computational tools for read mapping are available representing a large amount of duplicated effort. Furthermore, it is unclear whether these tools are correct and only a few have a user base large enough to have encountered and reported most of the potential problems. Bringing together many largely untested tools in a computational pipeline must lead to unpredictable results. Yet, this is the current state. While presently data analysis is performed on personal computers/workstations/clusters, the future will see development and analysis shift to the cloud. None of the workflow management systems is ready for this transition. This presents the opportunity to build a new system, which will overcome current duplications of effort, introduce proper testing, allow for development and analysis in public and private clouds, and include reporting features leading to interactive documents. |
format | Online Article Text |
id | pubmed-6798852 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | De Gruyter |
record_format | MEDLINE/PubMed |
spelling | pubmed-67988522019-10-28 Towards an Internet of Science Allmer, Jens J Integr Bioinform Opinion Papers Big data and complex analysis workflows (pipelines) are common issues in data driven science such as bioinformatics. Large amounts of computational tools are available for data analysis. Additionally, many workflow management systems to piece together such tools into data analysis pipelines have been developed. For example, more than 50 computational tools for read mapping are available representing a large amount of duplicated effort. Furthermore, it is unclear whether these tools are correct and only a few have a user base large enough to have encountered and reported most of the potential problems. Bringing together many largely untested tools in a computational pipeline must lead to unpredictable results. Yet, this is the current state. While presently data analysis is performed on personal computers/workstations/clusters, the future will see development and analysis shift to the cloud. None of the workflow management systems is ready for this transition. This presents the opportunity to build a new system, which will overcome current duplications of effort, introduce proper testing, allow for development and analysis in public and private clouds, and include reporting features leading to interactive documents. De Gruyter 2019-05-30 /pmc/articles/PMC6798852/ /pubmed/31145694 http://dx.doi.org/10.1515/jib-2019-0024 Text en © 2019, Jens Allmer, published by Walter de Gruyter GmbH, Berlin/Boston http://creativecommons.org/licenses/by/4.0 This work is licensed under the Creative Commons Attribution 4.0 Public License. |
spellingShingle | Opinion Papers Allmer, Jens Towards an Internet of Science |
title | Towards an Internet of Science |
title_full | Towards an Internet of Science |
title_fullStr | Towards an Internet of Science |
title_full_unstemmed | Towards an Internet of Science |
title_short | Towards an Internet of Science |
title_sort | towards an internet of science |
topic | Opinion Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798852/ https://www.ncbi.nlm.nih.gov/pubmed/31145694 http://dx.doi.org/10.1515/jib-2019-0024 |
work_keys_str_mv | AT allmerjens towardsaninternetofscience |