
Towards an Internet of Science

Big data and complex analysis workflows (pipelines) are common issues in data driven science such as bioinformatics. Large amounts of computational tools are available for data analysis. Additionally, many workflow management systems to piece together such tools into data analysis pipelines have bee...


Bibliographic Details
Main author: Allmer, Jens
Format: Online Article Text
Language: English
Published: De Gruyter 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798852/
https://www.ncbi.nlm.nih.gov/pubmed/31145694
http://dx.doi.org/10.1515/jib-2019-0024
_version_ 1783460148730134528
author Allmer, Jens
author_facet Allmer, Jens
author_sort Allmer, Jens
collection PubMed
description Big data and complex analysis workflows (pipelines) are common issues in data driven science such as bioinformatics. Large amounts of computational tools are available for data analysis. Additionally, many workflow management systems to piece together such tools into data analysis pipelines have been developed. For example, more than 50 computational tools for read mapping are available representing a large amount of duplicated effort. Furthermore, it is unclear whether these tools are correct and only a few have a user base large enough to have encountered and reported most of the potential problems. Bringing together many largely untested tools in a computational pipeline must lead to unpredictable results. Yet, this is the current state. While presently data analysis is performed on personal computers/workstations/clusters, the future will see development and analysis shift to the cloud. None of the workflow management systems is ready for this transition. This presents the opportunity to build a new system, which will overcome current duplications of effort, introduce proper testing, allow for development and analysis in public and private clouds, and include reporting features leading to interactive documents.
format Online
Article
Text
id pubmed-6798852
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher De Gruyter
record_format MEDLINE/PubMed
spelling pubmed-67988522019-10-28 Towards an Internet of Science Allmer, Jens J Integr Bioinform Opinion Papers Big data and complex analysis workflows (pipelines) are common issues in data driven science such as bioinformatics. Large amounts of computational tools are available for data analysis. Additionally, many workflow management systems to piece together such tools into data analysis pipelines have been developed. For example, more than 50 computational tools for read mapping are available representing a large amount of duplicated effort. Furthermore, it is unclear whether these tools are correct and only a few have a user base large enough to have encountered and reported most of the potential problems. Bringing together many largely untested tools in a computational pipeline must lead to unpredictable results. Yet, this is the current state. While presently data analysis is performed on personal computers/workstations/clusters, the future will see development and analysis shift to the cloud. None of the workflow management systems is ready for this transition. This presents the opportunity to build a new system, which will overcome current duplications of effort, introduce proper testing, allow for development and analysis in public and private clouds, and include reporting features leading to interactive documents. De Gruyter 2019-05-30 /pmc/articles/PMC6798852/ /pubmed/31145694 http://dx.doi.org/10.1515/jib-2019-0024 Text en © 2019, Jens Allmer, published by Walter de Gruyter GmbH, Berlin/Boston http://creativecommons.org/licenses/by/4.0 This work is licensed under the Creative Commons Attribution 4.0 Public License.
spellingShingle Opinion Papers
Allmer, Jens
Towards an Internet of Science
title Towards an Internet of Science
title_full Towards an Internet of Science
title_fullStr Towards an Internet of Science
title_full_unstemmed Towards an Internet of Science
title_short Towards an Internet of Science
title_sort towards an internet of science
topic Opinion Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798852/
https://www.ncbi.nlm.nih.gov/pubmed/31145694
http://dx.doi.org/10.1515/jib-2019-0024
work_keys_str_mv AT allmerjens towardsaninternetofscience