Cargando…
Bioinformatics tools developed to support BioCompute Objects
Developments in high-throughput sequencing (HTS) result in an exponential increase in the amount of data generated by sequencing experiments, an increase in the complexity of bioinformatics analysis reporting and an increase in the types of data generated. These increases in volume, diversity and co...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8009203/ https://www.ncbi.nlm.nih.gov/pubmed/33784373 http://dx.doi.org/10.1093/database/baab008 |
_version_ | 1783672834944401408 |
---|---|
author | Patel, Janisha A Dean, Dennis A King, Charles Hadley Xiao, Nan Koc, Soner Minina, Ekaterina Golikov, Anton Brooks, Phillip Kahsay, Robel Navelkar, Rahi Ray, Manisha Roberson, Dave Armstrong, Chris Mazumder, Raja Keeney, Jonathon |
author_facet | Patel, Janisha A Dean, Dennis A King, Charles Hadley Xiao, Nan Koc, Soner Minina, Ekaterina Golikov, Anton Brooks, Phillip Kahsay, Robel Navelkar, Rahi Ray, Manisha Roberson, Dave Armstrong, Chris Mazumder, Raja Keeney, Jonathon |
author_sort | Patel, Janisha A |
collection | PubMed |
description | Developments in high-throughput sequencing (HTS) result in an exponential increase in the amount of data generated by sequencing experiments, an increase in the complexity of bioinformatics analysis reporting and an increase in the types of data generated. These increases in volume, diversity and complexity of the data generated and their analysis expose the necessity of a structured and standardized reporting template. BioCompute Objects (BCOs) provide the requisite support for communication of HTS data analysis that includes support for workflow, as well as data, curation, accessibility and reproducibility of communication. BCOs standardize how researchers report provenance and the established verification and validation protocols used in workflows while also being robust enough to convey content integration or curation in knowledge bases. BCOs that encapsulate tools, platforms, datasets and workflows are FAIR (findable, accessible, interoperable and reusable) compliant. Providing operational workflow and data information facilitates interoperability between platforms and incorporation of future dataset within an HTS analysis for use within industrial, academic and regulatory settings. Cloud-based platforms, including High-performance Integrated Virtual Environment (HIVE), Cancer Genomics Cloud (CGC) and Galaxy, support BCO generation for users. Given the 100K+ userbase between these platforms, BioCompute can be leveraged for workflow documentation. In this paper, we report the availability of platform-dependent and platform-independent BCO tools: HIVE BCO App, CGC BCO App, Galaxy BCO API Extension and BCO Portal. Community engagement was utilized to evaluate tool efficacy. We demonstrate that these tools further advance BCO creation from text editing approaches used in earlier releases of the standard. Moreover, we demonstrate that integrating BCO generation within existing analysis platforms greatly streamlines BCO creation while capturing granular workflow details. We also demonstrate that the BCO tools described in the paper provide an approach to solve the long-standing challenge of standardizing workflow descriptions that are both human and machine readable while accommodating manual and automated curation with evidence tagging. Database URL: https://www.biocomputeobject.org/resources |
format | Online Article Text |
id | pubmed-8009203 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-80092032021-04-02 Bioinformatics tools developed to support BioCompute Objects Patel, Janisha A Dean, Dennis A King, Charles Hadley Xiao, Nan Koc, Soner Minina, Ekaterina Golikov, Anton Brooks, Phillip Kahsay, Robel Navelkar, Rahi Ray, Manisha Roberson, Dave Armstrong, Chris Mazumder, Raja Keeney, Jonathon Database (Oxford) Technical Report Developments in high-throughput sequencing (HTS) result in an exponential increase in the amount of data generated by sequencing experiments, an increase in the complexity of bioinformatics analysis reporting and an increase in the types of data generated. These increases in volume, diversity and complexity of the data generated and their analysis expose the necessity of a structured and standardized reporting template. BioCompute Objects (BCOs) provide the requisite support for communication of HTS data analysis that includes support for workflow, as well as data, curation, accessibility and reproducibility of communication. BCOs standardize how researchers report provenance and the established verification and validation protocols used in workflows while also being robust enough to convey content integration or curation in knowledge bases. BCOs that encapsulate tools, platforms, datasets and workflows are FAIR (findable, accessible, interoperable and reusable) compliant. Providing operational workflow and data information facilitates interoperability between platforms and incorporation of future dataset within an HTS analysis for use within industrial, academic and regulatory settings. Cloud-based platforms, including High-performance Integrated Virtual Environment (HIVE), Cancer Genomics Cloud (CGC) and Galaxy, support BCO generation for users. Given the 100K+ userbase between these platforms, BioCompute can be leveraged for workflow documentation. In this paper, we report the availability of platform-dependent and platform-independent BCO tools: HIVE BCO App, CGC BCO App, Galaxy BCO API Extension and BCO Portal. Community engagement was utilized to evaluate tool efficacy. We demonstrate that these tools further advance BCO creation from text editing approaches used in earlier releases of the standard. Moreover, we demonstrate that integrating BCO generation within existing analysis platforms greatly streamlines BCO creation while capturing granular workflow details. We also demonstrate that the BCO tools described in the paper provide an approach to solve the long-standing challenge of standardizing workflow descriptions that are both human and machine readable while accommodating manual and automated curation with evidence tagging. Database URL: https://www.biocomputeobject.org/resources Oxford University Press 2021-03-30 /pmc/articles/PMC8009203/ /pubmed/33784373 http://dx.doi.org/10.1093/database/baab008 Text en © Oxford University Press 2021. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Report Patel, Janisha A Dean, Dennis A King, Charles Hadley Xiao, Nan Koc, Soner Minina, Ekaterina Golikov, Anton Brooks, Phillip Kahsay, Robel Navelkar, Rahi Ray, Manisha Roberson, Dave Armstrong, Chris Mazumder, Raja Keeney, Jonathon Bioinformatics tools developed to support BioCompute Objects |
title | Bioinformatics tools developed to support BioCompute Objects |
title_full | Bioinformatics tools developed to support BioCompute Objects |
title_fullStr | Bioinformatics tools developed to support BioCompute Objects |
title_full_unstemmed | Bioinformatics tools developed to support BioCompute Objects |
title_short | Bioinformatics tools developed to support BioCompute Objects |
title_sort | bioinformatics tools developed to support biocompute objects |
topic | Technical Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8009203/ https://www.ncbi.nlm.nih.gov/pubmed/33784373 http://dx.doi.org/10.1093/database/baab008 |
work_keys_str_mv | AT pateljanishaa bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT deandennisa bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT kingcharleshadley bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT xiaonan bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT kocsoner bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT mininaekaterina bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT golikovanton bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT brooksphillip bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT kahsayrobel bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT navelkarrahi bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT raymanisha bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT robersondave bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT armstrongchris bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT mazumderraja bioinformaticstoolsdevelopedtosupportbiocomputeobjects AT keeneyjonathon bioinformaticstoolsdevelopedtosupportbiocomputeobjects |