
Bioinformatics tools developed to support BioCompute Objects

Developments in high-throughput sequencing (HTS) result in an exponential increase in the amount of data generated by sequencing experiments, an increase in the complexity of bioinformatics analysis reporting and an increase in the types of data generated. These increases in volume, diversity and co...


Bibliographic Details
Main authors: Patel, Janisha A; Dean, Dennis A; King, Charles Hadley; Xiao, Nan; Koc, Soner; Minina, Ekaterina; Golikov, Anton; Brooks, Phillip; Kahsay, Robel; Navelkar, Rahi; Ray, Manisha; Roberson, Dave; Armstrong, Chris; Mazumder, Raja; Keeney, Jonathon
Format: Online Article Text
Language: English
Published: Oxford University Press 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8009203/
https://www.ncbi.nlm.nih.gov/pubmed/33784373
http://dx.doi.org/10.1093/database/baab008
collection PubMed
description Developments in high-throughput sequencing (HTS) result in an exponential increase in the amount of data generated by sequencing experiments, an increase in the complexity of bioinformatics analysis reporting and an increase in the types of data generated. These increases in volume, diversity and complexity of the data generated and their analysis expose the necessity of a structured and standardized reporting template. BioCompute Objects (BCOs) provide the requisite support for communication of HTS data analysis that includes support for workflow, as well as data, curation, accessibility and reproducibility of communication. BCOs standardize how researchers report provenance and the established verification and validation protocols used in workflows while also being robust enough to convey content integration or curation in knowledge bases. BCOs that encapsulate tools, platforms, datasets and workflows are FAIR (findable, accessible, interoperable and reusable) compliant. Providing operational workflow and data information facilitates interoperability between platforms and incorporation of future datasets within an HTS analysis for use within industrial, academic and regulatory settings. Cloud-based platforms, including High-performance Integrated Virtual Environment (HIVE), Cancer Genomics Cloud (CGC) and Galaxy, support BCO generation for users. Given the 100K+ userbase between these platforms, BioCompute can be leveraged for workflow documentation. In this paper, we report the availability of platform-dependent and platform-independent BCO tools: HIVE BCO App, CGC BCO App, Galaxy BCO API Extension and BCO Portal. Community engagement was utilized to evaluate tool efficacy. We demonstrate that these tools further advance BCO creation from text editing approaches used in earlier releases of the standard. Moreover, we demonstrate that integrating BCO generation within existing analysis platforms greatly streamlines BCO creation while capturing granular workflow details. We also demonstrate that the BCO tools described in the paper provide an approach to solve the long-standing challenge of standardizing workflow descriptions that are both human and machine readable while accommodating manual and automated curation with evidence tagging. Database URL: https://www.biocomputeobject.org/resources
id pubmed-8009203
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-8009203 2021-04-02 Bioinformatics tools developed to support BioCompute Objects. Database (Oxford). Technical Report. Oxford University Press 2021-03-30 /pmc/articles/PMC8009203/ /pubmed/33784373 http://dx.doi.org/10.1093/database/baab008 Text en © Oxford University Press 2021. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
topic Technical Report