Cargando…

Orchestrating and sharing large multimodal data for transparent and reproducible research

Reproducibility is essential to open science, as there is limited relevance for findings that can not be reproduced by independent research groups, regardless of its validity. It is therefore crucial for scientists to describe their experiments in sufficient detail so they can be reproduced, scrutin...

Descripción completa

Detalles Bibliográficos
Autores principales: Mammoliti, Anthony, Smirnov, Petr, Nakano, Minoru, Safikhani, Zhaleh, Eeles, Christopher, Seo, Heewon, Nair, Sisira Kadambat, Mer, Arvind S., Smith, Ian, Ho, Chantal, Beri, Gangesh, Kusko, Rebecca, Lin, Eva, Yu, Yihong, Martin, Scott, Hafner, Marc, Haibe-Kains, Benjamin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8490371/
https://www.ncbi.nlm.nih.gov/pubmed/34608132
http://dx.doi.org/10.1038/s41467-021-25974-w
_version_ 1784578508312281088
author Mammoliti, Anthony
Smirnov, Petr
Nakano, Minoru
Safikhani, Zhaleh
Eeles, Christopher
Seo, Heewon
Nair, Sisira Kadambat
Mer, Arvind S.
Smith, Ian
Ho, Chantal
Beri, Gangesh
Kusko, Rebecca
Lin, Eva
Yu, Yihong
Martin, Scott
Hafner, Marc
Haibe-Kains, Benjamin
author_facet Mammoliti, Anthony
Smirnov, Petr
Nakano, Minoru
Safikhani, Zhaleh
Eeles, Christopher
Seo, Heewon
Nair, Sisira Kadambat
Mer, Arvind S.
Smith, Ian
Ho, Chantal
Beri, Gangesh
Kusko, Rebecca
Lin, Eva
Yu, Yihong
Martin, Scott
Hafner, Marc
Haibe-Kains, Benjamin
author_sort Mammoliti, Anthony
collection PubMed
description Reproducibility is essential to open science, as there is limited relevance for findings that can not be reproduced by independent research groups, regardless of its validity. It is therefore crucial for scientists to describe their experiments in sufficient detail so they can be reproduced, scrutinized, challenged, and built upon. However, the intrinsic complexity and continuous growth of biomedical data makes it increasingly difficult to process, analyze, and share with the community in a FAIR (findable, accessible, interoperable, and reusable) manner. To overcome these issues, we created a cloud-based platform called ORCESTRA (orcestra.ca), which provides a flexible framework for the reproducible processing of multimodal biomedical data. It enables processing of clinical, genomic and perturbation profiles of cancer samples through automated processing pipelines that are user-customizable. ORCESTRA creates integrated and fully documented data objects with persistent identifiers (DOI) and manages multiple dataset versions, which can be shared for future studies.
format Online
Article
Text
id pubmed-8490371
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-84903712021-10-07 Orchestrating and sharing large multimodal data for transparent and reproducible research Mammoliti, Anthony Smirnov, Petr Nakano, Minoru Safikhani, Zhaleh Eeles, Christopher Seo, Heewon Nair, Sisira Kadambat Mer, Arvind S. Smith, Ian Ho, Chantal Beri, Gangesh Kusko, Rebecca Lin, Eva Yu, Yihong Martin, Scott Hafner, Marc Haibe-Kains, Benjamin Nat Commun Article Reproducibility is essential to open science, as there is limited relevance for findings that can not be reproduced by independent research groups, regardless of its validity. It is therefore crucial for scientists to describe their experiments in sufficient detail so they can be reproduced, scrutinized, challenged, and built upon. However, the intrinsic complexity and continuous growth of biomedical data makes it increasingly difficult to process, analyze, and share with the community in a FAIR (findable, accessible, interoperable, and reusable) manner. To overcome these issues, we created a cloud-based platform called ORCESTRA (orcestra.ca), which provides a flexible framework for the reproducible processing of multimodal biomedical data. It enables processing of clinical, genomic and perturbation profiles of cancer samples through automated processing pipelines that are user-customizable. ORCESTRA creates integrated and fully documented data objects with persistent identifiers (DOI) and manages multiple dataset versions, which can be shared for future studies. Nature Publishing Group UK 2021-10-04 /pmc/articles/PMC8490371/ /pubmed/34608132 http://dx.doi.org/10.1038/s41467-021-25974-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Mammoliti, Anthony
Smirnov, Petr
Nakano, Minoru
Safikhani, Zhaleh
Eeles, Christopher
Seo, Heewon
Nair, Sisira Kadambat
Mer, Arvind S.
Smith, Ian
Ho, Chantal
Beri, Gangesh
Kusko, Rebecca
Lin, Eva
Yu, Yihong
Martin, Scott
Hafner, Marc
Haibe-Kains, Benjamin
Orchestrating and sharing large multimodal data for transparent and reproducible research
title Orchestrating and sharing large multimodal data for transparent and reproducible research
title_full Orchestrating and sharing large multimodal data for transparent and reproducible research
title_fullStr Orchestrating and sharing large multimodal data for transparent and reproducible research
title_full_unstemmed Orchestrating and sharing large multimodal data for transparent and reproducible research
title_short Orchestrating and sharing large multimodal data for transparent and reproducible research
title_sort orchestrating and sharing large multimodal data for transparent and reproducible research
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8490371/
https://www.ncbi.nlm.nih.gov/pubmed/34608132
http://dx.doi.org/10.1038/s41467-021-25974-w
work_keys_str_mv AT mammolitianthony orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT smirnovpetr orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT nakanominoru orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT safikhanizhaleh orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT eeleschristopher orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT seoheewon orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT nairsisirakadambat orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT merarvinds orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT smithian orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT hochantal orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT berigangesh orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT kuskorebecca orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT lineva orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT yuyihong orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT martinscott orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT hafnermarc orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch
AT haibekainsbenjamin orchestratingandsharinglargemultimodaldatafortransparentandreproducibleresearch