Cargando…

Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework

Analysis of “omics” data is often a long and segmented process, encompassing multiple stages from initial data collection to processing, quality control and visualization. The cross-modal nature of recent genomic analyses renders this process challenging to both automate and standardize; consequentl...

Descripción completa

Detalles Bibliográficos
Autores principales: Das, Samir, Lecours Boucher, Xavier, Rogers, Christine, Makowski, Carolina, Chouinard-Decorte, François, Oros Klein, Kathleen, Beck, Natacha, Rioux, Pierre, Brown, Shawn T., Mohaddes, Zia, Zweber, Cole, Foing, Victoria, Forest, Marie, O’Donnell, Kieran J., Clark, Joanne, Meaney, Michael J., Greenwood, Celia M. T., Evans, Alan C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6315165/
https://www.ncbi.nlm.nih.gov/pubmed/30631270
http://dx.doi.org/10.3389/fninf.2018.00091
_version_ 1783384228678860800
author Das, Samir
Lecours Boucher, Xavier
Rogers, Christine
Makowski, Carolina
Chouinard-Decorte, François
Oros Klein, Kathleen
Beck, Natacha
Rioux, Pierre
Brown, Shawn T.
Mohaddes, Zia
Zweber, Cole
Foing, Victoria
Forest, Marie
O’Donnell, Kieran J.
Clark, Joanne
Meaney, Michael J.
Greenwood, Celia M. T.
Evans, Alan C.
author_facet Das, Samir
Lecours Boucher, Xavier
Rogers, Christine
Makowski, Carolina
Chouinard-Decorte, François
Oros Klein, Kathleen
Beck, Natacha
Rioux, Pierre
Brown, Shawn T.
Mohaddes, Zia
Zweber, Cole
Foing, Victoria
Forest, Marie
O’Donnell, Kieran J.
Clark, Joanne
Meaney, Michael J.
Greenwood, Celia M. T.
Evans, Alan C.
author_sort Das, Samir
collection PubMed
description Analysis of “omics” data is often a long and segmented process, encompassing multiple stages from initial data collection to processing, quality control and visualization. The cross-modal nature of recent genomic analyses renders this process challenging to both automate and standardize; consequently, users often resort to manual interventions that compromise data reliability and reproducibility. This in turn can produce multiple versions of datasets across storage systems. As a result, scientists can lose significant time and resources trying to execute and monitor their analytical workflows and encounter difficulties sharing versioned data. In 2015, the Ludmer Centre for Neuroinformatics and Mental Health at McGill University brought together expertise from the Douglas Mental Health University Institute, the Lady Davis Institute and the Montreal Neurological Institute (MNI) to form a genetics/epigenetics working group. The objectives of this working group are to: (i) design an automated and seamless process for (epi)genetic data that consolidates heterogeneous datasets into the LORIS open-source data platform; (ii) streamline data analysis; (iii) integrate results with provenance information; and (iv) facilitate structured and versioned sharing of pipelines for optimized reproducibility using high-performance computing (HPC) environments via the CBRAIN processing portal. This article outlines the resulting generalizable “omics” framework and its benefits, specifically, the ability to: (i) integrate multiple types of biological and multi-modal datasets (imaging, clinical, demographics and behavioral); (ii) automate the process of launching analysis pipelines on HPC platforms; (iii) remove the bioinformatic barriers that are inherent to this process; (iv) ensure standardization and transparent sharing of processing pipelines to improve computational consistency; (v) store results in a queryable web interface; (vi) offer visualization tools to better view the data; and (vii) provide the mechanisms to ensure usability and reproducibility. This framework for workflows facilitates brain research discovery by reducing human error through automation of analysis pipelines and seamless linking of multimodal data, allowing investigators to focus on research instead of data handling.
format Online
Article
Text
id pubmed-6315165
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-63151652019-01-10 Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework Das, Samir Lecours Boucher, Xavier Rogers, Christine Makowski, Carolina Chouinard-Decorte, François Oros Klein, Kathleen Beck, Natacha Rioux, Pierre Brown, Shawn T. Mohaddes, Zia Zweber, Cole Foing, Victoria Forest, Marie O’Donnell, Kieran J. Clark, Joanne Meaney, Michael J. Greenwood, Celia M. T. Evans, Alan C. Front Neuroinform Neuroscience Analysis of “omics” data is often a long and segmented process, encompassing multiple stages from initial data collection to processing, quality control and visualization. The cross-modal nature of recent genomic analyses renders this process challenging to both automate and standardize; consequently, users often resort to manual interventions that compromise data reliability and reproducibility. This in turn can produce multiple versions of datasets across storage systems. As a result, scientists can lose significant time and resources trying to execute and monitor their analytical workflows and encounter difficulties sharing versioned data. In 2015, the Ludmer Centre for Neuroinformatics and Mental Health at McGill University brought together expertise from the Douglas Mental Health University Institute, the Lady Davis Institute and the Montreal Neurological Institute (MNI) to form a genetics/epigenetics working group. The objectives of this working group are to: (i) design an automated and seamless process for (epi)genetic data that consolidates heterogeneous datasets into the LORIS open-source data platform; (ii) streamline data analysis; (iii) integrate results with provenance information; and (iv) facilitate structured and versioned sharing of pipelines for optimized reproducibility using high-performance computing (HPC) environments via the CBRAIN processing portal. This article outlines the resulting generalizable “omics” framework and its benefits, specifically, the ability to: (i) integrate multiple types of biological and multi-modal datasets (imaging, clinical, demographics and behavioral); (ii) automate the process of launching analysis pipelines on HPC platforms; (iii) remove the bioinformatic barriers that are inherent to this process; (iv) ensure standardization and transparent sharing of processing pipelines to improve computational consistency; (v) store results in a queryable web interface; (vi) offer visualization tools to better view the data; and (vii) provide the mechanisms to ensure usability and reproducibility. This framework for workflows facilitates brain research discovery by reducing human error through automation of analysis pipelines and seamless linking of multimodal data, allowing investigators to focus on research instead of data handling. Frontiers Media S.A. 2018-12-18 /pmc/articles/PMC6315165/ /pubmed/30631270 http://dx.doi.org/10.3389/fninf.2018.00091 Text en Copyright © 2018 Das, Lecours Boucher, Rogers, Makowski, Chouinard-Decorte, Oros Klein, Beck, Rioux, Brown, Mohaddes, Zweber, Foing, Forest, O’Donnell, Clark, Meaney, Greenwood and Evans. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Das, Samir
Lecours Boucher, Xavier
Rogers, Christine
Makowski, Carolina
Chouinard-Decorte, François
Oros Klein, Kathleen
Beck, Natacha
Rioux, Pierre
Brown, Shawn T.
Mohaddes, Zia
Zweber, Cole
Foing, Victoria
Forest, Marie
O’Donnell, Kieran J.
Clark, Joanne
Meaney, Michael J.
Greenwood, Celia M. T.
Evans, Alan C.
Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework
title Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework
title_full Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework
title_fullStr Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework
title_full_unstemmed Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework
title_short Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework
title_sort integration of “omics” data and phenotypic data within a unified extensible multimodal framework
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6315165/
https://www.ncbi.nlm.nih.gov/pubmed/30631270
http://dx.doi.org/10.3389/fninf.2018.00091
work_keys_str_mv AT dassamir integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT lecoursboucherxavier integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT rogerschristine integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT makowskicarolina integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT chouinarddecortefrancois integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT oroskleinkathleen integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT becknatacha integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT riouxpierre integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT brownshawnt integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT mohaddeszia integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT zwebercole integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT foingvictoria integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT forestmarie integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT odonnellkieranj integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT clarkjoanne integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT meaneymichaelj integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT greenwoodceliamt integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework
AT evansalanc integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework