Cargando…
Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework
Analysis of “omics” data is often a long and segmented process, encompassing multiple stages from initial data collection to processing, quality control and visualization. The cross-modal nature of recent genomic analyses renders this process challenging to both automate and standardize; consequentl...
Autores principales: | , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6315165/ https://www.ncbi.nlm.nih.gov/pubmed/30631270 http://dx.doi.org/10.3389/fninf.2018.00091 |
_version_ | 1783384228678860800 |
---|---|
author | Das, Samir Lecours Boucher, Xavier Rogers, Christine Makowski, Carolina Chouinard-Decorte, François Oros Klein, Kathleen Beck, Natacha Rioux, Pierre Brown, Shawn T. Mohaddes, Zia Zweber, Cole Foing, Victoria Forest, Marie O’Donnell, Kieran J. Clark, Joanne Meaney, Michael J. Greenwood, Celia M. T. Evans, Alan C. |
author_facet | Das, Samir Lecours Boucher, Xavier Rogers, Christine Makowski, Carolina Chouinard-Decorte, François Oros Klein, Kathleen Beck, Natacha Rioux, Pierre Brown, Shawn T. Mohaddes, Zia Zweber, Cole Foing, Victoria Forest, Marie O’Donnell, Kieran J. Clark, Joanne Meaney, Michael J. Greenwood, Celia M. T. Evans, Alan C. |
author_sort | Das, Samir |
collection | PubMed |
description | Analysis of “omics” data is often a long and segmented process, encompassing multiple stages from initial data collection to processing, quality control and visualization. The cross-modal nature of recent genomic analyses renders this process challenging to both automate and standardize; consequently, users often resort to manual interventions that compromise data reliability and reproducibility. This in turn can produce multiple versions of datasets across storage systems. As a result, scientists can lose significant time and resources trying to execute and monitor their analytical workflows and encounter difficulties sharing versioned data. In 2015, the Ludmer Centre for Neuroinformatics and Mental Health at McGill University brought together expertise from the Douglas Mental Health University Institute, the Lady Davis Institute and the Montreal Neurological Institute (MNI) to form a genetics/epigenetics working group. The objectives of this working group are to: (i) design an automated and seamless process for (epi)genetic data that consolidates heterogeneous datasets into the LORIS open-source data platform; (ii) streamline data analysis; (iii) integrate results with provenance information; and (iv) facilitate structured and versioned sharing of pipelines for optimized reproducibility using high-performance computing (HPC) environments via the CBRAIN processing portal. This article outlines the resulting generalizable “omics” framework and its benefits, specifically, the ability to: (i) integrate multiple types of biological and multi-modal datasets (imaging, clinical, demographics and behavioral); (ii) automate the process of launching analysis pipelines on HPC platforms; (iii) remove the bioinformatic barriers that are inherent to this process; (iv) ensure standardization and transparent sharing of processing pipelines to improve computational consistency; (v) store results in a queryable web interface; (vi) offer visualization tools to better view the data; and (vii) provide the mechanisms to ensure usability and reproducibility. This framework for workflows facilitates brain research discovery by reducing human error through automation of analysis pipelines and seamless linking of multimodal data, allowing investigators to focus on research instead of data handling. |
format | Online Article Text |
id | pubmed-6315165 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-63151652019-01-10 Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework Das, Samir Lecours Boucher, Xavier Rogers, Christine Makowski, Carolina Chouinard-Decorte, François Oros Klein, Kathleen Beck, Natacha Rioux, Pierre Brown, Shawn T. Mohaddes, Zia Zweber, Cole Foing, Victoria Forest, Marie O’Donnell, Kieran J. Clark, Joanne Meaney, Michael J. Greenwood, Celia M. T. Evans, Alan C. Front Neuroinform Neuroscience Analysis of “omics” data is often a long and segmented process, encompassing multiple stages from initial data collection to processing, quality control and visualization. The cross-modal nature of recent genomic analyses renders this process challenging to both automate and standardize; consequently, users often resort to manual interventions that compromise data reliability and reproducibility. This in turn can produce multiple versions of datasets across storage systems. As a result, scientists can lose significant time and resources trying to execute and monitor their analytical workflows and encounter difficulties sharing versioned data. In 2015, the Ludmer Centre for Neuroinformatics and Mental Health at McGill University brought together expertise from the Douglas Mental Health University Institute, the Lady Davis Institute and the Montreal Neurological Institute (MNI) to form a genetics/epigenetics working group. The objectives of this working group are to: (i) design an automated and seamless process for (epi)genetic data that consolidates heterogeneous datasets into the LORIS open-source data platform; (ii) streamline data analysis; (iii) integrate results with provenance information; and (iv) facilitate structured and versioned sharing of pipelines for optimized reproducibility using high-performance computing (HPC) environments via the CBRAIN processing portal. This article outlines the resulting generalizable “omics” framework and its benefits, specifically, the ability to: (i) integrate multiple types of biological and multi-modal datasets (imaging, clinical, demographics and behavioral); (ii) automate the process of launching analysis pipelines on HPC platforms; (iii) remove the bioinformatic barriers that are inherent to this process; (iv) ensure standardization and transparent sharing of processing pipelines to improve computational consistency; (v) store results in a queryable web interface; (vi) offer visualization tools to better view the data; and (vii) provide the mechanisms to ensure usability and reproducibility. This framework for workflows facilitates brain research discovery by reducing human error through automation of analysis pipelines and seamless linking of multimodal data, allowing investigators to focus on research instead of data handling. Frontiers Media S.A. 2018-12-18 /pmc/articles/PMC6315165/ /pubmed/30631270 http://dx.doi.org/10.3389/fninf.2018.00091 Text en Copyright © 2018 Das, Lecours Boucher, Rogers, Makowski, Chouinard-Decorte, Oros Klein, Beck, Rioux, Brown, Mohaddes, Zweber, Foing, Forest, O’Donnell, Clark, Meaney, Greenwood and Evans. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Das, Samir Lecours Boucher, Xavier Rogers, Christine Makowski, Carolina Chouinard-Decorte, François Oros Klein, Kathleen Beck, Natacha Rioux, Pierre Brown, Shawn T. Mohaddes, Zia Zweber, Cole Foing, Victoria Forest, Marie O’Donnell, Kieran J. Clark, Joanne Meaney, Michael J. Greenwood, Celia M. T. Evans, Alan C. Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework |
title | Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework |
title_full | Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework |
title_fullStr | Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework |
title_full_unstemmed | Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework |
title_short | Integration of “omics” Data and Phenotypic Data Within a Unified Extensible Multimodal Framework |
title_sort | integration of “omics” data and phenotypic data within a unified extensible multimodal framework |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6315165/ https://www.ncbi.nlm.nih.gov/pubmed/30631270 http://dx.doi.org/10.3389/fninf.2018.00091 |
work_keys_str_mv | AT dassamir integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT lecoursboucherxavier integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT rogerschristine integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT makowskicarolina integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT chouinarddecortefrancois integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT oroskleinkathleen integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT becknatacha integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT riouxpierre integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT brownshawnt integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT mohaddeszia integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT zwebercole integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT foingvictoria integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT forestmarie integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT odonnellkieranj integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT clarkjoanne integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT meaneymichaelj integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT greenwoodceliamt integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework AT evansalanc integrationofomicsdataandphenotypicdatawithinaunifiedextensiblemultimodalframework |