A computational framework for complex disease stratification from multiple large-scale datasets
BACKGROUND: Multilevel data integration is becoming a major area of research in systems biology. Within this area, multi-‘omics datasets on complex diseases are becoming more readily available and there is a need to set standards and good practices for integrated analysis of biological, clinical and...
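Main authors: | De Meulder, Bertrand; Lefaudeux, Diane; Bansal, Aruna T.; Mazein, Alexander; Chaiboonchoe, Amphun; Ahmed, Hassan; Balaur, Irina; Saqi, Mansoor; Pellet, Johann; Ballereau, Stéphane; Lemonnier, Nathanaël; Sun, Kai; Pandis, Ioannis; Yang, Xian; Batuwitage, Manohara; Kretsos, Kosmas; van Eyll, Jonathan; Bedding, Alun; Davison, Timothy; Dodson, Paul; Larminie, Christopher; Postle, Anthony; Corfield, Julie; Djukanovic, Ratko; Chung, Kian Fan; Adcock, Ian M.; Guo, Yi-Ke; Sterk, Peter J.; Manta, Alexander; Rowe, Anthony; Baribaud, Frédéric; Auffray, Charles |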
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central, 2018 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5975674/ https://www.ncbi.nlm.nih.gov/pubmed/29843806 http://dx.doi.org/10.1186/s12918-018-0556-z |
_version_ | 1783327031585406976 |
---|---|
author | De Meulder, Bertrand Lefaudeux, Diane Bansal, Aruna T. Mazein, Alexander Chaiboonchoe, Amphun Ahmed, Hassan Balaur, Irina Saqi, Mansoor Pellet, Johann Ballereau, Stéphane Lemonnier, Nathanaël Sun, Kai Pandis, Ioannis Yang, Xian Batuwitage, Manohara Kretsos, Kosmas van Eyll, Jonathan Bedding, Alun Davison, Timothy Dodson, Paul Larminie, Christopher Postle, Anthony Corfield, Julie Djukanovic, Ratko Chung, Kian Fan Adcock, Ian M. Guo, Yi-Ke Sterk, Peter J. Manta, Alexander Rowe, Anthony Baribaud, Frédéric Auffray, Charles |
author_sort | De Meulder, Bertrand |
collection | PubMed |
description | BACKGROUND: Multilevel data integration is becoming a major area of research in systems biology. Within this area, multi-‘omics datasets on complex diseases are becoming more readily available and there is a need to set standards and good practices for integrated analysis of biological, clinical and environmental data. We present a framework to plan and generate single and multi-‘omics signatures of disease states. METHODS: The framework is divided into four major steps: dataset subsetting, feature filtering, ‘omics-based clustering and biomarker identification. RESULTS: We illustrate the usefulness of this framework by identifying potential patient clusters based on integrated multi-‘omics signatures in a publicly available ovarian cystadenocarcinoma dataset. The analysis generated a higher number of stable and clinically relevant clusters than previously reported, and enabled the generation of predictive models of patient outcomes. CONCLUSIONS: This framework will help health researchers plan and perform multi-‘omics big data analyses to generate hypotheses and make sense of their rich, diverse and ever growing datasets, to enable implementation of translational P4 medicine. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12918-018-0556-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5975674 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-5975674 2018-05-31 A computational framework for complex disease stratification from multiple large-scale datasets. BMC Syst Biol. Research Article. BioMed Central 2018-05-29 /pmc/articles/PMC5975674/ /pubmed/29843806 http://dx.doi.org/10.1186/s12918-018-0556-z Text en © The Author(s). 2018 Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
title | A computational framework for complex disease stratification from multiple large-scale datasets |
topic | Research Article |