Cargando…

A computational framework for complex disease stratification from multiple large-scale datasets

BACKGROUND: Multilevel data integration is becoming a major area of research in systems biology. Within this area, multi-‘omics datasets on complex diseases are becoming more readily available and there is a need to set standards and good practices for integrated analysis of biological, clinical and...

Descripción completa

Detalles Bibliográficos
Autores principales: De Meulder, Bertrand, Lefaudeux, Diane, Bansal, Aruna T., Mazein, Alexander, Chaiboonchoe, Amphun, Ahmed, Hassan, Balaur, Irina, Saqi, Mansoor, Pellet, Johann, Ballereau, Stéphane, Lemonnier, Nathanaël, Sun, Kai, Pandis, Ioannis, Yang, Xian, Batuwitage, Manohara, Kretsos, Kosmas, van Eyll, Jonathan, Bedding, Alun, Davison, Timothy, Dodson, Paul, Larminie, Christopher, Postle, Anthony, Corfield, Julie, Djukanovic, Ratko, Chung, Kian Fan, Adcock, Ian M., Guo, Yi-Ke, Sterk, Peter J., Manta, Alexander, Rowe, Anthony, Baribaud, Frédéric, Auffray, Charles
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5975674/
https://www.ncbi.nlm.nih.gov/pubmed/29843806
http://dx.doi.org/10.1186/s12918-018-0556-z
Descripción
Sumario:BACKGROUND: Multilevel data integration is becoming a major area of research in systems biology. Within this area, multi-‘omics datasets on complex diseases are becoming more readily available and there is a need to set standards and good practices for integrated analysis of biological, clinical and environmental data. We present a framework to plan and generate single and multi-‘omics signatures of disease states. METHODS: The framework is divided into four major steps: dataset subsetting, feature filtering, ‘omics-based clustering and biomarker identification. RESULTS: We illustrate the usefulness of this framework by identifying potential patient clusters based on integrated multi-‘omics signatures in a publicly available ovarian cystadenocarcinoma dataset. The analysis generated a higher number of stable and clinically relevant clusters than previously reported, and enabled the generation of predictive models of patient outcomes. CONCLUSIONS: This framework will help health researchers plan and perform multi-‘omics big data analyses to generate hypotheses and make sense of their rich, diverse and ever growing datasets, to enable implementation of translational P4 medicine. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12918-018-0556-z) contains supplementary material, which is available to authorized users.