Cargando…

Evaluation of O2PLS in Omics data integration

BACKGROUND: Rapid computational and technological developments made large amounts of omics data available in different biological levels. It is becoming clear that simultaneous data analysis methods are needed for better interpretation and understanding of the underlying systems biology. Different m...

Descripción completa

Detalles Bibliográficos
Autores principales: Bouhaddani, Said el, Houwing-Duistermaat, Jeanine, Salo, Perttu, Perola, Markus, Jongbloed, Geurt, Uh, Hae-Won
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4959391/
https://www.ncbi.nlm.nih.gov/pubmed/26822911
http://dx.doi.org/10.1186/s12859-015-0854-z
_version_ 1782444397554565120
author Bouhaddani, Said el
Houwing-Duistermaat, Jeanine
Salo, Perttu
Perola, Markus
Jongbloed, Geurt
Uh, Hae-Won
author_facet Bouhaddani, Said el
Houwing-Duistermaat, Jeanine
Salo, Perttu
Perola, Markus
Jongbloed, Geurt
Uh, Hae-Won
author_sort Bouhaddani, Said el
collection PubMed
description BACKGROUND: Rapid computational and technological developments made large amounts of omics data available in different biological levels. It is becoming clear that simultaneous data analysis methods are needed for better interpretation and understanding of the underlying systems biology. Different methods have been proposed for this task, among them Partial Least Squares (PLS) related methods. To also deal with orthogonal variation, systematic variation in the data unrelated to one another, we consider the Two-way Orthogonal PLS (O2PLS): an integrative data analysis method which is capable of modeling systematic variation, while providing more parsimonious models aiding interpretation. RESULTS: A simulation study to assess the performance of O2PLS showed positive results in both low and higher dimensions. More noise (50 % of the data) only affected the systematic part estimates. A data analysis was conducted using data on metabolomics and transcriptomics from a large Finnish cohort (DILGOM). A previous sequential study, using the same data, showed significant correlations between the Lipo-Leukocyte (LL) module and lipoprotein metabolites. The O2PLS results were in agreement with these findings, identifying almost the same set of co-varying variables. Moreover, our integrative approach identified other associative genes and metabolites, while taking into account systematic variation in the data. Including orthogonal components enhanced overall fit, but the orthogonal variation was difficult to interpret. CONCLUSIONS: Simulations showed that the O2PLS estimates were close to the true parameters in both low and higher dimensions. In the presence of more noise (50 %), the orthogonal part estimates could not distinguish well between joint and unique variation. The joint estimates were not systematically affected. Simultaneous analysis with O2PLS on metabolome and transcriptome data showed that the LL module, together with VLDL and HDL metabolites, were important for the metabolomic and transcriptomic relation. This is in agreement with an earlier study. In addition more gene expression and metabolites are identified being important for the joint covariation.
format Online
Article
Text
id pubmed-4959391
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49593912016-08-01 Evaluation of O2PLS in Omics data integration Bouhaddani, Said el Houwing-Duistermaat, Jeanine Salo, Perttu Perola, Markus Jongbloed, Geurt Uh, Hae-Won BMC Bioinformatics Proceedings BACKGROUND: Rapid computational and technological developments made large amounts of omics data available in different biological levels. It is becoming clear that simultaneous data analysis methods are needed for better interpretation and understanding of the underlying systems biology. Different methods have been proposed for this task, among them Partial Least Squares (PLS) related methods. To also deal with orthogonal variation, systematic variation in the data unrelated to one another, we consider the Two-way Orthogonal PLS (O2PLS): an integrative data analysis method which is capable of modeling systematic variation, while providing more parsimonious models aiding interpretation. RESULTS: A simulation study to assess the performance of O2PLS showed positive results in both low and higher dimensions. More noise (50 % of the data) only affected the systematic part estimates. A data analysis was conducted using data on metabolomics and transcriptomics from a large Finnish cohort (DILGOM). A previous sequential study, using the same data, showed significant correlations between the Lipo-Leukocyte (LL) module and lipoprotein metabolites. The O2PLS results were in agreement with these findings, identifying almost the same set of co-varying variables. Moreover, our integrative approach identified other associative genes and metabolites, while taking into account systematic variation in the data. Including orthogonal components enhanced overall fit, but the orthogonal variation was difficult to interpret. CONCLUSIONS: Simulations showed that the O2PLS estimates were close to the true parameters in both low and higher dimensions. In the presence of more noise (50 %), the orthogonal part estimates could not distinguish well between joint and unique variation. The joint estimates were not systematically affected. Simultaneous analysis with O2PLS on metabolome and transcriptome data showed that the LL module, together with VLDL and HDL metabolites, were important for the metabolomic and transcriptomic relation. This is in agreement with an earlier study. In addition more gene expression and metabolites are identified being important for the joint covariation. BioMed Central 2016-01-20 /pmc/articles/PMC4959391/ /pubmed/26822911 http://dx.doi.org/10.1186/s12859-015-0854-z Text en © Bouhaddani et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Bouhaddani, Said el
Houwing-Duistermaat, Jeanine
Salo, Perttu
Perola, Markus
Jongbloed, Geurt
Uh, Hae-Won
Evaluation of O2PLS in Omics data integration
title Evaluation of O2PLS in Omics data integration
title_full Evaluation of O2PLS in Omics data integration
title_fullStr Evaluation of O2PLS in Omics data integration
title_full_unstemmed Evaluation of O2PLS in Omics data integration
title_short Evaluation of O2PLS in Omics data integration
title_sort evaluation of o2pls in omics data integration
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4959391/
https://www.ncbi.nlm.nih.gov/pubmed/26822911
http://dx.doi.org/10.1186/s12859-015-0854-z
work_keys_str_mv AT bouhaddanisaidel evaluationofo2plsinomicsdataintegration
AT houwingduistermaatjeanine evaluationofo2plsinomicsdataintegration
AT saloperttu evaluationofo2plsinomicsdataintegration
AT perolamarkus evaluationofo2plsinomicsdataintegration
AT jongbloedgeurt evaluationofo2plsinomicsdataintegration
AT uhhaewon evaluationofo2plsinomicsdataintegration