Cargando…

Separating common from distinctive variation

BACKGROUND: Joint and individual variation explained (JIVE), distinct and common simultaneous component analysis (DISCO) and O2-PLS, a two-block (X-Y) latent variable regression method with an integral OSC filter can all be used for the integrated analysis of multiple data sets and decompose them in...

Descripción completa

Detalles Bibliográficos
Autores principales: van der Kloet, Frans M., Sebastián-León, Patricia, Conesa, Ana, Smilde, Age K., Westerhuis, Johan A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4905617/
https://www.ncbi.nlm.nih.gov/pubmed/27294690
http://dx.doi.org/10.1186/s12859-016-1037-2
_version_ 1782437281388298240
author van der Kloet, Frans M.
Sebastián-León, Patricia
Conesa, Ana
Smilde, Age K.
Westerhuis, Johan A.
author_facet van der Kloet, Frans M.
Sebastián-León, Patricia
Conesa, Ana
Smilde, Age K.
Westerhuis, Johan A.
author_sort van der Kloet, Frans M.
collection PubMed
description BACKGROUND: Joint and individual variation explained (JIVE), distinct and common simultaneous component analysis (DISCO) and O2-PLS, a two-block (X-Y) latent variable regression method with an integral OSC filter can all be used for the integrated analysis of multiple data sets and decompose them in three terms: a low(er)-rank approximation capturing common variation across data sets, low(er)-rank approximations for structured variation distinctive for each data set, and residual noise. In this paper these three methods are compared with respect to their mathematical properties and their respective ways of defining common and distinctive variation. RESULTS: The methods are all applied on simulated data and mRNA and miRNA data-sets from GlioBlastoma Multiform (GBM) brain tumors to examine their overlap and differences. When the common variation is abundant, all methods are able to find the correct solution. With real data however, complexities in the data are treated differently by the three methods. CONCLUSIONS: All three methods have their own approach to estimate common and distinctive variation with their specific strength and weaknesses. Due to their orthogonality properties and their used algorithms their view on the data is slightly different. By assuming orthogonality between common and distinctive, true natural or biological phenomena that may not be orthogonal at all might be misinterpreted. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1037-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4905617
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49056172016-06-14 Separating common from distinctive variation van der Kloet, Frans M. Sebastián-León, Patricia Conesa, Ana Smilde, Age K. Westerhuis, Johan A. BMC Bioinformatics Research BACKGROUND: Joint and individual variation explained (JIVE), distinct and common simultaneous component analysis (DISCO) and O2-PLS, a two-block (X-Y) latent variable regression method with an integral OSC filter can all be used for the integrated analysis of multiple data sets and decompose them in three terms: a low(er)-rank approximation capturing common variation across data sets, low(er)-rank approximations for structured variation distinctive for each data set, and residual noise. In this paper these three methods are compared with respect to their mathematical properties and their respective ways of defining common and distinctive variation. RESULTS: The methods are all applied on simulated data and mRNA and miRNA data-sets from GlioBlastoma Multiform (GBM) brain tumors to examine their overlap and differences. When the common variation is abundant, all methods are able to find the correct solution. With real data however, complexities in the data are treated differently by the three methods. CONCLUSIONS: All three methods have their own approach to estimate common and distinctive variation with their specific strength and weaknesses. Due to their orthogonality properties and their used algorithms their view on the data is slightly different. By assuming orthogonality between common and distinctive, true natural or biological phenomena that may not be orthogonal at all might be misinterpreted. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1037-2) contains supplementary material, which is available to authorized users. BioMed Central 2016-06-06 /pmc/articles/PMC4905617/ /pubmed/27294690 http://dx.doi.org/10.1186/s12859-016-1037-2 Text en © van der Kloet et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
van der Kloet, Frans M.
Sebastián-León, Patricia
Conesa, Ana
Smilde, Age K.
Westerhuis, Johan A.
Separating common from distinctive variation
title Separating common from distinctive variation
title_full Separating common from distinctive variation
title_fullStr Separating common from distinctive variation
title_full_unstemmed Separating common from distinctive variation
title_short Separating common from distinctive variation
title_sort separating common from distinctive variation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4905617/
https://www.ncbi.nlm.nih.gov/pubmed/27294690
http://dx.doi.org/10.1186/s12859-016-1037-2
work_keys_str_mv AT vanderkloetfransm separatingcommonfromdistinctivevariation
AT sebastianleonpatricia separatingcommonfromdistinctivevariation
AT conesaana separatingcommonfromdistinctivevariation
AT smildeagek separatingcommonfromdistinctivevariation
AT westerhuisjohana separatingcommonfromdistinctivevariation