Cargando…

A Higher-Order Generalized Singular Value Decomposition for Comparison of Global mRNA Expression from Multiple Organisms

The number of high-dimensional datasets recording multiple aspects of a single phenomenon is increasing in many areas of science, accompanied by a need for mathematical frameworks that can compare multiple large-scale matrices with different row dimensions. The only such framework to date, the gener...

Descripción completa

Detalles Bibliográficos
Autores principales: Ponnapalli, Sri Priya, Saunders, Michael A., Van Loan, Charles F., Alter, Orly
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3245232/
https://www.ncbi.nlm.nih.gov/pubmed/22216090
http://dx.doi.org/10.1371/journal.pone.0028072
_version_ 1782219825766989824
author Ponnapalli, Sri Priya
Saunders, Michael A.
Van Loan, Charles F.
Alter, Orly
author_facet Ponnapalli, Sri Priya
Saunders, Michael A.
Van Loan, Charles F.
Alter, Orly
author_sort Ponnapalli, Sri Priya
collection PubMed
description The number of high-dimensional datasets recording multiple aspects of a single phenomenon is increasing in many areas of science, accompanied by a need for mathematical frameworks that can compare multiple large-scale matrices with different row dimensions. The only such framework to date, the generalized singular value decomposition (GSVD), is limited to two matrices. We mathematically define a higher-order GSVD (HO GSVD) for N≥2 matrices [Image: see text], each with full column rank. Each matrix is exactly factored as D(i) = U(i)Σ(i)V(T), where V, identical in all factorizations, is obtained from the eigensystem SV = VΛ of the arithmetic mean S of all pairwise quotients [Image: see text] of the matrices [Image: see text], i≠j. We prove that this decomposition extends to higher orders almost all of the mathematical properties of the GSVD. The matrix S is nondefective with V and Λ real. Its eigenvalues satisfy λ(k)≥1. Equality holds if and only if the corresponding eigenvector v(k) is a right basis vector of equal significance in all matrices D(i) and D(j), that is σ(i,k)/σ(j,k) = 1 for all i and j, and the corresponding left basis vector u(i,k) is orthogonal to all other vectors in U(i) for all i. The eigenvalues λ(k) = 1, therefore, define the “common HO GSVD subspace.” We illustrate the HO GSVD with a comparison of genome-scale cell-cycle mRNA expression from S. pombe, S. cerevisiae and human. Unlike existing algorithms, a mapping among the genes of these disparate organisms is not required. We find that the approximately common HO GSVD subspace represents the cell-cycle mRNA expression oscillations, which are similar among the datasets. Simultaneous reconstruction in the common subspace, therefore, removes the experimental artifacts, which are dissimilar, from the datasets. In the simultaneous sequence-independent classification of the genes of the three organisms in this common subspace, genes of highly conserved sequences but significantly different cell-cycle peak times are correctly classified.
format Online
Article
Text
id pubmed-3245232
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32452322012-01-03 A Higher-Order Generalized Singular Value Decomposition for Comparison of Global mRNA Expression from Multiple Organisms Ponnapalli, Sri Priya Saunders, Michael A. Van Loan, Charles F. Alter, Orly PLoS One Research Article The number of high-dimensional datasets recording multiple aspects of a single phenomenon is increasing in many areas of science, accompanied by a need for mathematical frameworks that can compare multiple large-scale matrices with different row dimensions. The only such framework to date, the generalized singular value decomposition (GSVD), is limited to two matrices. We mathematically define a higher-order GSVD (HO GSVD) for N≥2 matrices [Image: see text], each with full column rank. Each matrix is exactly factored as D(i) = U(i)Σ(i)V(T), where V, identical in all factorizations, is obtained from the eigensystem SV = VΛ of the arithmetic mean S of all pairwise quotients [Image: see text] of the matrices [Image: see text], i≠j. We prove that this decomposition extends to higher orders almost all of the mathematical properties of the GSVD. The matrix S is nondefective with V and Λ real. Its eigenvalues satisfy λ(k)≥1. Equality holds if and only if the corresponding eigenvector v(k) is a right basis vector of equal significance in all matrices D(i) and D(j), that is σ(i,k)/σ(j,k) = 1 for all i and j, and the corresponding left basis vector u(i,k) is orthogonal to all other vectors in U(i) for all i. The eigenvalues λ(k) = 1, therefore, define the “common HO GSVD subspace.” We illustrate the HO GSVD with a comparison of genome-scale cell-cycle mRNA expression from S. pombe, S. cerevisiae and human. Unlike existing algorithms, a mapping among the genes of these disparate organisms is not required. We find that the approximately common HO GSVD subspace represents the cell-cycle mRNA expression oscillations, which are similar among the datasets. Simultaneous reconstruction in the common subspace, therefore, removes the experimental artifacts, which are dissimilar, from the datasets. In the simultaneous sequence-independent classification of the genes of the three organisms in this common subspace, genes of highly conserved sequences but significantly different cell-cycle peak times are correctly classified. Public Library of Science 2011-12-22 /pmc/articles/PMC3245232/ /pubmed/22216090 http://dx.doi.org/10.1371/journal.pone.0028072 Text en Ponnapalli et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ponnapalli, Sri Priya
Saunders, Michael A.
Van Loan, Charles F.
Alter, Orly
A Higher-Order Generalized Singular Value Decomposition for Comparison of Global mRNA Expression from Multiple Organisms
title A Higher-Order Generalized Singular Value Decomposition for Comparison of Global mRNA Expression from Multiple Organisms
title_full A Higher-Order Generalized Singular Value Decomposition for Comparison of Global mRNA Expression from Multiple Organisms
title_fullStr A Higher-Order Generalized Singular Value Decomposition for Comparison of Global mRNA Expression from Multiple Organisms
title_full_unstemmed A Higher-Order Generalized Singular Value Decomposition for Comparison of Global mRNA Expression from Multiple Organisms
title_short A Higher-Order Generalized Singular Value Decomposition for Comparison of Global mRNA Expression from Multiple Organisms
title_sort higher-order generalized singular value decomposition for comparison of global mrna expression from multiple organisms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3245232/
https://www.ncbi.nlm.nih.gov/pubmed/22216090
http://dx.doi.org/10.1371/journal.pone.0028072
work_keys_str_mv AT ponnapallisripriya ahigherordergeneralizedsingularvaluedecompositionforcomparisonofglobalmrnaexpressionfrommultipleorganisms
AT saundersmichaela ahigherordergeneralizedsingularvaluedecompositionforcomparisonofglobalmrnaexpressionfrommultipleorganisms
AT vanloancharlesf ahigherordergeneralizedsingularvaluedecompositionforcomparisonofglobalmrnaexpressionfrommultipleorganisms
AT alterorly ahigherordergeneralizedsingularvaluedecompositionforcomparisonofglobalmrnaexpressionfrommultipleorganisms
AT ponnapallisripriya higherordergeneralizedsingularvaluedecompositionforcomparisonofglobalmrnaexpressionfrommultipleorganisms
AT saundersmichaela higherordergeneralizedsingularvaluedecompositionforcomparisonofglobalmrnaexpressionfrommultipleorganisms
AT vanloancharlesf higherordergeneralizedsingularvaluedecompositionforcomparisonofglobalmrnaexpressionfrommultipleorganisms
AT alterorly higherordergeneralizedsingularvaluedecompositionforcomparisonofglobalmrnaexpressionfrommultipleorganisms