Cargando…

Vector algebra in the analysis of genome-wide expression data

BACKGROUND: Data from thousands of transcription-profiling experiments in organisms ranging from yeast to humans are now publicly available. How best to analyze these data remains an important challenge. A variety of tools have been used for this purpose, including hierarchical clustering, self-orga...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuruvilla, Finny G, Park, Peter J, Schreiber, Stuart L
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2002
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC88809/
https://www.ncbi.nlm.nih.gov/pubmed/11897023
_version_ 1782120187279966208
author Kuruvilla, Finny G
Park, Peter J
Schreiber, Stuart L
author_facet Kuruvilla, Finny G
Park, Peter J
Schreiber, Stuart L
author_sort Kuruvilla, Finny G
collection PubMed
description BACKGROUND: Data from thousands of transcription-profiling experiments in organisms ranging from yeast to humans are now publicly available. How best to analyze these data remains an important challenge. A variety of tools have been used for this purpose, including hierarchical clustering, self-organizing maps and principal components analysis. In particular, concepts from vector algebra have proven useful in the study of genome-wide expression data. RESULTS: Here we present a framework based on vector algebra for the analysis of transcription profiles that is geometrically intuitive and computationally efficient. Concepts in vector algebra such as angles, magnitudes, subspaces, singular value decomposition, bases and projections have natural and powerful interpretations in the analysis of microarray data. Angles in particular offer a rigorous method of defining 'similarity' and are useful in evaluating the claims of a microarray-based study. We present a sample analysis of cells treated with rapamycin, an immunosuppressant whose effects have been extensively studied with microarrays. In addition, the algebraic concept of a basis for a space affords the opportunity to simplify data analysis and uncover a limited number of expression vectors to span the transcriptional range of cell behavior. CONCLUSIONS: This framework represents a compact, powerful and scalable construction for analysis and computation. As the amount of microarray data in the public domain grows, these vector-based methods are relevant in determining statistical significance. These approaches are also well suited to extract biologically meaningful information in the analysis of signaling networks.
format Text
id pubmed-88809
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-888092002-03-18 Vector algebra in the analysis of genome-wide expression data Kuruvilla, Finny G Park, Peter J Schreiber, Stuart L Genome Biol Research BACKGROUND: Data from thousands of transcription-profiling experiments in organisms ranging from yeast to humans are now publicly available. How best to analyze these data remains an important challenge. A variety of tools have been used for this purpose, including hierarchical clustering, self-organizing maps and principal components analysis. In particular, concepts from vector algebra have proven useful in the study of genome-wide expression data. RESULTS: Here we present a framework based on vector algebra for the analysis of transcription profiles that is geometrically intuitive and computationally efficient. Concepts in vector algebra such as angles, magnitudes, subspaces, singular value decomposition, bases and projections have natural and powerful interpretations in the analysis of microarray data. Angles in particular offer a rigorous method of defining 'similarity' and are useful in evaluating the claims of a microarray-based study. We present a sample analysis of cells treated with rapamycin, an immunosuppressant whose effects have been extensively studied with microarrays. In addition, the algebraic concept of a basis for a space affords the opportunity to simplify data analysis and uncover a limited number of expression vectors to span the transcriptional range of cell behavior. CONCLUSIONS: This framework represents a compact, powerful and scalable construction for analysis and computation. As the amount of microarray data in the public domain grows, these vector-based methods are relevant in determining statistical significance. These approaches are also well suited to extract biologically meaningful information in the analysis of signaling networks. BioMed Central 2002 2002-02-13 /pmc/articles/PMC88809/ /pubmed/11897023 Text en Copyright © 2002 Kuruvilla et al., licensee BioMed Central Ltd
spellingShingle Research
Kuruvilla, Finny G
Park, Peter J
Schreiber, Stuart L
Vector algebra in the analysis of genome-wide expression data
title Vector algebra in the analysis of genome-wide expression data
title_full Vector algebra in the analysis of genome-wide expression data
title_fullStr Vector algebra in the analysis of genome-wide expression data
title_full_unstemmed Vector algebra in the analysis of genome-wide expression data
title_short Vector algebra in the analysis of genome-wide expression data
title_sort vector algebra in the analysis of genome-wide expression data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC88809/
https://www.ncbi.nlm.nih.gov/pubmed/11897023
work_keys_str_mv AT kuruvillafinnyg vectoralgebraintheanalysisofgenomewideexpressiondata
AT parkpeterj vectoralgebraintheanalysisofgenomewideexpressiondata
AT schreiberstuartl vectoralgebraintheanalysisofgenomewideexpressiondata