Cargando…
A factor analysis model for functional genomics
BACKGROUND: Expression array data are used to predict biological functions of uncharacterized genes by comparing their expression profiles to those of characterized genes. While biologically plausible, this is both statistically and computationally challenging. Typical approaches are computationally...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1468435/ https://www.ncbi.nlm.nih.gov/pubmed/16630343 http://dx.doi.org/10.1186/1471-2105-7-216 |
_version_ | 1782127564393807872 |
---|---|
author | Kustra, Rafal Shioda, Romy Zhu, Mu |
author_facet | Kustra, Rafal Shioda, Romy Zhu, Mu |
author_sort | Kustra, Rafal |
collection | PubMed |
description | BACKGROUND: Expression array data are used to predict biological functions of uncharacterized genes by comparing their expression profiles to those of characterized genes. While biologically plausible, this is both statistically and computationally challenging. Typical approaches are computationally expensive and ignore correlations among expression profiles and functional categories. RESULTS: We propose a factor analysis model (FAM) for functional genomics and give a two-step algorithm, using genome-wide expression data for yeast and a subset of Gene-Ontology Biological Process functional annotations. We show that the predictive performance of our method is comparable to the current best approach while our total computation time was faster by a factor of 4000. We discuss the unique challenges in performance evaluation of algorithms used for genome-wide functions genomics. Finally, we discuss extensions to our method that can incorporate the inherent correlation structure of the functional categories to further improve predictive performance. CONCLUSION: Our factor analysis model is a computationally efficient technique for functional genomics and provides a clear and unified statistical framework with potential for incorporating important gene ontology information to improve predictions. |
format | Text |
id | pubmed-1468435 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-14684352006-06-07 A factor analysis model for functional genomics Kustra, Rafal Shioda, Romy Zhu, Mu BMC Bioinformatics Methodology Article BACKGROUND: Expression array data are used to predict biological functions of uncharacterized genes by comparing their expression profiles to those of characterized genes. While biologically plausible, this is both statistically and computationally challenging. Typical approaches are computationally expensive and ignore correlations among expression profiles and functional categories. RESULTS: We propose a factor analysis model (FAM) for functional genomics and give a two-step algorithm, using genome-wide expression data for yeast and a subset of Gene-Ontology Biological Process functional annotations. We show that the predictive performance of our method is comparable to the current best approach while our total computation time was faster by a factor of 4000. We discuss the unique challenges in performance evaluation of algorithms used for genome-wide functions genomics. Finally, we discuss extensions to our method that can incorporate the inherent correlation structure of the functional categories to further improve predictive performance. CONCLUSION: Our factor analysis model is a computationally efficient technique for functional genomics and provides a clear and unified statistical framework with potential for incorporating important gene ontology information to improve predictions. BioMed Central 2006-04-21 /pmc/articles/PMC1468435/ /pubmed/16630343 http://dx.doi.org/10.1186/1471-2105-7-216 Text en Copyright © 2006 Kustra et al; licensee BioMed Central Ltd. |
spellingShingle | Methodology Article Kustra, Rafal Shioda, Romy Zhu, Mu A factor analysis model for functional genomics |
title | A factor analysis model for functional genomics |
title_full | A factor analysis model for functional genomics |
title_fullStr | A factor analysis model for functional genomics |
title_full_unstemmed | A factor analysis model for functional genomics |
title_short | A factor analysis model for functional genomics |
title_sort | factor analysis model for functional genomics |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1468435/ https://www.ncbi.nlm.nih.gov/pubmed/16630343 http://dx.doi.org/10.1186/1471-2105-7-216 |
work_keys_str_mv | AT kustrarafal afactoranalysismodelforfunctionalgenomics AT shiodaromy afactoranalysismodelforfunctionalgenomics AT zhumu afactoranalysismodelforfunctionalgenomics AT kustrarafal factoranalysismodelforfunctionalgenomics AT shiodaromy factoranalysismodelforfunctionalgenomics AT zhumu factoranalysismodelforfunctionalgenomics |