Cargando…

Improve Survival Prediction Using Principal Components of Gene Expression Data

The purpose of many microarray studies is to find the association between gene expression and sample characteristics such as treatment type or sample phenotype. There has been a surge of efforts developing different methods for delineating the association. Aside from the high dimensionality of micro...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Yi-Jing, Huang, Shu-Guang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054035/
https://www.ncbi.nlm.nih.gov/pubmed/16970550
http://dx.doi.org/10.1016/S1672-0229(06)60022-3
_version_ 1782458511648620544
author Shen, Yi-Jing
Huang, Shu-Guang
author_facet Shen, Yi-Jing
Huang, Shu-Guang
author_sort Shen, Yi-Jing
collection PubMed
description The purpose of many microarray studies is to find the association between gene expression and sample characteristics such as treatment type or sample phenotype. There has been a surge of efforts developing different methods for delineating the association. Aside from the high dimensionality of microarray data, one well recognized challenge is the fact that genes could be complicatedly inter-related, thus making many statistical methods inappropriate to use directly on the expression data. Multivariate methods such as principal component analysis (PCA) and clustering are often used as a part of the effort to capture the gene correlation, and the derived components or clusters are used to describe the association between gene expression and sample phenotype. We propose a method for patient population dichotomization using maximally selected test statistics in combination with the PCA method, which shows favorable results. The proposed method is compared with a currently well-recognized method.
format Online
Article
Text
id pubmed-5054035
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-50540352016-10-14 Improve Survival Prediction Using Principal Components of Gene Expression Data Shen, Yi-Jing Huang, Shu-Guang Genomics Proteomics Bioinformatics Article The purpose of many microarray studies is to find the association between gene expression and sample characteristics such as treatment type or sample phenotype. There has been a surge of efforts developing different methods for delineating the association. Aside from the high dimensionality of microarray data, one well recognized challenge is the fact that genes could be complicatedly inter-related, thus making many statistical methods inappropriate to use directly on the expression data. Multivariate methods such as principal component analysis (PCA) and clustering are often used as a part of the effort to capture the gene correlation, and the derived components or clusters are used to describe the association between gene expression and sample phenotype. We propose a method for patient population dichotomization using maximally selected test statistics in combination with the PCA method, which shows favorable results. The proposed method is compared with a currently well-recognized method. Elsevier 2006 2006-08-22 /pmc/articles/PMC5054035/ /pubmed/16970550 http://dx.doi.org/10.1016/S1672-0229(06)60022-3 Text en © 2006 Beijing Institute of Genomics http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/).
spellingShingle Article
Shen, Yi-Jing
Huang, Shu-Guang
Improve Survival Prediction Using Principal Components of Gene Expression Data
title Improve Survival Prediction Using Principal Components of Gene Expression Data
title_full Improve Survival Prediction Using Principal Components of Gene Expression Data
title_fullStr Improve Survival Prediction Using Principal Components of Gene Expression Data
title_full_unstemmed Improve Survival Prediction Using Principal Components of Gene Expression Data
title_short Improve Survival Prediction Using Principal Components of Gene Expression Data
title_sort improve survival prediction using principal components of gene expression data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054035/
https://www.ncbi.nlm.nih.gov/pubmed/16970550
http://dx.doi.org/10.1016/S1672-0229(06)60022-3
work_keys_str_mv AT shenyijing improvesurvivalpredictionusingprincipalcomponentsofgeneexpressiondata
AT huangshuguang improvesurvivalpredictionusingprincipalcomponentsofgeneexpressiondata