Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition

The detection of genes that show similar profiles under different experimental conditions is often an initial step in inferring the biological significance of such genes. Visualization tools are used to identify genes with similar profiles in microarray studies. Given the large number of genes recor...

Bibliographic Details
Main Authors: Reverter, Ferran, Vegas, Esteban, Sánchez, Pedro
Format: Online Article Text
Language: English
Published: Elsevier 2010
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054124/
https://www.ncbi.nlm.nih.gov/pubmed/20970748
http://dx.doi.org/10.1016/S1672-0229(10)60022-8
author Reverter, Ferran
Vegas, Esteban
Sánchez, Pedro
collection PubMed
description The detection of genes that show similar profiles under different experimental conditions is often an initial step in inferring the biological significance of such genes. Visualization tools are used to identify genes with similar profiles in microarray studies. Given the large number of genes recorded in microarray experiments, gene expression data are generally displayed on a low dimensional plot, based on linear methods. However, microarray data show nonlinearity, due to high-order terms of interaction between genes, so alternative approaches, such as kernel methods, may be more appropriate. We introduce a technique that combines kernel principal component analysis (KPCA) and Biplot to visualize gene expression profiles. Our approach relies on the singular value decomposition of the input matrix and incorporates an additional step that involves KPCA. The main properties of our method are the extraction of nonlinear features and the preservation of the input variables (genes) in the output display. We apply this algorithm to colon tumor, leukemia and lymphoma datasets. Our approach reveals the underlying structure of the gene expression profiles and provides a more intuitive understanding of the gene and sample association.
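The two building blocks named in the description can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' algorithm: it computes ordinary SVD biplot coordinates and standard kernel PCA scores separately, and does not reproduce how the paper combines them to keep genes visible in the nonlinear display. The function names (`rbf_kernel`, `kernel_pca`, `linear_biplot`), the choice of a Gaussian (RBF) kernel, the `gamma` value, and the synthetic data are all assumptions made for illustration; the record does not state which kernel or parameters were used.

```python
import numpy as np


def rbf_kernel(X, gamma):
    """Gaussian (RBF) kernel matrix between the rows of X (assumed kernel choice)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    # Clip tiny negative values caused by floating-point rounding.
    return np.exp(-gamma * np.maximum(d2, 0.0))


def kernel_pca(X, n_components=2, gamma=0.01):
    """Standard kernel PCA: eigendecomposition of the double-centered kernel matrix."""
    n = X.shape[0]
    K = rbf_kernel(X, gamma)
    J = np.eye(n) - np.ones((n, n)) / n      # centering matrix
    Kc = J @ K @ J                           # kernel centered in feature space
    eigvals, eigvecs = np.linalg.eigh(Kc)    # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]
    lam = eigvals[order]
    V = eigvecs[:, order]
    # Sample scores on the leading nonlinear components.
    scores = V * np.sqrt(np.maximum(lam, 0.0))
    return scores, lam


def linear_biplot(X, n_components=2):
    """Classical biplot coordinates from the SVD of the column-centered matrix."""
    Xc = X - X.mean(axis=0)
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    samples = U[:, :n_components] * s[:n_components]   # row (sample) markers
    genes = Vt[:n_components].T                        # column (gene) markers
    return samples, genes


# Synthetic stand-in for an expression matrix: 12 samples x 30 genes.
rng = np.random.default_rng(0)
X = rng.normal(size=(12, 30))

kpca_scores, kpca_eigvals = kernel_pca(X, n_components=2, gamma=0.01)
sample_xy, gene_xy = linear_biplot(X, n_components=2)
print(kpca_scores.shape, sample_xy.shape, gene_xy.shape)
```

In the linear biplot, the product of the sample and gene markers approximates the centered data matrix, which is what lets both be read off the same plot. Kernel PCA on its own yields only sample scores, which is the gap the described method addresses.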
format Online
Article
Text
id pubmed-5054124
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-5054124 2016-10-14 Genomics Proteomics Bioinformatics Method Elsevier 2010-09 2010-10-21 /pmc/articles/PMC5054124/ /pubmed/20970748 http://dx.doi.org/10.1016/S1672-0229(10)60022-8 Text en © 2010 Beijing Institute of Genomics. This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/).
title Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054124/
https://www.ncbi.nlm.nih.gov/pubmed/20970748
http://dx.doi.org/10.1016/S1672-0229(10)60022-8