Cargando…
Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition
The detection of genes that show similar profiles under different experimental conditions is often an initial step in inferring the biological significance of such genes. Visualization tools are used to identify genes with similar profiles in microarray studies. Given the large number of genes recor...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054124/ https://www.ncbi.nlm.nih.gov/pubmed/20970748 http://dx.doi.org/10.1016/S1672-0229(10)60022-8 |
_version_ | 1782458532089561088 |
---|---|
author | Reverter, Ferran Vegas, Esteban Sánchez, Pedro |
author_facet | Reverter, Ferran Vegas, Esteban Sánchez, Pedro |
author_sort | Reverter, Ferran |
collection | PubMed |
description | The detection of genes that show similar profiles under different experimental conditions is often an initial step in inferring the biological significance of such genes. Visualization tools are used to identify genes with similar profiles in microarray studies. Given the large number of genes recorded in microarray experiments, gene expression data are generally displayed on a low dimensional plot, based on linear methods. However, microarray data show nonlinearity, due to high-order terms of interaction between genes, so alternative approaches, such as kernel methods, may be more appropriate. We introduce a technique that combines kernel principal component analysis (KPCA) and Biplot to visualize gene expression profiles. Our approach relies on the singular value decomposition of the input matrix and incorporates an additional step that involves KPCA. The main properties of our method are the extraction of nonlinear features and the preservation of the input variables (genes) in the output display. We apply this algorithm to colon tumor, leukemia and lymphoma datasets. Our approach reveals the underlying structure of the gene expression profiles and provides a more intuitive understanding of the gene and sample association. |
format | Online Article Text |
id | pubmed-5054124 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-50541242016-10-14 Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition Reverter, Ferran Vegas, Esteban Sánchez, Pedro Genomics Proteomics Bioinformatics Method The detection of genes that show similar profiles under different experimental conditions is often an initial step in inferring the biological significance of such genes. Visualization tools are used to identify genes with similar profiles in microarray studies. Given the large number of genes recorded in microarray experiments, gene expression data are generally displayed on a low dimensional plot, based on linear methods. However, microarray data show nonlinearity, due to high-order terms of interaction between genes, so alternative approaches, such as kernel methods, may be more appropriate. We introduce a technique that combines kernel principal component analysis (KPCA) and Biplot to visualize gene expression profiles. Our approach relies on the singular value decomposition of the input matrix and incorporates an additional step that involves KPCA. The main properties of our method are the extraction of nonlinear features and the preservation of the input variables (genes) in the output display. We apply this algorithm to colon tumor, leukemia and lymphoma datasets. Our approach reveals the underlying structure of the gene expression profiles and provides a more intuitive understanding of the gene and sample association. Elsevier 2010-09 2010-10-21 /pmc/articles/PMC5054124/ /pubmed/20970748 http://dx.doi.org/10.1016/S1672-0229(10)60022-8 Text en © 2010 Beijing Institute of Genomics http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/). |
spellingShingle | Method Reverter, Ferran Vegas, Esteban Sánchez, Pedro Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition |
title | Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition |
title_full | Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition |
title_fullStr | Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition |
title_full_unstemmed | Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition |
title_short | Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition |
title_sort | mining gene expression profiles: an integrated implementation of kernel principal component analysis and singular value decomposition |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054124/ https://www.ncbi.nlm.nih.gov/pubmed/20970748 http://dx.doi.org/10.1016/S1672-0229(10)60022-8 |
work_keys_str_mv | AT reverterferran mininggeneexpressionprofilesanintegratedimplementationofkernelprincipalcomponentanalysisandsingularvaluedecomposition AT vegasesteban mininggeneexpressionprofilesanintegratedimplementationofkernelprincipalcomponentanalysisandsingularvaluedecomposition AT sanchezpedro mininggeneexpressionprofilesanintegratedimplementationofkernelprincipalcomponentanalysisandsingularvaluedecomposition |