Cargando…

svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification

BACKGROUND: Comparative analysis of gene expression profiling of multiple biological categories, such as different species of organisms or different kinds of tissue, promises to enhance the fundamental understanding of the universality as well as the specialization of mechanisms and related biologic...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Wensheng, Edwards, Andrea, Fan, Wei, Zhu, Dongxiao, Zhang, Kun
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2905369/
https://www.ncbi.nlm.nih.gov/pubmed/20565989
http://dx.doi.org/10.1186/1471-2105-11-338
_version_ 1782183956087570432
author Zhang, Wensheng
Edwards, Andrea
Fan, Wei
Zhu, Dongxiao
Zhang, Kun
author_facet Zhang, Wensheng
Edwards, Andrea
Fan, Wei
Zhu, Dongxiao
Zhang, Kun
author_sort Zhang, Wensheng
collection PubMed
description BACKGROUND: Comparative analysis of gene expression profiling of multiple biological categories, such as different species of organisms or different kinds of tissue, promises to enhance the fundamental understanding of the universality as well as the specialization of mechanisms and related biological themes. Grouping genes with a similar expression pattern or exhibiting co-expression together is a starting point in understanding and analyzing gene expression data. In recent literature, gene module level analysis is advocated in order to understand biological network design and system behaviors in disease and life processes; however, practical difficulties often lie in the implementation of existing methods. RESULTS: Using the singular value decomposition (SVD) technique, we developed a new computational tool, named svdPPCS (SVD-based Pattern Pairing and Chart Splitting), to identify conserved and divergent co-expression modules of two sets of microarray experiments. In the proposed methods, gene modules are identified by splitting the two-way chart coordinated with a pair of left singular vectors factorized from the gene expression matrices of the two biological categories. Importantly, the cutoffs are determined by a data-driven algorithm using the well-defined statistic, SVD-p. The implementation was illustrated on two time series microarray data sets generated from the samples of accessory gland (ACG) and malpighian tubule (MT) tissues of the line W(118 )of M. drosophila. Two conserved modules and six divergent modules, each of which has a unique characteristic profile across tissue kinds and aging processes, were identified. The number of genes contained in these models ranged from five to a few hundred. Three to over a hundred GO terms were over-represented in individual modules with FDR < 0.1. One divergent module suggested the tissue-specific relationship between the expressions of mitochondrion-related genes and the aging process. This finding, together with others, may be of biological significance. The validity of the proposed SVD-based method was further verified by a simulation study, as well as the comparisons with regression analysis and cubic spline regression analysis plus PAM based clustering. CONCLUSIONS: svdPPCS is a novel computational tool for the comparative analysis of transcriptional profiling. It especially fits the comparison of time series data of related organisms or different tissues of the same organism under equivalent or similar experimental conditions. The general scheme can be directly extended to the comparisons of multiple data sets. It also can be applied to the integration of data sets from different platforms and of different sources.
format Text
id pubmed-2905369
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29053692010-07-17 svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification Zhang, Wensheng Edwards, Andrea Fan, Wei Zhu, Dongxiao Zhang, Kun BMC Bioinformatics Methodology Article BACKGROUND: Comparative analysis of gene expression profiling of multiple biological categories, such as different species of organisms or different kinds of tissue, promises to enhance the fundamental understanding of the universality as well as the specialization of mechanisms and related biological themes. Grouping genes with a similar expression pattern or exhibiting co-expression together is a starting point in understanding and analyzing gene expression data. In recent literature, gene module level analysis is advocated in order to understand biological network design and system behaviors in disease and life processes; however, practical difficulties often lie in the implementation of existing methods. RESULTS: Using the singular value decomposition (SVD) technique, we developed a new computational tool, named svdPPCS (SVD-based Pattern Pairing and Chart Splitting), to identify conserved and divergent co-expression modules of two sets of microarray experiments. In the proposed methods, gene modules are identified by splitting the two-way chart coordinated with a pair of left singular vectors factorized from the gene expression matrices of the two biological categories. Importantly, the cutoffs are determined by a data-driven algorithm using the well-defined statistic, SVD-p. The implementation was illustrated on two time series microarray data sets generated from the samples of accessory gland (ACG) and malpighian tubule (MT) tissues of the line W(118 )of M. drosophila. Two conserved modules and six divergent modules, each of which has a unique characteristic profile across tissue kinds and aging processes, were identified. The number of genes contained in these models ranged from five to a few hundred. Three to over a hundred GO terms were over-represented in individual modules with FDR < 0.1. One divergent module suggested the tissue-specific relationship between the expressions of mitochondrion-related genes and the aging process. This finding, together with others, may be of biological significance. The validity of the proposed SVD-based method was further verified by a simulation study, as well as the comparisons with regression analysis and cubic spline regression analysis plus PAM based clustering. CONCLUSIONS: svdPPCS is a novel computational tool for the comparative analysis of transcriptional profiling. It especially fits the comparison of time series data of related organisms or different tissues of the same organism under equivalent or similar experimental conditions. The general scheme can be directly extended to the comparisons of multiple data sets. It also can be applied to the integration of data sets from different platforms and of different sources. BioMed Central 2010-06-22 /pmc/articles/PMC2905369/ /pubmed/20565989 http://dx.doi.org/10.1186/1471-2105-11-338 Text en Copyright ©2010 Zhang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Zhang, Wensheng
Edwards, Andrea
Fan, Wei
Zhu, Dongxiao
Zhang, Kun
svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification
title svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification
title_full svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification
title_fullStr svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification
title_full_unstemmed svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification
title_short svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification
title_sort svdppcs: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2905369/
https://www.ncbi.nlm.nih.gov/pubmed/20565989
http://dx.doi.org/10.1186/1471-2105-11-338
work_keys_str_mv AT zhangwensheng svdppcsaneffectivesingularvaluedecompositionbasedmethodforconservedanddivergentcoexpressiongenemoduleidentification
AT edwardsandrea svdppcsaneffectivesingularvaluedecompositionbasedmethodforconservedanddivergentcoexpressiongenemoduleidentification
AT fanwei svdppcsaneffectivesingularvaluedecompositionbasedmethodforconservedanddivergentcoexpressiongenemoduleidentification
AT zhudongxiao svdppcsaneffectivesingularvaluedecompositionbasedmethodforconservedanddivergentcoexpressiongenemoduleidentification
AT zhangkun svdppcsaneffectivesingularvaluedecompositionbasedmethodforconservedanddivergentcoexpressiongenemoduleidentification