Cargando…

Evaluation of Gene Association Methods for Coexpression Network Construction and Biological Knowledge Discovery

BACKGROUND: Constructing coexpression networks and performing network analysis using large-scale gene expression data sets is an effective way to uncover new biological knowledge; however, the methods used for gene association in constructing these coexpression networks have not been thoroughly eval...

Descripción completa

Detalles Bibliográficos
Autores principales: Kumari, Sapna, Nie, Jeff, Chen, Huann-Sheng, Ma, Hao, Stewart, Ron, Li, Xiang, Lu, Meng-Zhu, Taylor, William M., Wei, Hairong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3511551/
https://www.ncbi.nlm.nih.gov/pubmed/23226279
http://dx.doi.org/10.1371/journal.pone.0050411
_version_ 1782251635015155712
author Kumari, Sapna
Nie, Jeff
Chen, Huann-Sheng
Ma, Hao
Stewart, Ron
Li, Xiang
Lu, Meng-Zhu
Taylor, William M.
Wei, Hairong
author_facet Kumari, Sapna
Nie, Jeff
Chen, Huann-Sheng
Ma, Hao
Stewart, Ron
Li, Xiang
Lu, Meng-Zhu
Taylor, William M.
Wei, Hairong
author_sort Kumari, Sapna
collection PubMed
description BACKGROUND: Constructing coexpression networks and performing network analysis using large-scale gene expression data sets is an effective way to uncover new biological knowledge; however, the methods used for gene association in constructing these coexpression networks have not been thoroughly evaluated. Since different methods lead to structurally different coexpression networks and provide different information, selecting the optimal gene association method is critical. METHODS AND RESULTS: In this study, we compared eight gene association methods – Spearman rank correlation, Weighted Rank Correlation, Kendall, Hoeffding's D measure, Theil-Sen, Rank Theil-Sen, Distance Covariance, and Pearson – and focused on their true knowledge discovery rates in associating pathway genes and construction coordination networks of regulatory genes. We also examined the behaviors of different methods to microarray data with different properties, and whether the biological processes affect the efficiency of different methods. CONCLUSIONS: We found that the Spearman, Hoeffding and Kendall methods are effective in identifying coexpressed pathway genes, whereas the Theil-sen, Rank Theil-Sen, Spearman, and Weighted Rank methods perform well in identifying coordinated transcription factors that control the same biological processes and traits. Surprisingly, the widely used Pearson method is generally less efficient, and so is the Distance Covariance method that can find gene pairs of multiple relationships. Some analyses we did clearly show Pearson and Distance Covariance methods have distinct behaviors as compared to all other six methods. The efficiencies of different methods vary with the data properties to some degree and are largely contingent upon the biological processes, which necessitates the pre-analysis to identify the best performing method for gene association and coexpression network construction.
format Online
Article
Text
id pubmed-3511551
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35115512012-12-05 Evaluation of Gene Association Methods for Coexpression Network Construction and Biological Knowledge Discovery Kumari, Sapna Nie, Jeff Chen, Huann-Sheng Ma, Hao Stewart, Ron Li, Xiang Lu, Meng-Zhu Taylor, William M. Wei, Hairong PLoS One Research Article BACKGROUND: Constructing coexpression networks and performing network analysis using large-scale gene expression data sets is an effective way to uncover new biological knowledge; however, the methods used for gene association in constructing these coexpression networks have not been thoroughly evaluated. Since different methods lead to structurally different coexpression networks and provide different information, selecting the optimal gene association method is critical. METHODS AND RESULTS: In this study, we compared eight gene association methods – Spearman rank correlation, Weighted Rank Correlation, Kendall, Hoeffding's D measure, Theil-Sen, Rank Theil-Sen, Distance Covariance, and Pearson – and focused on their true knowledge discovery rates in associating pathway genes and construction coordination networks of regulatory genes. We also examined the behaviors of different methods to microarray data with different properties, and whether the biological processes affect the efficiency of different methods. CONCLUSIONS: We found that the Spearman, Hoeffding and Kendall methods are effective in identifying coexpressed pathway genes, whereas the Theil-sen, Rank Theil-Sen, Spearman, and Weighted Rank methods perform well in identifying coordinated transcription factors that control the same biological processes and traits. Surprisingly, the widely used Pearson method is generally less efficient, and so is the Distance Covariance method that can find gene pairs of multiple relationships. Some analyses we did clearly show Pearson and Distance Covariance methods have distinct behaviors as compared to all other six methods. The efficiencies of different methods vary with the data properties to some degree and are largely contingent upon the biological processes, which necessitates the pre-analysis to identify the best performing method for gene association and coexpression network construction. Public Library of Science 2012-11-30 /pmc/articles/PMC3511551/ /pubmed/23226279 http://dx.doi.org/10.1371/journal.pone.0050411 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Kumari, Sapna
Nie, Jeff
Chen, Huann-Sheng
Ma, Hao
Stewart, Ron
Li, Xiang
Lu, Meng-Zhu
Taylor, William M.
Wei, Hairong
Evaluation of Gene Association Methods for Coexpression Network Construction and Biological Knowledge Discovery
title Evaluation of Gene Association Methods for Coexpression Network Construction and Biological Knowledge Discovery
title_full Evaluation of Gene Association Methods for Coexpression Network Construction and Biological Knowledge Discovery
title_fullStr Evaluation of Gene Association Methods for Coexpression Network Construction and Biological Knowledge Discovery
title_full_unstemmed Evaluation of Gene Association Methods for Coexpression Network Construction and Biological Knowledge Discovery
title_short Evaluation of Gene Association Methods for Coexpression Network Construction and Biological Knowledge Discovery
title_sort evaluation of gene association methods for coexpression network construction and biological knowledge discovery
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3511551/
https://www.ncbi.nlm.nih.gov/pubmed/23226279
http://dx.doi.org/10.1371/journal.pone.0050411
work_keys_str_mv AT kumarisapna evaluationofgeneassociationmethodsforcoexpressionnetworkconstructionandbiologicalknowledgediscovery
AT niejeff evaluationofgeneassociationmethodsforcoexpressionnetworkconstructionandbiologicalknowledgediscovery
AT chenhuannsheng evaluationofgeneassociationmethodsforcoexpressionnetworkconstructionandbiologicalknowledgediscovery
AT mahao evaluationofgeneassociationmethodsforcoexpressionnetworkconstructionandbiologicalknowledgediscovery
AT stewartron evaluationofgeneassociationmethodsforcoexpressionnetworkconstructionandbiologicalknowledgediscovery
AT lixiang evaluationofgeneassociationmethodsforcoexpressionnetworkconstructionandbiologicalknowledgediscovery
AT lumengzhu evaluationofgeneassociationmethodsforcoexpressionnetworkconstructionandbiologicalknowledgediscovery
AT taylorwilliamm evaluationofgeneassociationmethodsforcoexpressionnetworkconstructionandbiologicalknowledgediscovery
AT weihairong evaluationofgeneassociationmethodsforcoexpressionnetworkconstructionandbiologicalknowledgediscovery