Cargando…

Comparison of threshold selection methods for microarray gene co-expression matrices

BACKGROUND: Network and clustering analyses of microarray co-expression correlation data often require application of a threshold to discard small correlations, thus reducing computational demands and decreasing the number of uninformative correlations. This study investigated threshold selection in...

Descripción completa

Detalles Bibliográficos
Autores principales: Borate, Bhavesh R, Chesler, Elissa J, Langston, Michael A, Saxton, Arnold M, Voy, Brynn H
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2794870/
https://www.ncbi.nlm.nih.gov/pubmed/19954523
http://dx.doi.org/10.1186/1756-0500-2-240
_version_ 1782175407154397184
author Borate, Bhavesh R
Chesler, Elissa J
Langston, Michael A
Saxton, Arnold M
Voy, Brynn H
author_facet Borate, Bhavesh R
Chesler, Elissa J
Langston, Michael A
Saxton, Arnold M
Voy, Brynn H
author_sort Borate, Bhavesh R
collection PubMed
description BACKGROUND: Network and clustering analyses of microarray co-expression correlation data often require application of a threshold to discard small correlations, thus reducing computational demands and decreasing the number of uninformative correlations. This study investigated threshold selection in the context of combinatorial network analysis of transcriptome data. FINDINGS: Six conceptually diverse methods - based on number of maximal cliques, correlation of control spots with expressed genes, top 1% of correlations, spectral graph clustering, Bonferroni correction of p-values, and statistical power - were used to estimate a correlation threshold for three time-series microarray datasets. The validity of thresholds was tested by comparison to thresholds derived from Gene Ontology information. Stability and reliability of the best methods were evaluated with block bootstrapping. Two threshold methods, number of maximal cliques and spectral graph, used information in the correlation matrix structure and performed well in terms of stability. Comparison to Gene Ontology found thresholds from number of maximal cliques extracted from a co-expression matrix were the most biologically valid. Approaches to improve both methods were suggested. CONCLUSION: Threshold selection approaches based on network structure of gene relationships gave thresholds with greater relevance to curated biological relationships than approaches based on statistical pair-wise relationships.
format Text
id pubmed-2794870
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27948702009-12-17 Comparison of threshold selection methods for microarray gene co-expression matrices Borate, Bhavesh R Chesler, Elissa J Langston, Michael A Saxton, Arnold M Voy, Brynn H BMC Res Notes Short Report BACKGROUND: Network and clustering analyses of microarray co-expression correlation data often require application of a threshold to discard small correlations, thus reducing computational demands and decreasing the number of uninformative correlations. This study investigated threshold selection in the context of combinatorial network analysis of transcriptome data. FINDINGS: Six conceptually diverse methods - based on number of maximal cliques, correlation of control spots with expressed genes, top 1% of correlations, spectral graph clustering, Bonferroni correction of p-values, and statistical power - were used to estimate a correlation threshold for three time-series microarray datasets. The validity of thresholds was tested by comparison to thresholds derived from Gene Ontology information. Stability and reliability of the best methods were evaluated with block bootstrapping. Two threshold methods, number of maximal cliques and spectral graph, used information in the correlation matrix structure and performed well in terms of stability. Comparison to Gene Ontology found thresholds from number of maximal cliques extracted from a co-expression matrix were the most biologically valid. Approaches to improve both methods were suggested. CONCLUSION: Threshold selection approaches based on network structure of gene relationships gave thresholds with greater relevance to curated biological relationships than approaches based on statistical pair-wise relationships. BioMed Central 2009-12-02 /pmc/articles/PMC2794870/ /pubmed/19954523 http://dx.doi.org/10.1186/1756-0500-2-240 Text en Copyright ©2009 Saxton et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Short Report
Borate, Bhavesh R
Chesler, Elissa J
Langston, Michael A
Saxton, Arnold M
Voy, Brynn H
Comparison of threshold selection methods for microarray gene co-expression matrices
title Comparison of threshold selection methods for microarray gene co-expression matrices
title_full Comparison of threshold selection methods for microarray gene co-expression matrices
title_fullStr Comparison of threshold selection methods for microarray gene co-expression matrices
title_full_unstemmed Comparison of threshold selection methods for microarray gene co-expression matrices
title_short Comparison of threshold selection methods for microarray gene co-expression matrices
title_sort comparison of threshold selection methods for microarray gene co-expression matrices
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2794870/
https://www.ncbi.nlm.nih.gov/pubmed/19954523
http://dx.doi.org/10.1186/1756-0500-2-240
work_keys_str_mv AT boratebhaveshr comparisonofthresholdselectionmethodsformicroarraygenecoexpressionmatrices
AT cheslerelissaj comparisonofthresholdselectionmethodsformicroarraygenecoexpressionmatrices
AT langstonmichaela comparisonofthresholdselectionmethodsformicroarraygenecoexpressionmatrices
AT saxtonarnoldm comparisonofthresholdselectionmethodsformicroarraygenecoexpressionmatrices
AT voybrynnh comparisonofthresholdselectionmethodsformicroarraygenecoexpressionmatrices