Cargando…

Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression

Accurate differential analysis of microarray data strongly depends on effective treatment of intergene correlation. Such dependence is ordinarily accounted for in terms of its effect on significance cutoffs. In this paper, it is shown that correlation can, in fact, be exploited to share information...

Descripción completa

Detalles Bibliográficos
Autores principales: Deller, J. R., Radha, Hayder, McCormick, J. Justin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393076/
https://www.ncbi.nlm.nih.gov/pubmed/25937946
http://dx.doi.org/10.1155/2013/404717
_version_ 1782366095956508672
author Deller, J. R.
Radha, Hayder
McCormick, J. Justin
author_facet Deller, J. R.
Radha, Hayder
McCormick, J. Justin
author_sort Deller, J. R.
collection PubMed
description Accurate differential analysis of microarray data strongly depends on effective treatment of intergene correlation. Such dependence is ordinarily accounted for in terms of its effect on significance cutoffs. In this paper, it is shown that correlation can, in fact, be exploited to share information across tests and reorder expression differentials for increased statistical power, regardless of the threshold. Significantly improved differential analysis is the result of two simple measures: (i) adjusting test statistics to exploit information from identifiable genes (the large subset of genes represented on a microarray that can be classified a priori as nondifferential with very high confidence], but (ii) doing so in a way that accounts for linear dependencies among identifiable and nonidentifiable genes. A method is developed that builds upon the widely used two-sample t-statistic approach and uses analysis in Hilbert space to decompose the nonidentified gene vector into two components that are correlated and uncorrelated with the identified set. In the application to data derived from a widely studied prostate cancer database, the proposed method outperforms some of the most highly regarded approaches published to date. Algorithms in MATLAB and in R are available for public download.
format Online
Article
Text
id pubmed-4393076
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-43930762015-05-03 Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression Deller, J. R. Radha, Hayder McCormick, J. Justin ISRN Bioinform Research Article Accurate differential analysis of microarray data strongly depends on effective treatment of intergene correlation. Such dependence is ordinarily accounted for in terms of its effect on significance cutoffs. In this paper, it is shown that correlation can, in fact, be exploited to share information across tests and reorder expression differentials for increased statistical power, regardless of the threshold. Significantly improved differential analysis is the result of two simple measures: (i) adjusting test statistics to exploit information from identifiable genes (the large subset of genes represented on a microarray that can be classified a priori as nondifferential with very high confidence], but (ii) doing so in a way that accounts for linear dependencies among identifiable and nonidentifiable genes. A method is developed that builds upon the widely used two-sample t-statistic approach and uses analysis in Hilbert space to decompose the nonidentified gene vector into two components that are correlated and uncorrelated with the identified set. In the application to data derived from a widely studied prostate cancer database, the proposed method outperforms some of the most highly regarded approaches published to date. Algorithms in MATLAB and in R are available for public download. Hindawi Publishing Corporation 2013-06-03 /pmc/articles/PMC4393076/ /pubmed/25937946 http://dx.doi.org/10.1155/2013/404717 Text en Copyright © 2013 J. R. Deller Jr. et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Deller, J. R.
Radha, Hayder
McCormick, J. Justin
Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression
title Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression
title_full Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression
title_fullStr Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression
title_full_unstemmed Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression
title_short Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression
title_sort exploiting identifiability and intergene correlation for improved detection of differential expression
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393076/
https://www.ncbi.nlm.nih.gov/pubmed/25937946
http://dx.doi.org/10.1155/2013/404717
work_keys_str_mv AT dellerjr exploitingidentifiabilityandintergenecorrelationforimproveddetectionofdifferentialexpression
AT radhahayder exploitingidentifiabilityandintergenecorrelationforimproveddetectionofdifferentialexpression
AT mccormickjjustin exploitingidentifiabilityandintergenecorrelationforimproveddetectionofdifferentialexpression