Cargando…
Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression
Accurate differential analysis of microarray data strongly depends on effective treatment of intergene correlation. Such dependence is ordinarily accounted for in terms of its effect on significance cutoffs. In this paper, it is shown that correlation can, in fact, be exploited to share information...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393076/ https://www.ncbi.nlm.nih.gov/pubmed/25937946 http://dx.doi.org/10.1155/2013/404717 |
_version_ | 1782366095956508672 |
---|---|
author | Deller, J. R. Radha, Hayder McCormick, J. Justin |
author_facet | Deller, J. R. Radha, Hayder McCormick, J. Justin |
author_sort | Deller, J. R. |
collection | PubMed |
description | Accurate differential analysis of microarray data strongly depends on effective treatment of intergene correlation. Such dependence is ordinarily accounted for in terms of its effect on significance cutoffs. In this paper, it is shown that correlation can, in fact, be exploited to share information across tests and reorder expression differentials for increased statistical power, regardless of the threshold. Significantly improved differential analysis is the result of two simple measures: (i) adjusting test statistics to exploit information from identifiable genes (the large subset of genes represented on a microarray that can be classified a priori as nondifferential with very high confidence], but (ii) doing so in a way that accounts for linear dependencies among identifiable and nonidentifiable genes. A method is developed that builds upon the widely used two-sample t-statistic approach and uses analysis in Hilbert space to decompose the nonidentified gene vector into two components that are correlated and uncorrelated with the identified set. In the application to data derived from a widely studied prostate cancer database, the proposed method outperforms some of the most highly regarded approaches published to date. Algorithms in MATLAB and in R are available for public download. |
format | Online Article Text |
id | pubmed-4393076 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-43930762015-05-03 Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression Deller, J. R. Radha, Hayder McCormick, J. Justin ISRN Bioinform Research Article Accurate differential analysis of microarray data strongly depends on effective treatment of intergene correlation. Such dependence is ordinarily accounted for in terms of its effect on significance cutoffs. In this paper, it is shown that correlation can, in fact, be exploited to share information across tests and reorder expression differentials for increased statistical power, regardless of the threshold. Significantly improved differential analysis is the result of two simple measures: (i) adjusting test statistics to exploit information from identifiable genes (the large subset of genes represented on a microarray that can be classified a priori as nondifferential with very high confidence], but (ii) doing so in a way that accounts for linear dependencies among identifiable and nonidentifiable genes. A method is developed that builds upon the widely used two-sample t-statistic approach and uses analysis in Hilbert space to decompose the nonidentified gene vector into two components that are correlated and uncorrelated with the identified set. In the application to data derived from a widely studied prostate cancer database, the proposed method outperforms some of the most highly regarded approaches published to date. Algorithms in MATLAB and in R are available for public download. Hindawi Publishing Corporation 2013-06-03 /pmc/articles/PMC4393076/ /pubmed/25937946 http://dx.doi.org/10.1155/2013/404717 Text en Copyright © 2013 J. R. Deller Jr. et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Deller, J. R. Radha, Hayder McCormick, J. Justin Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression |
title | Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression |
title_full | Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression |
title_fullStr | Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression |
title_full_unstemmed | Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression |
title_short | Exploiting Identifiability and Intergene Correlation for Improved Detection of Differential Expression |
title_sort | exploiting identifiability and intergene correlation for improved detection of differential expression |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393076/ https://www.ncbi.nlm.nih.gov/pubmed/25937946 http://dx.doi.org/10.1155/2013/404717 |
work_keys_str_mv | AT dellerjr exploitingidentifiabilityandintergenecorrelationforimproveddetectionofdifferentialexpression AT radhahayder exploitingidentifiabilityandintergenecorrelationforimproveddetectionofdifferentialexpression AT mccormickjjustin exploitingidentifiabilityandintergenecorrelationforimproveddetectionofdifferentialexpression |