Cargando…
A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments
BACKGROUND: Measurements on gene level are widely used to gain new insights in complex diseases e.g. cancer. A promising approach to understand basic biological mechanisms is to combine gene expression profiles and classical clinical parameters. However, the computation of a correlation coefficient...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3035591/ https://www.ncbi.nlm.nih.gov/pubmed/21247420 http://dx.doi.org/10.1186/1756-0381-4-2 |
_version_ | 1782197790978342912 |
---|---|
author | Martin, Christian W Tauchen, Anika Becker, Anke Nattkemper, Tim W |
author_facet | Martin, Christian W Tauchen, Anika Becker, Anke Nattkemper, Tim W |
author_sort | Martin, Christian W |
collection | PubMed |
description | BACKGROUND: Measurements on gene level are widely used to gain new insights in complex diseases e.g. cancer. A promising approach to understand basic biological mechanisms is to combine gene expression profiles and classical clinical parameters. However, the computation of a correlation coefficient between high-dimensional data and such parameters is not covered by traditional statistical methods. METHODS: We propose a novel index, the Normalized Tree Index (NTI), to compute a correlation coefficient between the clustering result of high-dimensional microarray data and nominal clinical parameters. The NTI detects correlations between hierarchically clustered microarray data and nominal clinical parameters (labels) and gives a measurement of significance in terms of an empiric p-value of the identified correlations. Therefore, the microarray data is clustered by hierarchical agglomerative clustering using standard settings. In a second step, the computed cluster tree is evaluated. For each label, a NTI is computed measuring the correlation between that label and the clustered microarray data. RESULTS: The NTI successfully identifies correlated clinical parameters at different levels of significance when applied on two real-world microarray breast cancer data sets. Some of the identified highly correlated labels confirm the actual state of knowledge whereas others help to identify new risk factors and provide a good basis to formulate new hypothesis. CONCLUSIONS: The NTI is a valuable tool in the domain of biomedical data analysis. It allows the identification of correlations between high-dimensional data and nominal labels, while at the same time a p-value measures the level of significance of the detected correlations. |
format | Text |
id | pubmed-3035591 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30355912011-02-17 A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments Martin, Christian W Tauchen, Anika Becker, Anke Nattkemper, Tim W BioData Min Research BACKGROUND: Measurements on gene level are widely used to gain new insights in complex diseases e.g. cancer. A promising approach to understand basic biological mechanisms is to combine gene expression profiles and classical clinical parameters. However, the computation of a correlation coefficient between high-dimensional data and such parameters is not covered by traditional statistical methods. METHODS: We propose a novel index, the Normalized Tree Index (NTI), to compute a correlation coefficient between the clustering result of high-dimensional microarray data and nominal clinical parameters. The NTI detects correlations between hierarchically clustered microarray data and nominal clinical parameters (labels) and gives a measurement of significance in terms of an empiric p-value of the identified correlations. Therefore, the microarray data is clustered by hierarchical agglomerative clustering using standard settings. In a second step, the computed cluster tree is evaluated. For each label, a NTI is computed measuring the correlation between that label and the clustered microarray data. RESULTS: The NTI successfully identifies correlated clinical parameters at different levels of significance when applied on two real-world microarray breast cancer data sets. Some of the identified highly correlated labels confirm the actual state of knowledge whereas others help to identify new risk factors and provide a good basis to formulate new hypothesis. CONCLUSIONS: The NTI is a valuable tool in the domain of biomedical data analysis. It allows the identification of correlations between high-dimensional data and nominal labels, while at the same time a p-value measures the level of significance of the detected correlations. BioMed Central 2011-01-19 /pmc/articles/PMC3035591/ /pubmed/21247420 http://dx.doi.org/10.1186/1756-0381-4-2 Text en Copyright ©2011 Martin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Martin, Christian W Tauchen, Anika Becker, Anke Nattkemper, Tim W A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments |
title | A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments |
title_full | A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments |
title_fullStr | A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments |
title_full_unstemmed | A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments |
title_short | A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments |
title_sort | normalized tree index for identification of correlated clinical parameters in microarray experiments |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3035591/ https://www.ncbi.nlm.nih.gov/pubmed/21247420 http://dx.doi.org/10.1186/1756-0381-4-2 |
work_keys_str_mv | AT martinchristianw anormalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments AT tauchenanika anormalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments AT beckeranke anormalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments AT nattkempertimw anormalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments AT martinchristianw normalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments AT tauchenanika normalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments AT beckeranke normalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments AT nattkempertimw normalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments |