Cargando…

A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments

BACKGROUND: Measurements on gene level are widely used to gain new insights in complex diseases e.g. cancer. A promising approach to understand basic biological mechanisms is to combine gene expression profiles and classical clinical parameters. However, the computation of a correlation coefficient...

Descripción completa

Detalles Bibliográficos
Autores principales: Martin, Christian W, Tauchen, Anika, Becker, Anke, Nattkemper, Tim W
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3035591/
https://www.ncbi.nlm.nih.gov/pubmed/21247420
http://dx.doi.org/10.1186/1756-0381-4-2
_version_ 1782197790978342912
author Martin, Christian W
Tauchen, Anika
Becker, Anke
Nattkemper, Tim W
author_facet Martin, Christian W
Tauchen, Anika
Becker, Anke
Nattkemper, Tim W
author_sort Martin, Christian W
collection PubMed
description BACKGROUND: Measurements on gene level are widely used to gain new insights in complex diseases e.g. cancer. A promising approach to understand basic biological mechanisms is to combine gene expression profiles and classical clinical parameters. However, the computation of a correlation coefficient between high-dimensional data and such parameters is not covered by traditional statistical methods. METHODS: We propose a novel index, the Normalized Tree Index (NTI), to compute a correlation coefficient between the clustering result of high-dimensional microarray data and nominal clinical parameters. The NTI detects correlations between hierarchically clustered microarray data and nominal clinical parameters (labels) and gives a measurement of significance in terms of an empiric p-value of the identified correlations. Therefore, the microarray data is clustered by hierarchical agglomerative clustering using standard settings. In a second step, the computed cluster tree is evaluated. For each label, a NTI is computed measuring the correlation between that label and the clustered microarray data. RESULTS: The NTI successfully identifies correlated clinical parameters at different levels of significance when applied on two real-world microarray breast cancer data sets. Some of the identified highly correlated labels confirm the actual state of knowledge whereas others help to identify new risk factors and provide a good basis to formulate new hypothesis. CONCLUSIONS: The NTI is a valuable tool in the domain of biomedical data analysis. It allows the identification of correlations between high-dimensional data and nominal labels, while at the same time a p-value measures the level of significance of the detected correlations.
format Text
id pubmed-3035591
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30355912011-02-17 A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments Martin, Christian W Tauchen, Anika Becker, Anke Nattkemper, Tim W BioData Min Research BACKGROUND: Measurements on gene level are widely used to gain new insights in complex diseases e.g. cancer. A promising approach to understand basic biological mechanisms is to combine gene expression profiles and classical clinical parameters. However, the computation of a correlation coefficient between high-dimensional data and such parameters is not covered by traditional statistical methods. METHODS: We propose a novel index, the Normalized Tree Index (NTI), to compute a correlation coefficient between the clustering result of high-dimensional microarray data and nominal clinical parameters. The NTI detects correlations between hierarchically clustered microarray data and nominal clinical parameters (labels) and gives a measurement of significance in terms of an empiric p-value of the identified correlations. Therefore, the microarray data is clustered by hierarchical agglomerative clustering using standard settings. In a second step, the computed cluster tree is evaluated. For each label, a NTI is computed measuring the correlation between that label and the clustered microarray data. RESULTS: The NTI successfully identifies correlated clinical parameters at different levels of significance when applied on two real-world microarray breast cancer data sets. Some of the identified highly correlated labels confirm the actual state of knowledge whereas others help to identify new risk factors and provide a good basis to formulate new hypothesis. CONCLUSIONS: The NTI is a valuable tool in the domain of biomedical data analysis. It allows the identification of correlations between high-dimensional data and nominal labels, while at the same time a p-value measures the level of significance of the detected correlations. BioMed Central 2011-01-19 /pmc/articles/PMC3035591/ /pubmed/21247420 http://dx.doi.org/10.1186/1756-0381-4-2 Text en Copyright ©2011 Martin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Martin, Christian W
Tauchen, Anika
Becker, Anke
Nattkemper, Tim W
A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments
title A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments
title_full A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments
title_fullStr A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments
title_full_unstemmed A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments
title_short A Normalized Tree Index for identification of correlated clinical parameters in microarray experiments
title_sort normalized tree index for identification of correlated clinical parameters in microarray experiments
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3035591/
https://www.ncbi.nlm.nih.gov/pubmed/21247420
http://dx.doi.org/10.1186/1756-0381-4-2
work_keys_str_mv AT martinchristianw anormalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments
AT tauchenanika anormalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments
AT beckeranke anormalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments
AT nattkempertimw anormalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments
AT martinchristianw normalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments
AT tauchenanika normalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments
AT beckeranke normalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments
AT nattkempertimw normalizedtreeindexforidentificationofcorrelatedclinicalparametersinmicroarrayexperiments