Cargando…
Stability of gene contributions and identification of outliers in multivariate analysis of microarray data
BACKGROUND: Multivariate ordination methods are powerful tools for the exploration of complex data structures present in microarray data. These methods have several advantages compared to common gene-by-gene approaches. However, due to their exploratory nature, multivariate ordination methods do not...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2441634/ https://www.ncbi.nlm.nih.gov/pubmed/18570644 http://dx.doi.org/10.1186/1471-2105-9-289 |
_version_ | 1782156626970542080 |
---|---|
author | Baty, Florent Jaeger, Daniel Preiswerk, Frank Schumacher, Martin M Brutsche, Martin H |
author_facet | Baty, Florent Jaeger, Daniel Preiswerk, Frank Schumacher, Martin M Brutsche, Martin H |
author_sort | Baty, Florent |
collection | PubMed |
description | BACKGROUND: Multivariate ordination methods are powerful tools for the exploration of complex data structures present in microarray data. These methods have several advantages compared to common gene-by-gene approaches. However, due to their exploratory nature, multivariate ordination methods do not allow direct statistical testing of the stability of genes. RESULTS: In this study, we developed a computationally efficient algorithm for: i) the assessment of the significance of gene contributions and ii) the identification of sample outliers in multivariate analysis of microarray data. The approach is based on the use of resampling methods including bootstrapping and jackknifing. A statistical package of R functions was developed. This package includes tools for both inferring the statistical significance of gene contributions and identifying outliers among samples. CONCLUSION: The methodology was successfully applied to three published data sets with varying levels of signal intensities. Its relevance was compared with alternative methods. Overall, it proved to be particularly effective for the evaluation of the stability of microarray data. |
format | Text |
id | pubmed-2441634 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-24416342008-07-01 Stability of gene contributions and identification of outliers in multivariate analysis of microarray data Baty, Florent Jaeger, Daniel Preiswerk, Frank Schumacher, Martin M Brutsche, Martin H BMC Bioinformatics Methodology Article BACKGROUND: Multivariate ordination methods are powerful tools for the exploration of complex data structures present in microarray data. These methods have several advantages compared to common gene-by-gene approaches. However, due to their exploratory nature, multivariate ordination methods do not allow direct statistical testing of the stability of genes. RESULTS: In this study, we developed a computationally efficient algorithm for: i) the assessment of the significance of gene contributions and ii) the identification of sample outliers in multivariate analysis of microarray data. The approach is based on the use of resampling methods including bootstrapping and jackknifing. A statistical package of R functions was developed. This package includes tools for both inferring the statistical significance of gene contributions and identifying outliers among samples. CONCLUSION: The methodology was successfully applied to three published data sets with varying levels of signal intensities. Its relevance was compared with alternative methods. Overall, it proved to be particularly effective for the evaluation of the stability of microarray data. BioMed Central 2008-06-20 /pmc/articles/PMC2441634/ /pubmed/18570644 http://dx.doi.org/10.1186/1471-2105-9-289 Text en Copyright © 2008 Baty et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Baty, Florent Jaeger, Daniel Preiswerk, Frank Schumacher, Martin M Brutsche, Martin H Stability of gene contributions and identification of outliers in multivariate analysis of microarray data |
title | Stability of gene contributions and identification of outliers in multivariate analysis of microarray data |
title_full | Stability of gene contributions and identification of outliers in multivariate analysis of microarray data |
title_fullStr | Stability of gene contributions and identification of outliers in multivariate analysis of microarray data |
title_full_unstemmed | Stability of gene contributions and identification of outliers in multivariate analysis of microarray data |
title_short | Stability of gene contributions and identification of outliers in multivariate analysis of microarray data |
title_sort | stability of gene contributions and identification of outliers in multivariate analysis of microarray data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2441634/ https://www.ncbi.nlm.nih.gov/pubmed/18570644 http://dx.doi.org/10.1186/1471-2105-9-289 |
work_keys_str_mv | AT batyflorent stabilityofgenecontributionsandidentificationofoutliersinmultivariateanalysisofmicroarraydata AT jaegerdaniel stabilityofgenecontributionsandidentificationofoutliersinmultivariateanalysisofmicroarraydata AT preiswerkfrank stabilityofgenecontributionsandidentificationofoutliersinmultivariateanalysisofmicroarraydata AT schumachermartinm stabilityofgenecontributionsandidentificationofoutliersinmultivariateanalysisofmicroarraydata AT brutschemartinh stabilityofgenecontributionsandidentificationofoutliersinmultivariateanalysisofmicroarraydata |