Cargando…

Stability of gene contributions and identification of outliers in multivariate analysis of microarray data

BACKGROUND: Multivariate ordination methods are powerful tools for the exploration of complex data structures present in microarray data. These methods have several advantages compared to common gene-by-gene approaches. However, due to their exploratory nature, multivariate ordination methods do not...

Descripción completa

Detalles Bibliográficos
Autores principales: Baty, Florent, Jaeger, Daniel, Preiswerk, Frank, Schumacher, Martin M, Brutsche, Martin H
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2441634/
https://www.ncbi.nlm.nih.gov/pubmed/18570644
http://dx.doi.org/10.1186/1471-2105-9-289
_version_ 1782156626970542080
author Baty, Florent
Jaeger, Daniel
Preiswerk, Frank
Schumacher, Martin M
Brutsche, Martin H
author_facet Baty, Florent
Jaeger, Daniel
Preiswerk, Frank
Schumacher, Martin M
Brutsche, Martin H
author_sort Baty, Florent
collection PubMed
description BACKGROUND: Multivariate ordination methods are powerful tools for the exploration of complex data structures present in microarray data. These methods have several advantages compared to common gene-by-gene approaches. However, due to their exploratory nature, multivariate ordination methods do not allow direct statistical testing of the stability of genes. RESULTS: In this study, we developed a computationally efficient algorithm for: i) the assessment of the significance of gene contributions and ii) the identification of sample outliers in multivariate analysis of microarray data. The approach is based on the use of resampling methods including bootstrapping and jackknifing. A statistical package of R functions was developed. This package includes tools for both inferring the statistical significance of gene contributions and identifying outliers among samples. CONCLUSION: The methodology was successfully applied to three published data sets with varying levels of signal intensities. Its relevance was compared with alternative methods. Overall, it proved to be particularly effective for the evaluation of the stability of microarray data.
format Text
id pubmed-2441634
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24416342008-07-01 Stability of gene contributions and identification of outliers in multivariate analysis of microarray data Baty, Florent Jaeger, Daniel Preiswerk, Frank Schumacher, Martin M Brutsche, Martin H BMC Bioinformatics Methodology Article BACKGROUND: Multivariate ordination methods are powerful tools for the exploration of complex data structures present in microarray data. These methods have several advantages compared to common gene-by-gene approaches. However, due to their exploratory nature, multivariate ordination methods do not allow direct statistical testing of the stability of genes. RESULTS: In this study, we developed a computationally efficient algorithm for: i) the assessment of the significance of gene contributions and ii) the identification of sample outliers in multivariate analysis of microarray data. The approach is based on the use of resampling methods including bootstrapping and jackknifing. A statistical package of R functions was developed. This package includes tools for both inferring the statistical significance of gene contributions and identifying outliers among samples. CONCLUSION: The methodology was successfully applied to three published data sets with varying levels of signal intensities. Its relevance was compared with alternative methods. Overall, it proved to be particularly effective for the evaluation of the stability of microarray data. BioMed Central 2008-06-20 /pmc/articles/PMC2441634/ /pubmed/18570644 http://dx.doi.org/10.1186/1471-2105-9-289 Text en Copyright © 2008 Baty et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Baty, Florent
Jaeger, Daniel
Preiswerk, Frank
Schumacher, Martin M
Brutsche, Martin H
Stability of gene contributions and identification of outliers in multivariate analysis of microarray data
title Stability of gene contributions and identification of outliers in multivariate analysis of microarray data
title_full Stability of gene contributions and identification of outliers in multivariate analysis of microarray data
title_fullStr Stability of gene contributions and identification of outliers in multivariate analysis of microarray data
title_full_unstemmed Stability of gene contributions and identification of outliers in multivariate analysis of microarray data
title_short Stability of gene contributions and identification of outliers in multivariate analysis of microarray data
title_sort stability of gene contributions and identification of outliers in multivariate analysis of microarray data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2441634/
https://www.ncbi.nlm.nih.gov/pubmed/18570644
http://dx.doi.org/10.1186/1471-2105-9-289
work_keys_str_mv AT batyflorent stabilityofgenecontributionsandidentificationofoutliersinmultivariateanalysisofmicroarraydata
AT jaegerdaniel stabilityofgenecontributionsandidentificationofoutliersinmultivariateanalysisofmicroarraydata
AT preiswerkfrank stabilityofgenecontributionsandidentificationofoutliersinmultivariateanalysisofmicroarraydata
AT schumachermartinm stabilityofgenecontributionsandidentificationofoutliersinmultivariateanalysisofmicroarraydata
AT brutschemartinh stabilityofgenecontributionsandidentificationofoutliersinmultivariateanalysisofmicroarraydata