Cargando…

Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition

BACKGROUND: DNA microarrays are used for discovery of genes expressed differentially between various biological conditions. In microarray experiments the number of analyzed samples is often much lower than the number of genes (probe sets) which leads to many false discoveries. Multiple testing corre...

Descripción completa

Detalles Bibliográficos
Autores principales: Marczyk, Michal, Jaksik, Roman, Polanski, Andrzej, Polanska, Joanna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3637832/
https://www.ncbi.nlm.nih.gov/pubmed/23510016
http://dx.doi.org/10.1186/1471-2105-14-101
_version_ 1782475788129402880
author Marczyk, Michal
Jaksik, Roman
Polanski, Andrzej
Polanska, Joanna
author_facet Marczyk, Michal
Jaksik, Roman
Polanski, Andrzej
Polanska, Joanna
author_sort Marczyk, Michal
collection PubMed
description BACKGROUND: DNA microarrays are used for discovery of genes expressed differentially between various biological conditions. In microarray experiments the number of analyzed samples is often much lower than the number of genes (probe sets) which leads to many false discoveries. Multiple testing correction methods control the number of false discoveries but decrease the sensitivity of discovering differentially expressed genes. Concerning this problem, filtering methods for improving the power of detection of differentially expressed genes were proposed in earlier papers. These techniques are two-step procedures, where in the first step some pool of non-informative genes is removed and in the second step only the pool of the retained genes is used for searching for differentially expressed genes. RESULTS: A very important parameter to choose is the proportion between the sizes of the pools of removed and retained genes. A new method, which we propose, allow to determine close to optimal threshold values for sample means and sample variances for gene filtering. The method is adaptive and based on the decomposition of the histogram of gene expression means or variances into mixture of Gaussian components. CONCLUSIONS: By performing analyses of several publicly available datasets and simulated datasets we demonstrate that our adaptive method increases sensitivity of finding differentially expressed genes compared to previous methods of filtering microarray data based on using fixed threshold values.
format Online
Article
Text
id pubmed-3637832
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36378322013-05-01 Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition Marczyk, Michal Jaksik, Roman Polanski, Andrzej Polanska, Joanna BMC Bioinformatics Methodology Article BACKGROUND: DNA microarrays are used for discovery of genes expressed differentially between various biological conditions. In microarray experiments the number of analyzed samples is often much lower than the number of genes (probe sets) which leads to many false discoveries. Multiple testing correction methods control the number of false discoveries but decrease the sensitivity of discovering differentially expressed genes. Concerning this problem, filtering methods for improving the power of detection of differentially expressed genes were proposed in earlier papers. These techniques are two-step procedures, where in the first step some pool of non-informative genes is removed and in the second step only the pool of the retained genes is used for searching for differentially expressed genes. RESULTS: A very important parameter to choose is the proportion between the sizes of the pools of removed and retained genes. A new method, which we propose, allow to determine close to optimal threshold values for sample means and sample variances for gene filtering. The method is adaptive and based on the decomposition of the histogram of gene expression means or variances into mixture of Gaussian components. CONCLUSIONS: By performing analyses of several publicly available datasets and simulated datasets we demonstrate that our adaptive method increases sensitivity of finding differentially expressed genes compared to previous methods of filtering microarray data based on using fixed threshold values. BioMed Central 2013-03-20 /pmc/articles/PMC3637832/ /pubmed/23510016 http://dx.doi.org/10.1186/1471-2105-14-101 Text en Copyright © 2013 Marczyk et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Marczyk, Michal
Jaksik, Roman
Polanski, Andrzej
Polanska, Joanna
Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition
title Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition
title_full Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition
title_fullStr Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition
title_full_unstemmed Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition
title_short Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition
title_sort adaptive filtering of microarray gene expression data based on gaussian mixture decomposition
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3637832/
https://www.ncbi.nlm.nih.gov/pubmed/23510016
http://dx.doi.org/10.1186/1471-2105-14-101
work_keys_str_mv AT marczykmichal adaptivefilteringofmicroarraygeneexpressiondatabasedongaussianmixturedecomposition
AT jaksikroman adaptivefilteringofmicroarraygeneexpressiondatabasedongaussianmixturedecomposition
AT polanskiandrzej adaptivefilteringofmicroarraygeneexpressiondatabasedongaussianmixturedecomposition
AT polanskajoanna adaptivefilteringofmicroarraygeneexpressiondatabasedongaussianmixturedecomposition