Cargando…

Finding Biomarker Signatures in Pooled Sample Designs: A Simulation Framework for Methodological Comparisons

Detection of discriminating patterns in gene expression data can be accomplished by using various methods of statistical learning. It has been proposed that sample pooling in this context would have negative effects; however, pooling cannot always be avoided. We propose a simulation framework to exp...

Descripción completa

Detalles Bibliográficos
Autores principales: Telaar, Anna, Nürnberg, Gerd, Repsilber, Dirk
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2909718/
https://www.ncbi.nlm.nih.gov/pubmed/20671968
http://dx.doi.org/10.1155/2010/318573
_version_ 1782184313395085312
author Telaar, Anna
Nürnberg, Gerd
Repsilber, Dirk
author_facet Telaar, Anna
Nürnberg, Gerd
Repsilber, Dirk
author_sort Telaar, Anna
collection PubMed
description Detection of discriminating patterns in gene expression data can be accomplished by using various methods of statistical learning. It has been proposed that sample pooling in this context would have negative effects; however, pooling cannot always be avoided. We propose a simulation framework to explicitly investigate the parameters of patterns, experimental design, noise, and choice of method in order to find out which effects on classification performance are to be expected. We use a two-group classification task and simulated gene expression data with independent differentially expressed genes as well as bivariate linear patterns and the combination of both. Our results show a clear increase of prediction error with pool size. For pooled training sets powered partial least squares discriminant analysis outperforms discriminance analysis, random forests, and support vector machines with linear or radial kernel for two of three simulated scenarios. The proposed simulation approach can be implemented to systematically investigate a number of additional scenarios of practical interest.
format Text
id pubmed-2909718
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-29097182010-07-29 Finding Biomarker Signatures in Pooled Sample Designs: A Simulation Framework for Methodological Comparisons Telaar, Anna Nürnberg, Gerd Repsilber, Dirk Adv Bioinformatics Research Article Detection of discriminating patterns in gene expression data can be accomplished by using various methods of statistical learning. It has been proposed that sample pooling in this context would have negative effects; however, pooling cannot always be avoided. We propose a simulation framework to explicitly investigate the parameters of patterns, experimental design, noise, and choice of method in order to find out which effects on classification performance are to be expected. We use a two-group classification task and simulated gene expression data with independent differentially expressed genes as well as bivariate linear patterns and the combination of both. Our results show a clear increase of prediction error with pool size. For pooled training sets powered partial least squares discriminant analysis outperforms discriminance analysis, random forests, and support vector machines with linear or radial kernel for two of three simulated scenarios. The proposed simulation approach can be implemented to systematically investigate a number of additional scenarios of practical interest. Hindawi Publishing Corporation 2010 2010-07-04 /pmc/articles/PMC2909718/ /pubmed/20671968 http://dx.doi.org/10.1155/2010/318573 Text en Copyright © 2010 Anna Telaar et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Telaar, Anna
Nürnberg, Gerd
Repsilber, Dirk
Finding Biomarker Signatures in Pooled Sample Designs: A Simulation Framework for Methodological Comparisons
title Finding Biomarker Signatures in Pooled Sample Designs: A Simulation Framework for Methodological Comparisons
title_full Finding Biomarker Signatures in Pooled Sample Designs: A Simulation Framework for Methodological Comparisons
title_fullStr Finding Biomarker Signatures in Pooled Sample Designs: A Simulation Framework for Methodological Comparisons
title_full_unstemmed Finding Biomarker Signatures in Pooled Sample Designs: A Simulation Framework for Methodological Comparisons
title_short Finding Biomarker Signatures in Pooled Sample Designs: A Simulation Framework for Methodological Comparisons
title_sort finding biomarker signatures in pooled sample designs: a simulation framework for methodological comparisons
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2909718/
https://www.ncbi.nlm.nih.gov/pubmed/20671968
http://dx.doi.org/10.1155/2010/318573
work_keys_str_mv AT telaaranna findingbiomarkersignaturesinpooledsampledesignsasimulationframeworkformethodologicalcomparisons
AT nurnberggerd findingbiomarkersignaturesinpooledsampledesignsasimulationframeworkformethodologicalcomparisons
AT repsilberdirk findingbiomarkersignaturesinpooledsampledesignsasimulationframeworkformethodologicalcomparisons