Cargando…

A Case Study on Choosing Normalization Methods and Test Statistics for Two-Channel Microarray Data

DNA microarray analysis is a biological technology which permits the whole genome to be monitored simultaneously on a single slide. Microarray technology not only opens an exciting research area for biologists, but also provides significant new challenges to statisticians. Two very common questions...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Yang, Jeong, Kyeong S., Pan, Wei, Khodursky, Arkady, Carlin, Bradley P.
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447464/
https://www.ncbi.nlm.nih.gov/pubmed/18629172
http://dx.doi.org/10.1002/cfg.416
_version_ 1782156945651662848
author Xie, Yang
Jeong, Kyeong S.
Pan, Wei
Khodursky, Arkady
Carlin, Bradley P.
author_facet Xie, Yang
Jeong, Kyeong S.
Pan, Wei
Khodursky, Arkady
Carlin, Bradley P.
author_sort Xie, Yang
collection PubMed
description DNA microarray analysis is a biological technology which permits the whole genome to be monitored simultaneously on a single slide. Microarray technology not only opens an exciting research area for biologists, but also provides significant new challenges to statisticians. Two very common questions in the analysis of microarray data are, first, should we normalize arrays to remove potential systematic biases, and if so, what normalization method should we use? Second, how should we then implement tests of statistical significance? Straightforward and uniform answers to these questions remain elusive. In this paper, we use a real data example to illustrate a practical approach to addressing these questions. Our data is taken from a DNA–protein binding microarray experiment aimed at furthering our understanding of transcription regulation mechanisms, one of the most important issues in biology. For the purpose of preprocessing data, we suggest looking at descriptive plots first to decide whether we need preliminary normalization and, if so, how this should be accomplished. For subsequent comparative inference, we recommend use of an empirical Bayes method (the B statistic), since it performs much better than traditional methods, such as the sample mean (M statistic) and Student's t statistic, and it is also relatively easy to compute and explain compared to the others. The false discovery rate (FDR) is used to evaluate the different methods, and our comparative results lend support to our above suggestions.
format Text
id pubmed-2447464
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-24474642008-07-14 A Case Study on Choosing Normalization Methods and Test Statistics for Two-Channel Microarray Data Xie, Yang Jeong, Kyeong S. Pan, Wei Khodursky, Arkady Carlin, Bradley P. Comp Funct Genomics Research Article DNA microarray analysis is a biological technology which permits the whole genome to be monitored simultaneously on a single slide. Microarray technology not only opens an exciting research area for biologists, but also provides significant new challenges to statisticians. Two very common questions in the analysis of microarray data are, first, should we normalize arrays to remove potential systematic biases, and if so, what normalization method should we use? Second, how should we then implement tests of statistical significance? Straightforward and uniform answers to these questions remain elusive. In this paper, we use a real data example to illustrate a practical approach to addressing these questions. Our data is taken from a DNA–protein binding microarray experiment aimed at furthering our understanding of transcription regulation mechanisms, one of the most important issues in biology. For the purpose of preprocessing data, we suggest looking at descriptive plots first to decide whether we need preliminary normalization and, if so, how this should be accomplished. For subsequent comparative inference, we recommend use of an empirical Bayes method (the B statistic), since it performs much better than traditional methods, such as the sample mean (M statistic) and Student's t statistic, and it is also relatively easy to compute and explain compared to the others. The false discovery rate (FDR) is used to evaluate the different methods, and our comparative results lend support to our above suggestions. Hindawi Publishing Corporation 2004-07 /pmc/articles/PMC2447464/ /pubmed/18629172 http://dx.doi.org/10.1002/cfg.416 Text en Copyright © 2004 Hindawi Publishing Corporation. http://creativecommons.org/licenses/by/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xie, Yang
Jeong, Kyeong S.
Pan, Wei
Khodursky, Arkady
Carlin, Bradley P.
A Case Study on Choosing Normalization Methods and Test Statistics for Two-Channel Microarray Data
title A Case Study on Choosing Normalization Methods and Test Statistics for Two-Channel Microarray Data
title_full A Case Study on Choosing Normalization Methods and Test Statistics for Two-Channel Microarray Data
title_fullStr A Case Study on Choosing Normalization Methods and Test Statistics for Two-Channel Microarray Data
title_full_unstemmed A Case Study on Choosing Normalization Methods and Test Statistics for Two-Channel Microarray Data
title_short A Case Study on Choosing Normalization Methods and Test Statistics for Two-Channel Microarray Data
title_sort case study on choosing normalization methods and test statistics for two-channel microarray data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447464/
https://www.ncbi.nlm.nih.gov/pubmed/18629172
http://dx.doi.org/10.1002/cfg.416
work_keys_str_mv AT xieyang acasestudyonchoosingnormalizationmethodsandteststatisticsfortwochannelmicroarraydata
AT jeongkyeongs acasestudyonchoosingnormalizationmethodsandteststatisticsfortwochannelmicroarraydata
AT panwei acasestudyonchoosingnormalizationmethodsandteststatisticsfortwochannelmicroarraydata
AT khodurskyarkady acasestudyonchoosingnormalizationmethodsandteststatisticsfortwochannelmicroarraydata
AT carlinbradleyp acasestudyonchoosingnormalizationmethodsandteststatisticsfortwochannelmicroarraydata
AT xieyang casestudyonchoosingnormalizationmethodsandteststatisticsfortwochannelmicroarraydata
AT jeongkyeongs casestudyonchoosingnormalizationmethodsandteststatisticsfortwochannelmicroarraydata
AT panwei casestudyonchoosingnormalizationmethodsandteststatisticsfortwochannelmicroarraydata
AT khodurskyarkady casestudyonchoosingnormalizationmethodsandteststatisticsfortwochannelmicroarraydata
AT carlinbradleyp casestudyonchoosingnormalizationmethodsandteststatisticsfortwochannelmicroarraydata