Cargando…

Effect of Normalization on Statistical and Biological Interpretation of Gene Expression Profiles

An under-appreciated aspect of the genetic analysis of gene expression is the impact of post-probe level normalization on biological inference. Here we contrast nine different methods for normalization of an Illumina bead-array gene expression profiling dataset consisting of peripheral blood samples...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Shaopu, Kim, Jinhee, Arafat, Dalia, Gibson, Greg
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668151/
https://www.ncbi.nlm.nih.gov/pubmed/23755061
http://dx.doi.org/10.3389/fgene.2012.00160
_version_ 1782271578276364288
author Qin, Shaopu
Kim, Jinhee
Arafat, Dalia
Gibson, Greg
author_facet Qin, Shaopu
Kim, Jinhee
Arafat, Dalia
Gibson, Greg
author_sort Qin, Shaopu
collection PubMed
description An under-appreciated aspect of the genetic analysis of gene expression is the impact of post-probe level normalization on biological inference. Here we contrast nine different methods for normalization of an Illumina bead-array gene expression profiling dataset consisting of peripheral blood samples from 189 individual participants in the Center for Health Discovery and Well Being study in Atlanta, quantifying differences in the inference of global variance components and covariance of gene expression, as well as the detection of variants that affect transcript abundance (eSNPs). The normalization strategies, all relative to raw log2 measures, include simple mean centering, two modes of transcript-level linear adjustment for technical factors, and for differential immune cell counts, variance normalization by interquartile range and by quantile, fitting the first 16 Principal Components, and supervised normalization using the SNM procedure with adjustment for cell counts. Robustness of genetic associations as a consequence of Pearson and Spearman rank correlation is also reported for each method, and it is shown that the normalization strategy has a far greater impact than correlation method. We describe similarities among methods, discuss the impact on biological interpretation, and make recommendations regarding appropriate strategies.
format Online
Article
Text
id pubmed-3668151
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-36681512013-06-10 Effect of Normalization on Statistical and Biological Interpretation of Gene Expression Profiles Qin, Shaopu Kim, Jinhee Arafat, Dalia Gibson, Greg Front Genet Genetics An under-appreciated aspect of the genetic analysis of gene expression is the impact of post-probe level normalization on biological inference. Here we contrast nine different methods for normalization of an Illumina bead-array gene expression profiling dataset consisting of peripheral blood samples from 189 individual participants in the Center for Health Discovery and Well Being study in Atlanta, quantifying differences in the inference of global variance components and covariance of gene expression, as well as the detection of variants that affect transcript abundance (eSNPs). The normalization strategies, all relative to raw log2 measures, include simple mean centering, two modes of transcript-level linear adjustment for technical factors, and for differential immune cell counts, variance normalization by interquartile range and by quantile, fitting the first 16 Principal Components, and supervised normalization using the SNM procedure with adjustment for cell counts. Robustness of genetic associations as a consequence of Pearson and Spearman rank correlation is also reported for each method, and it is shown that the normalization strategy has a far greater impact than correlation method. We describe similarities among methods, discuss the impact on biological interpretation, and make recommendations regarding appropriate strategies. Frontiers Media S.A. 2013-05-31 /pmc/articles/PMC3668151/ /pubmed/23755061 http://dx.doi.org/10.3389/fgene.2012.00160 Text en Copyright © 2013 Qin, Kim, Arafat and Gibson. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
spellingShingle Genetics
Qin, Shaopu
Kim, Jinhee
Arafat, Dalia
Gibson, Greg
Effect of Normalization on Statistical and Biological Interpretation of Gene Expression Profiles
title Effect of Normalization on Statistical and Biological Interpretation of Gene Expression Profiles
title_full Effect of Normalization on Statistical and Biological Interpretation of Gene Expression Profiles
title_fullStr Effect of Normalization on Statistical and Biological Interpretation of Gene Expression Profiles
title_full_unstemmed Effect of Normalization on Statistical and Biological Interpretation of Gene Expression Profiles
title_short Effect of Normalization on Statistical and Biological Interpretation of Gene Expression Profiles
title_sort effect of normalization on statistical and biological interpretation of gene expression profiles
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668151/
https://www.ncbi.nlm.nih.gov/pubmed/23755061
http://dx.doi.org/10.3389/fgene.2012.00160
work_keys_str_mv AT qinshaopu effectofnormalizationonstatisticalandbiologicalinterpretationofgeneexpressionprofiles
AT kimjinhee effectofnormalizationonstatisticalandbiologicalinterpretationofgeneexpressionprofiles
AT arafatdalia effectofnormalizationonstatisticalandbiologicalinterpretationofgeneexpressionprofiles
AT gibsongreg effectofnormalizationonstatisticalandbiologicalinterpretationofgeneexpressionprofiles