Cargando…

How should we measure proportionality on relative gene expression data?

Correlation is ubiquitously used in gene expression analysis although its validity as an objective criterion is often questionable. If no normalization reflecting the original mRNA counts in the cells is available, correlation between genes becomes spurious. Yet the need for normalization can be byp...

Descripción completa

Detalles Bibliográficos
Autores principales: Erb, Ionas, Notredame, Cedric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4870310/
https://www.ncbi.nlm.nih.gov/pubmed/26762323
http://dx.doi.org/10.1007/s12064-015-0220-8
_version_ 1782432416562937856
author Erb, Ionas
Notredame, Cedric
author_facet Erb, Ionas
Notredame, Cedric
author_sort Erb, Ionas
collection PubMed
description Correlation is ubiquitously used in gene expression analysis although its validity as an objective criterion is often questionable. If no normalization reflecting the original mRNA counts in the cells is available, correlation between genes becomes spurious. Yet the need for normalization can be bypassed using a relative analysis approach called log-ratio analysis. This approach can be used to identify proportional gene pairs, i.e. a subset of pairs whose correlation can be inferred correctly from unnormalized data due to their vanishing log-ratio variance. To interpret the size of non-zero log-ratio variances, a proposal for a scaling with respect to the variance of one member of the gene pair was recently made by Lovell et al. Here we derive analytically how spurious proportionality is introduced when using a scaling. We base our analysis on a symmetric proportionality coefficient (briefly mentioned in Lovell et al.) that has a number of advantages over their statistic. We show in detail how the choice of reference needed for the scaling determines which gene pairs are identified as proportional. We demonstrate that using an unchanged gene as a reference has huge advantages in terms of sensitivity. We also explore the link between proportionality and partial correlation and derive expressions for a partial proportionality coefficient. A brief data-analysis part puts the discussed concepts into practice.
format Online
Article
Text
id pubmed-4870310
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-48703102016-06-21 How should we measure proportionality on relative gene expression data? Erb, Ionas Notredame, Cedric Theory Biosci Original Paper Correlation is ubiquitously used in gene expression analysis although its validity as an objective criterion is often questionable. If no normalization reflecting the original mRNA counts in the cells is available, correlation between genes becomes spurious. Yet the need for normalization can be bypassed using a relative analysis approach called log-ratio analysis. This approach can be used to identify proportional gene pairs, i.e. a subset of pairs whose correlation can be inferred correctly from unnormalized data due to their vanishing log-ratio variance. To interpret the size of non-zero log-ratio variances, a proposal for a scaling with respect to the variance of one member of the gene pair was recently made by Lovell et al. Here we derive analytically how spurious proportionality is introduced when using a scaling. We base our analysis on a symmetric proportionality coefficient (briefly mentioned in Lovell et al.) that has a number of advantages over their statistic. We show in detail how the choice of reference needed for the scaling determines which gene pairs are identified as proportional. We demonstrate that using an unchanged gene as a reference has huge advantages in terms of sensitivity. We also explore the link between proportionality and partial correlation and derive expressions for a partial proportionality coefficient. A brief data-analysis part puts the discussed concepts into practice. Springer Berlin Heidelberg 2016-01-13 2016 /pmc/articles/PMC4870310/ /pubmed/26762323 http://dx.doi.org/10.1007/s12064-015-0220-8 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Paper
Erb, Ionas
Notredame, Cedric
How should we measure proportionality on relative gene expression data?
title How should we measure proportionality on relative gene expression data?
title_full How should we measure proportionality on relative gene expression data?
title_fullStr How should we measure proportionality on relative gene expression data?
title_full_unstemmed How should we measure proportionality on relative gene expression data?
title_short How should we measure proportionality on relative gene expression data?
title_sort how should we measure proportionality on relative gene expression data?
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4870310/
https://www.ncbi.nlm.nih.gov/pubmed/26762323
http://dx.doi.org/10.1007/s12064-015-0220-8
work_keys_str_mv AT erbionas howshouldwemeasureproportionalityonrelativegeneexpressiondata
AT notredamecedric howshouldwemeasureproportionalityonrelativegeneexpressiondata