Cargando…

Discovering General Multidimensional Associations

When two variables are related by a known function, the coefficient of determination (denoted R(2)) measures the proportion of the total variance in the observations explained by that function. For linear relationships, this is equal to the square of the correlation coefficient, ρ. When the parametr...

Descripción completa

Detalles Bibliográficos
Autores principales: Murrell, Ben, Murrell, Daniel, Murrell, Hugh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4798755/
https://www.ncbi.nlm.nih.gov/pubmed/26991498
http://dx.doi.org/10.1371/journal.pone.0151551
_version_ 1782422215798554624
author Murrell, Ben
Murrell, Daniel
Murrell, Hugh
author_facet Murrell, Ben
Murrell, Daniel
Murrell, Hugh
author_sort Murrell, Ben
collection PubMed
description When two variables are related by a known function, the coefficient of determination (denoted R(2)) measures the proportion of the total variance in the observations explained by that function. For linear relationships, this is equal to the square of the correlation coefficient, ρ. When the parametric form of the relationship is unknown, however, it is unclear how to estimate the proportion of explained variance equitably—assigning similar values to equally noisy relationships. Here we demonstrate how to directly estimate a generalised R(2) when the form of the relationship is unknown, and we consider the performance of the Maximal Information Coefficient (MIC)—a recently proposed information theoretic measure of dependence. We show that our approach behaves equitably, has more power than MIC to detect association between variables, and converges faster with increasing sample size. Most importantly, our approach generalises to higher dimensions, estimating the strength of multivariate relationships (Y against A, B, …) as well as measuring association while controlling for covariates (Y against X controlling for C). An R package named matie (“Measuring Association and Testing Independence Efficiently”) is available (http://cran.r-project.org/web/packages/matie/).
format Online
Article
Text
id pubmed-4798755
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-47987552016-03-23 Discovering General Multidimensional Associations Murrell, Ben Murrell, Daniel Murrell, Hugh PLoS One Research Article When two variables are related by a known function, the coefficient of determination (denoted R(2)) measures the proportion of the total variance in the observations explained by that function. For linear relationships, this is equal to the square of the correlation coefficient, ρ. When the parametric form of the relationship is unknown, however, it is unclear how to estimate the proportion of explained variance equitably—assigning similar values to equally noisy relationships. Here we demonstrate how to directly estimate a generalised R(2) when the form of the relationship is unknown, and we consider the performance of the Maximal Information Coefficient (MIC)—a recently proposed information theoretic measure of dependence. We show that our approach behaves equitably, has more power than MIC to detect association between variables, and converges faster with increasing sample size. Most importantly, our approach generalises to higher dimensions, estimating the strength of multivariate relationships (Y against A, B, …) as well as measuring association while controlling for covariates (Y against X controlling for C). An R package named matie (“Measuring Association and Testing Independence Efficiently”) is available (http://cran.r-project.org/web/packages/matie/). Public Library of Science 2016-03-18 /pmc/articles/PMC4798755/ /pubmed/26991498 http://dx.doi.org/10.1371/journal.pone.0151551 Text en © 2016 Murrell et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Murrell, Ben
Murrell, Daniel
Murrell, Hugh
Discovering General Multidimensional Associations
title Discovering General Multidimensional Associations
title_full Discovering General Multidimensional Associations
title_fullStr Discovering General Multidimensional Associations
title_full_unstemmed Discovering General Multidimensional Associations
title_short Discovering General Multidimensional Associations
title_sort discovering general multidimensional associations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4798755/
https://www.ncbi.nlm.nih.gov/pubmed/26991498
http://dx.doi.org/10.1371/journal.pone.0151551
work_keys_str_mv AT murrellben discoveringgeneralmultidimensionalassociations
AT murrelldaniel discoveringgeneralmultidimensionalassociations
AT murrellhugh discoveringgeneralmultidimensionalassociations