Cargando…

Generalized R-squared for detecting dependence

Detecting dependence between two random variables is a fundamental problem. Although the Pearson correlation coefficient is effective for capturing linear dependence, it can be entirely powerless for detecting nonlinear and/or heteroscedastic patterns. We introduce a new measure, G-squared, to test...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, X., Jiang, B., Liu, J. S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5793683/
https://www.ncbi.nlm.nih.gov/pubmed/29430028
http://dx.doi.org/10.1093/biomet/asw071
_version_ 1783297008352624640
author Wang, X.
Jiang, B.
Liu, J. S.
author_facet Wang, X.
Jiang, B.
Liu, J. S.
author_sort Wang, X.
collection PubMed
description Detecting dependence between two random variables is a fundamental problem. Although the Pearson correlation coefficient is effective for capturing linear dependence, it can be entirely powerless for detecting nonlinear and/or heteroscedastic patterns. We introduce a new measure, G-squared, to test whether two univariate random variables are independent and to measure the strength of their relationship. The G-squared statistic is almost identical to the square of the Pearson correlation coefficient, R-squared, for linear relationships with constant error variance, and has the intuitive meaning of the piecewise R-squared between the variables. It is particularly effective in handling nonlinearity and heteroscedastic errors. We propose two estimators of G-squared and show their consistency. Simulations demonstrate that G-squared estimators are among the most powerful test statistics compared with several state-of-the-art methods.
format Online
Article
Text
id pubmed-5793683
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57936832018-03-01 Generalized R-squared for detecting dependence Wang, X. Jiang, B. Liu, J. S. Biometrika Articles Detecting dependence between two random variables is a fundamental problem. Although the Pearson correlation coefficient is effective for capturing linear dependence, it can be entirely powerless for detecting nonlinear and/or heteroscedastic patterns. We introduce a new measure, G-squared, to test whether two univariate random variables are independent and to measure the strength of their relationship. The G-squared statistic is almost identical to the square of the Pearson correlation coefficient, R-squared, for linear relationships with constant error variance, and has the intuitive meaning of the piecewise R-squared between the variables. It is particularly effective in handling nonlinearity and heteroscedastic errors. We propose two estimators of G-squared and show their consistency. Simulations demonstrate that G-squared estimators are among the most powerful test statistics compared with several state-of-the-art methods. Oxford University Press 2017-03 2017-02-22 /pmc/articles/PMC5793683/ /pubmed/29430028 http://dx.doi.org/10.1093/biomet/asw071 Text en © 2017 Biometrika Trust
spellingShingle Articles
Wang, X.
Jiang, B.
Liu, J. S.
Generalized R-squared for detecting dependence
title Generalized R-squared for detecting dependence
title_full Generalized R-squared for detecting dependence
title_fullStr Generalized R-squared for detecting dependence
title_full_unstemmed Generalized R-squared for detecting dependence
title_short Generalized R-squared for detecting dependence
title_sort generalized r-squared for detecting dependence
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5793683/
https://www.ncbi.nlm.nih.gov/pubmed/29430028
http://dx.doi.org/10.1093/biomet/asw071
work_keys_str_mv AT wangx generalizedrsquaredfordetectingdependence
AT jiangb generalizedrsquaredfordetectingdependence
AT liujs generalizedrsquaredfordetectingdependence