Cargando…

A Novel Test for Independence Derived from an Exact Distribution of ith Nearest Neighbours

Dependence measures and tests for independence have recently attracted a lot of attention, because they are the cornerstone of algorithms for network inference in probabilistic graphical models. Pearson's product moment correlation coefficient is still by far the most widely used statistic yet...

Descripción completa

Detalles Bibliográficos
Autores principales: Dümcke, Sebastian, Mansmann, Ulrich, Tresch, Achim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4183502/
https://www.ncbi.nlm.nih.gov/pubmed/25275469
http://dx.doi.org/10.1371/journal.pone.0107955
_version_ 1782337700357996544
author Dümcke, Sebastian
Mansmann, Ulrich
Tresch, Achim
author_facet Dümcke, Sebastian
Mansmann, Ulrich
Tresch, Achim
author_sort Dümcke, Sebastian
collection PubMed
description Dependence measures and tests for independence have recently attracted a lot of attention, because they are the cornerstone of algorithms for network inference in probabilistic graphical models. Pearson's product moment correlation coefficient is still by far the most widely used statistic yet it is largely constrained to detecting linear relationships. In this work we provide an exact formula for the [Image: see text]th nearest neighbor distance distribution of rank-transformed data. Based on that, we propose two novel tests for independence. An implementation of these tests, together with a general benchmark framework for independence testing, are freely available as a CRAN software package (http://cran.r-project.org/web/packages/knnIndep). In this paper we have benchmarked Pearson's correlation, Hoeffding's [Image: see text], dcor, Kraskov's estimator for mutual information, maximal information criterion and our two tests. We conclude that no particular method is generally superior to all other methods. However, dcor and Hoeffding's [Image: see text] are the most powerful tests for many different types of dependence.
format Online
Article
Text
id pubmed-4183502
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41835022014-10-07 A Novel Test for Independence Derived from an Exact Distribution of ith Nearest Neighbours Dümcke, Sebastian Mansmann, Ulrich Tresch, Achim PLoS One Research Article Dependence measures and tests for independence have recently attracted a lot of attention, because they are the cornerstone of algorithms for network inference in probabilistic graphical models. Pearson's product moment correlation coefficient is still by far the most widely used statistic yet it is largely constrained to detecting linear relationships. In this work we provide an exact formula for the [Image: see text]th nearest neighbor distance distribution of rank-transformed data. Based on that, we propose two novel tests for independence. An implementation of these tests, together with a general benchmark framework for independence testing, are freely available as a CRAN software package (http://cran.r-project.org/web/packages/knnIndep). In this paper we have benchmarked Pearson's correlation, Hoeffding's [Image: see text], dcor, Kraskov's estimator for mutual information, maximal information criterion and our two tests. We conclude that no particular method is generally superior to all other methods. However, dcor and Hoeffding's [Image: see text] are the most powerful tests for many different types of dependence. Public Library of Science 2014-10-02 /pmc/articles/PMC4183502/ /pubmed/25275469 http://dx.doi.org/10.1371/journal.pone.0107955 Text en © 2014 Dümcke et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Dümcke, Sebastian
Mansmann, Ulrich
Tresch, Achim
A Novel Test for Independence Derived from an Exact Distribution of ith Nearest Neighbours
title A Novel Test for Independence Derived from an Exact Distribution of ith Nearest Neighbours
title_full A Novel Test for Independence Derived from an Exact Distribution of ith Nearest Neighbours
title_fullStr A Novel Test for Independence Derived from an Exact Distribution of ith Nearest Neighbours
title_full_unstemmed A Novel Test for Independence Derived from an Exact Distribution of ith Nearest Neighbours
title_short A Novel Test for Independence Derived from an Exact Distribution of ith Nearest Neighbours
title_sort novel test for independence derived from an exact distribution of ith nearest neighbours
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4183502/
https://www.ncbi.nlm.nih.gov/pubmed/25275469
http://dx.doi.org/10.1371/journal.pone.0107955
work_keys_str_mv AT dumckesebastian anoveltestforindependencederivedfromanexactdistributionofithnearestneighbours
AT mansmannulrich anoveltestforindependencederivedfromanexactdistributionofithnearestneighbours
AT treschachim anoveltestforindependencederivedfromanexactdistributionofithnearestneighbours
AT dumckesebastian noveltestforindependencederivedfromanexactdistributionofithnearestneighbours
AT mansmannulrich noveltestforindependencederivedfromanexactdistributionofithnearestneighbours
AT treschachim noveltestforindependencederivedfromanexactdistributionofithnearestneighbours