Cargando…

A fast kernel independence test for cluster-correlated data

Cluster-correlated data receives a lot of attention in biomedical and longitudinal studies and it is of interest to assess the generalized dependence between two multivariate variables under the cluster-correlated structure. The Hilbert–Schmidt independence criterion (HSIC) is a powerful kernel-base...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Hoseung, Liu, Hongjiao, Wu, Michael C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9755291/
https://www.ncbi.nlm.nih.gov/pubmed/36522522
http://dx.doi.org/10.1038/s41598-022-26278-9
_version_ 1784851398424264704
author Song, Hoseung
Liu, Hongjiao
Wu, Michael C.
author_facet Song, Hoseung
Liu, Hongjiao
Wu, Michael C.
author_sort Song, Hoseung
collection PubMed
description Cluster-correlated data receives a lot of attention in biomedical and longitudinal studies and it is of interest to assess the generalized dependence between two multivariate variables under the cluster-correlated structure. The Hilbert–Schmidt independence criterion (HSIC) is a powerful kernel-based test statistic that captures various dependence between two random vectors and can be applied to an arbitrary non-Euclidean domain. However, the existing HSIC is not directly applicable to cluster-correlated data. Therefore, we propose a HSIC-based test of independence for cluster-correlated data. The new test statistic combines kernel information so that the dependence structure in each cluster is fully considered and exhibits good performance under high dimensions. Moreover, a rapid p value approximation makes the new test fast applicable to large datasets. Numerical studies show that the new approach performs well in both synthetic and real world data.
format Online
Article
Text
id pubmed-9755291
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97552912022-12-17 A fast kernel independence test for cluster-correlated data Song, Hoseung Liu, Hongjiao Wu, Michael C. Sci Rep Article Cluster-correlated data receives a lot of attention in biomedical and longitudinal studies and it is of interest to assess the generalized dependence between two multivariate variables under the cluster-correlated structure. The Hilbert–Schmidt independence criterion (HSIC) is a powerful kernel-based test statistic that captures various dependence between two random vectors and can be applied to an arbitrary non-Euclidean domain. However, the existing HSIC is not directly applicable to cluster-correlated data. Therefore, we propose a HSIC-based test of independence for cluster-correlated data. The new test statistic combines kernel information so that the dependence structure in each cluster is fully considered and exhibits good performance under high dimensions. Moreover, a rapid p value approximation makes the new test fast applicable to large datasets. Numerical studies show that the new approach performs well in both synthetic and real world data. Nature Publishing Group UK 2022-12-15 /pmc/articles/PMC9755291/ /pubmed/36522522 http://dx.doi.org/10.1038/s41598-022-26278-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Song, Hoseung
Liu, Hongjiao
Wu, Michael C.
A fast kernel independence test for cluster-correlated data
title A fast kernel independence test for cluster-correlated data
title_full A fast kernel independence test for cluster-correlated data
title_fullStr A fast kernel independence test for cluster-correlated data
title_full_unstemmed A fast kernel independence test for cluster-correlated data
title_short A fast kernel independence test for cluster-correlated data
title_sort fast kernel independence test for cluster-correlated data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9755291/
https://www.ncbi.nlm.nih.gov/pubmed/36522522
http://dx.doi.org/10.1038/s41598-022-26278-9
work_keys_str_mv AT songhoseung afastkernelindependencetestforclustercorrelateddata
AT liuhongjiao afastkernelindependencetestforclustercorrelateddata
AT wumichaelc afastkernelindependencetestforclustercorrelateddata
AT songhoseung fastkernelindependencetestforclustercorrelateddata
AT liuhongjiao fastkernelindependencetestforclustercorrelateddata
AT wumichaelc fastkernelindependencetestforclustercorrelateddata