Cargando…

Detecting Trivariate Associations in High-Dimensional Datasets

Detecting correlations in high-dimensional datasets plays an important role in data mining and knowledge discovery. While recent works achieve promising results, detecting multivariable correlations especially trivariate associations still remains a challenge. For example, maximal information coeffi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Chuanlu, Wang, Shuliang, Yuan, Hanning, Dang, Yingxu, Liu, Xiaojia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9003031/
https://www.ncbi.nlm.nih.gov/pubmed/35408419
http://dx.doi.org/10.3390/s22072806
_version_ 1784686034075779072
author Liu, Chuanlu
Wang, Shuliang
Yuan, Hanning
Dang, Yingxu
Liu, Xiaojia
author_facet Liu, Chuanlu
Wang, Shuliang
Yuan, Hanning
Dang, Yingxu
Liu, Xiaojia
author_sort Liu, Chuanlu
collection PubMed
description Detecting correlations in high-dimensional datasets plays an important role in data mining and knowledge discovery. While recent works achieve promising results, detecting multivariable correlations especially trivariate associations still remains a challenge. For example, maximal information coefficient (MIC) introduces generality and equitability to detect bivariate correlations but fails to detect multivariable correlation. To solve the problem mentioned above, we proposed quadratic optimized trivariate information coefficient (QOTIC). Specifically, QOTIC equitably measures dependence among three variables. Our contributions are three-fold: (1) we present a novel quadratic optimization procedure to approach the correlation with high accuracy; (2) QOTIC exceeds existing methods in generality and equitability as QOTIC has general test functions and is applicable in detecting multivariable correlation in datasets of various sample sizes and noise levels; (3) QOTIC achieved both higher accuracy and higher time-efficiency than previous methods. Extensive experiments demonstrate the excellent performance of QOTIC.
format Online
Article
Text
id pubmed-9003031
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-90030312022-04-13 Detecting Trivariate Associations in High-Dimensional Datasets Liu, Chuanlu Wang, Shuliang Yuan, Hanning Dang, Yingxu Liu, Xiaojia Sensors (Basel) Article Detecting correlations in high-dimensional datasets plays an important role in data mining and knowledge discovery. While recent works achieve promising results, detecting multivariable correlations especially trivariate associations still remains a challenge. For example, maximal information coefficient (MIC) introduces generality and equitability to detect bivariate correlations but fails to detect multivariable correlation. To solve the problem mentioned above, we proposed quadratic optimized trivariate information coefficient (QOTIC). Specifically, QOTIC equitably measures dependence among three variables. Our contributions are three-fold: (1) we present a novel quadratic optimization procedure to approach the correlation with high accuracy; (2) QOTIC exceeds existing methods in generality and equitability as QOTIC has general test functions and is applicable in detecting multivariable correlation in datasets of various sample sizes and noise levels; (3) QOTIC achieved both higher accuracy and higher time-efficiency than previous methods. Extensive experiments demonstrate the excellent performance of QOTIC. MDPI 2022-04-06 /pmc/articles/PMC9003031/ /pubmed/35408419 http://dx.doi.org/10.3390/s22072806 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liu, Chuanlu
Wang, Shuliang
Yuan, Hanning
Dang, Yingxu
Liu, Xiaojia
Detecting Trivariate Associations in High-Dimensional Datasets
title Detecting Trivariate Associations in High-Dimensional Datasets
title_full Detecting Trivariate Associations in High-Dimensional Datasets
title_fullStr Detecting Trivariate Associations in High-Dimensional Datasets
title_full_unstemmed Detecting Trivariate Associations in High-Dimensional Datasets
title_short Detecting Trivariate Associations in High-Dimensional Datasets
title_sort detecting trivariate associations in high-dimensional datasets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9003031/
https://www.ncbi.nlm.nih.gov/pubmed/35408419
http://dx.doi.org/10.3390/s22072806
work_keys_str_mv AT liuchuanlu detectingtrivariateassociationsinhighdimensionaldatasets
AT wangshuliang detectingtrivariateassociationsinhighdimensionaldatasets
AT yuanhanning detectingtrivariateassociationsinhighdimensionaldatasets
AT dangyingxu detectingtrivariateassociationsinhighdimensionaldatasets
AT liuxiaojia detectingtrivariateassociationsinhighdimensionaldatasets