Cargando…

Efficient test for nonlinear dependence of two continuous variables

BACKGROUND: Testing dependence/correlation of two variables is one of the fundamental tasks in statistics. In this work, we proposed a new way of testing nonlinear dependence between two continuous variables (X and Y). RESULTS: We addressed this research question by using CANOVA (continuous analysis...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yi, Li, Yi, Cao, Hongbao, Xiong, Momiao, Shugart, Yin Yao, Jin, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539721/
https://www.ncbi.nlm.nih.gov/pubmed/26283601
http://dx.doi.org/10.1186/s12859-015-0697-7
_version_ 1782386151892451328
author Wang, Yi
Li, Yi
Cao, Hongbao
Xiong, Momiao
Shugart, Yin Yao
Jin, Li
author_facet Wang, Yi
Li, Yi
Cao, Hongbao
Xiong, Momiao
Shugart, Yin Yao
Jin, Li
author_sort Wang, Yi
collection PubMed
description BACKGROUND: Testing dependence/correlation of two variables is one of the fundamental tasks in statistics. In this work, we proposed a new way of testing nonlinear dependence between two continuous variables (X and Y). RESULTS: We addressed this research question by using CANOVA (continuous analysis of variance, software available at https://sourceforge.net/projects/canova/). In the CANOVA framework, we first defined a neighborhood for each data point related to its X value, and then calculated the variance of the Y value within the neighborhood. Finally, we performed permutations to evaluate the significance of the observed values within the neighborhood variance. To evaluate the strength of CANOVA compared to six other methods, we performed extensive simulations to explore the relationship between methods and compared the false positive rates and statistical power using both simulated and real datasets (kidney cancer RNA-seq dataset). CONCLUSIONS: We concluded that CANOVA is an efficient method for testing nonlinear correlation with several advantages in real data applications. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0697-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4539721
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45397212015-08-19 Efficient test for nonlinear dependence of two continuous variables Wang, Yi Li, Yi Cao, Hongbao Xiong, Momiao Shugart, Yin Yao Jin, Li BMC Bioinformatics Methodology Article BACKGROUND: Testing dependence/correlation of two variables is one of the fundamental tasks in statistics. In this work, we proposed a new way of testing nonlinear dependence between two continuous variables (X and Y). RESULTS: We addressed this research question by using CANOVA (continuous analysis of variance, software available at https://sourceforge.net/projects/canova/). In the CANOVA framework, we first defined a neighborhood for each data point related to its X value, and then calculated the variance of the Y value within the neighborhood. Finally, we performed permutations to evaluate the significance of the observed values within the neighborhood variance. To evaluate the strength of CANOVA compared to six other methods, we performed extensive simulations to explore the relationship between methods and compared the false positive rates and statistical power using both simulated and real datasets (kidney cancer RNA-seq dataset). CONCLUSIONS: We concluded that CANOVA is an efficient method for testing nonlinear correlation with several advantages in real data applications. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0697-7) contains supplementary material, which is available to authorized users. BioMed Central 2015-08-19 /pmc/articles/PMC4539721/ /pubmed/26283601 http://dx.doi.org/10.1186/s12859-015-0697-7 Text en © Wang et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Wang, Yi
Li, Yi
Cao, Hongbao
Xiong, Momiao
Shugart, Yin Yao
Jin, Li
Efficient test for nonlinear dependence of two continuous variables
title Efficient test for nonlinear dependence of two continuous variables
title_full Efficient test for nonlinear dependence of two continuous variables
title_fullStr Efficient test for nonlinear dependence of two continuous variables
title_full_unstemmed Efficient test for nonlinear dependence of two continuous variables
title_short Efficient test for nonlinear dependence of two continuous variables
title_sort efficient test for nonlinear dependence of two continuous variables
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539721/
https://www.ncbi.nlm.nih.gov/pubmed/26283601
http://dx.doi.org/10.1186/s12859-015-0697-7
work_keys_str_mv AT wangyi efficienttestfornonlineardependenceoftwocontinuousvariables
AT liyi efficienttestfornonlineardependenceoftwocontinuousvariables
AT caohongbao efficienttestfornonlineardependenceoftwocontinuousvariables
AT xiongmomiao efficienttestfornonlineardependenceoftwocontinuousvariables
AT shugartyinyao efficienttestfornonlineardependenceoftwocontinuousvariables
AT jinli efficienttestfornonlineardependenceoftwocontinuousvariables