Cargando…

Handling hybrid and missing data in constraint-based causal discovery to study the etiology of ADHD

Causal discovery is an increasingly important method for data analysis in the field of medical research. In this paper, we consider two challenges in causal discovery that occur very often when working with medical data: a mixture of discrete and continuous variables and a substantial amount of miss...

Descripción completa

Detalles Bibliográficos
Autores principales: Sokolova, Elena, von Rhein, Daniel, Naaijen, Jilly, Groot, Perry, Claassen, Tom, Buitelaar, Jan, Heskes, Tom
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5479362/
https://www.ncbi.nlm.nih.gov/pubmed/28691055
http://dx.doi.org/10.1007/s41060-016-0034-x
Descripción
Sumario:Causal discovery is an increasingly important method for data analysis in the field of medical research. In this paper, we consider two challenges in causal discovery that occur very often when working with medical data: a mixture of discrete and continuous variables and a substantial amount of missing values. To the best of our knowledge, there are no methods that can handle both challenges at the same time. In this paper, we develop a new method that can handle these challenges based on the assumption that data are missing at random and that continuous variables obey a non-paranormal distribution. We demonstrate the validity of our approach for causal discovery on simulated data as well as on two real-world data sets from a monetary incentive delay task and a reversal learning task. Our results help in the understanding of the etiology of attention-deficit/hyperactivity disorder (ADHD). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s41060-016-0034-x) contains supplementary material, which is available to authorized users.