Cargando…

Handling hybrid and missing data in constraint-based causal discovery to study the etiology of ADHD

Causal discovery is an increasingly important method for data analysis in the field of medical research. In this paper, we consider two challenges in causal discovery that occur very often when working with medical data: a mixture of discrete and continuous variables and a substantial amount of miss...

Descripción completa

Detalles Bibliográficos
Autores principales: Sokolova, Elena, von Rhein, Daniel, Naaijen, Jilly, Groot, Perry, Claassen, Tom, Buitelaar, Jan, Heskes, Tom
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5479362/
https://www.ncbi.nlm.nih.gov/pubmed/28691055
http://dx.doi.org/10.1007/s41060-016-0034-x
_version_ 1783245118154735616
author Sokolova, Elena
von Rhein, Daniel
Naaijen, Jilly
Groot, Perry
Claassen, Tom
Buitelaar, Jan
Heskes, Tom
author_facet Sokolova, Elena
von Rhein, Daniel
Naaijen, Jilly
Groot, Perry
Claassen, Tom
Buitelaar, Jan
Heskes, Tom
author_sort Sokolova, Elena
collection PubMed
description Causal discovery is an increasingly important method for data analysis in the field of medical research. In this paper, we consider two challenges in causal discovery that occur very often when working with medical data: a mixture of discrete and continuous variables and a substantial amount of missing values. To the best of our knowledge, there are no methods that can handle both challenges at the same time. In this paper, we develop a new method that can handle these challenges based on the assumption that data are missing at random and that continuous variables obey a non-paranormal distribution. We demonstrate the validity of our approach for causal discovery on simulated data as well as on two real-world data sets from a monetary incentive delay task and a reversal learning task. Our results help in the understanding of the etiology of attention-deficit/hyperactivity disorder (ADHD). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s41060-016-0034-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5479362
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-54793622017-07-06 Handling hybrid and missing data in constraint-based causal discovery to study the etiology of ADHD Sokolova, Elena von Rhein, Daniel Naaijen, Jilly Groot, Perry Claassen, Tom Buitelaar, Jan Heskes, Tom Int J Data Sci Anal Regular Paper Causal discovery is an increasingly important method for data analysis in the field of medical research. In this paper, we consider two challenges in causal discovery that occur very often when working with medical data: a mixture of discrete and continuous variables and a substantial amount of missing values. To the best of our knowledge, there are no methods that can handle both challenges at the same time. In this paper, we develop a new method that can handle these challenges based on the assumption that data are missing at random and that continuous variables obey a non-paranormal distribution. We demonstrate the validity of our approach for causal discovery on simulated data as well as on two real-world data sets from a monetary incentive delay task and a reversal learning task. Our results help in the understanding of the etiology of attention-deficit/hyperactivity disorder (ADHD). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s41060-016-0034-x) contains supplementary material, which is available to authorized users. Springer International Publishing 2016-12-02 2017 /pmc/articles/PMC5479362/ /pubmed/28691055 http://dx.doi.org/10.1007/s41060-016-0034-x Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Regular Paper
Sokolova, Elena
von Rhein, Daniel
Naaijen, Jilly
Groot, Perry
Claassen, Tom
Buitelaar, Jan
Heskes, Tom
Handling hybrid and missing data in constraint-based causal discovery to study the etiology of ADHD
title Handling hybrid and missing data in constraint-based causal discovery to study the etiology of ADHD
title_full Handling hybrid and missing data in constraint-based causal discovery to study the etiology of ADHD
title_fullStr Handling hybrid and missing data in constraint-based causal discovery to study the etiology of ADHD
title_full_unstemmed Handling hybrid and missing data in constraint-based causal discovery to study the etiology of ADHD
title_short Handling hybrid and missing data in constraint-based causal discovery to study the etiology of ADHD
title_sort handling hybrid and missing data in constraint-based causal discovery to study the etiology of adhd
topic Regular Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5479362/
https://www.ncbi.nlm.nih.gov/pubmed/28691055
http://dx.doi.org/10.1007/s41060-016-0034-x
work_keys_str_mv AT sokolovaelena handlinghybridandmissingdatainconstraintbasedcausaldiscoverytostudytheetiologyofadhd
AT vonrheindaniel handlinghybridandmissingdatainconstraintbasedcausaldiscoverytostudytheetiologyofadhd
AT naaijenjilly handlinghybridandmissingdatainconstraintbasedcausaldiscoverytostudytheetiologyofadhd
AT grootperry handlinghybridandmissingdatainconstraintbasedcausaldiscoverytostudytheetiologyofadhd
AT claassentom handlinghybridandmissingdatainconstraintbasedcausaldiscoverytostudytheetiologyofadhd
AT buitelaarjan handlinghybridandmissingdatainconstraintbasedcausaldiscoverytostudytheetiologyofadhd
AT heskestom handlinghybridandmissingdatainconstraintbasedcausaldiscoverytostudytheetiologyofadhd