Cargando…

Robust data‐driven identification of risk factors and their interactions: A simulation and a study of parental and demographic risk factors for schizophrenia

OBJECTIVES: Few interactions between risk factors for schizophrenia have been replicated, but fitting all such interactions is difficult due to high‐dimensionality. Our aims are to examine significant main and interaction effects for schizophrenia and the performance of our approach using simulated...

Descripción completa

Detalles Bibliográficos
Autores principales: Gyllenberg, David, McKeague, Ian W., Sourander, Andre, Brown, Alan S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7723216/
https://www.ncbi.nlm.nih.gov/pubmed/32520440
http://dx.doi.org/10.1002/mpr.1834
_version_ 1783620297260269568
author Gyllenberg, David
McKeague, Ian W.
Sourander, Andre
Brown, Alan S.
author_facet Gyllenberg, David
McKeague, Ian W.
Sourander, Andre
Brown, Alan S.
author_sort Gyllenberg, David
collection PubMed
description OBJECTIVES: Few interactions between risk factors for schizophrenia have been replicated, but fitting all such interactions is difficult due to high‐dimensionality. Our aims are to examine significant main and interaction effects for schizophrenia and the performance of our approach using simulated data. METHODS: We apply the machine learning technique elastic net to a high‐dimensional logistic regression model to produce a sparse set of predictors, and then assess the significance of odds ratios (OR) with Bonferroni‐corrected p‐values and confidence intervals (CI). We introduce a simulation model that resembles a Finnish nested case–control study of schizophrenia which uses national registers to identify cases (n = 1,468) and controls (n = 2,975). The predictors include nine sociodemographic factors and all interactions (31 predictors). RESULTS: In the simulation, interactions with OR = 3 and prevalence = 4% were identified with <5% false positive rate and ≥80% power. None of the studied interactions were significantly associated with schizophrenia, but main effects of parental psychosis (OR = 5.2, CI 2.9–9.7; p < .001), urbanicity (1.3, 1.1–1.7; p = .001), and paternal age ≥35 (1.3, 1.004–1.6; p = .04) were significant. CONCLUSIONS: We have provided an analytic pipeline for data‐driven identification of main and interaction effects in case–control data. We identified highly replicated main effects for schizophrenia, but no interactions.
format Online
Article
Text
id pubmed-7723216
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-77232162020-12-11 Robust data‐driven identification of risk factors and their interactions: A simulation and a study of parental and demographic risk factors for schizophrenia Gyllenberg, David McKeague, Ian W. Sourander, Andre Brown, Alan S. Int J Methods Psychiatr Res Original Articles OBJECTIVES: Few interactions between risk factors for schizophrenia have been replicated, but fitting all such interactions is difficult due to high‐dimensionality. Our aims are to examine significant main and interaction effects for schizophrenia and the performance of our approach using simulated data. METHODS: We apply the machine learning technique elastic net to a high‐dimensional logistic regression model to produce a sparse set of predictors, and then assess the significance of odds ratios (OR) with Bonferroni‐corrected p‐values and confidence intervals (CI). We introduce a simulation model that resembles a Finnish nested case–control study of schizophrenia which uses national registers to identify cases (n = 1,468) and controls (n = 2,975). The predictors include nine sociodemographic factors and all interactions (31 predictors). RESULTS: In the simulation, interactions with OR = 3 and prevalence = 4% were identified with <5% false positive rate and ≥80% power. None of the studied interactions were significantly associated with schizophrenia, but main effects of parental psychosis (OR = 5.2, CI 2.9–9.7; p < .001), urbanicity (1.3, 1.1–1.7; p = .001), and paternal age ≥35 (1.3, 1.004–1.6; p = .04) were significant. CONCLUSIONS: We have provided an analytic pipeline for data‐driven identification of main and interaction effects in case–control data. We identified highly replicated main effects for schizophrenia, but no interactions. John Wiley and Sons Inc. 2020-06-10 /pmc/articles/PMC7723216/ /pubmed/32520440 http://dx.doi.org/10.1002/mpr.1834 Text en © 2020 The Authors. International Journal of Methods in Psychiatric Research Published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Articles
Gyllenberg, David
McKeague, Ian W.
Sourander, Andre
Brown, Alan S.
Robust data‐driven identification of risk factors and their interactions: A simulation and a study of parental and demographic risk factors for schizophrenia
title Robust data‐driven identification of risk factors and their interactions: A simulation and a study of parental and demographic risk factors for schizophrenia
title_full Robust data‐driven identification of risk factors and their interactions: A simulation and a study of parental and demographic risk factors for schizophrenia
title_fullStr Robust data‐driven identification of risk factors and their interactions: A simulation and a study of parental and demographic risk factors for schizophrenia
title_full_unstemmed Robust data‐driven identification of risk factors and their interactions: A simulation and a study of parental and demographic risk factors for schizophrenia
title_short Robust data‐driven identification of risk factors and their interactions: A simulation and a study of parental and demographic risk factors for schizophrenia
title_sort robust data‐driven identification of risk factors and their interactions: a simulation and a study of parental and demographic risk factors for schizophrenia
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7723216/
https://www.ncbi.nlm.nih.gov/pubmed/32520440
http://dx.doi.org/10.1002/mpr.1834
work_keys_str_mv AT gyllenbergdavid robustdatadrivenidentificationofriskfactorsandtheirinteractionsasimulationandastudyofparentalanddemographicriskfactorsforschizophrenia
AT mckeagueianw robustdatadrivenidentificationofriskfactorsandtheirinteractionsasimulationandastudyofparentalanddemographicriskfactorsforschizophrenia
AT souranderandre robustdatadrivenidentificationofriskfactorsandtheirinteractionsasimulationandastudyofparentalanddemographicriskfactorsforschizophrenia
AT brownalans robustdatadrivenidentificationofriskfactorsandtheirinteractionsasimulationandastudyofparentalanddemographicriskfactorsforschizophrenia