Cargando…

S12. A MACHINE LEARNING FRAMEWORK FOR ROBUST AND RELIABLE PREDICTION OF SHORT- AND LONG-TERM CLINICAL RESPONSE IN INITIALLY ANTIPSYCHOTIC-NAïVE SCHIZOPHRENIA PATIENTS BASED ON MULTIMODAL NEUROPSYCHIATRIC DATA

BACKGROUND: The treatment response of patients with schizophrenia is heterogeneous, and markers of clinical response are missing. Studies using machine learning approaches have provided encouraging results regarding prediction of outcomes, but replicability has been challenging. In the present study...

Descripción completa

Detalles Bibliográficos
Autores principales: Ambrosen, Karen S, Skjerbæk, Martin W, Foldager, Jonathan, Axelsen, Martin C, Bak, Nikolaj, Arvastson, Lars, Christensen, Søren R, Johansen, Louise B, Raghava, Jayachandra M, Oranje, Bob, Rostrup, Egill, Nielsen, Mette Ø, Osler, Merete, Fagerlund, Birgitte, Pantelis, Christos, Kinon, Bruce J, Glenthøj, Birte Y, Hansen, Lars K, Ebdrup, Bjørn H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7234015/
http://dx.doi.org/10.1093/schbul/sbaa031.078
Descripción
Sumario:BACKGROUND: The treatment response of patients with schizophrenia is heterogeneous, and markers of clinical response are missing. Studies using machine learning approaches have provided encouraging results regarding prediction of outcomes, but replicability has been challenging. In the present study, we present a novel methodological framework for applying machine learning to clinical data. Herein, algorithm selection and other methodological choices were based on model performance on a simulated dataset, to minimize bias and avoid overfitting. We subsequently applied the best performing machine learning algorithm to a rich, multimodal neuropsychiatric dataset. We aimed to 1) classify patients from controls, 2) predict short- and long-term clinical response in a sample of initially antipsychotic-naïve first-episode schizophrenia patients, and 3) validate our methodological framework. METHODS: We included data from 138 antipsychotic-naïve, first-episode schizophrenia patients, who had undergone assessments of psychopathology, cognition, electrophysiology, structural magnetic resonance imaging (MRI). Perinatal data and long-term outcome measures were obtained from Danish registers. Baseline diagnostic classification algorithms also included data from 151 matched healthy controls. Short-term treatment response was defined as change in psychopathology after the initial antipsychotic treatment period. Long-term treatment response (4–16 years) was based on data from Danish registers. The simulated dataset was generated to resemble the real data with respect to dimensionality, multimodality, and pattern of missing data. Noise levels were tunable to enable approximation to the signal-to-noise ratio in the real data. Robustness of the results was ensured by running two parallel, fundamentally different machine learning pipelines, a ‘single algorithm approach’ and an ‘ensemble approach’. Both pipelines included nested cross-validation, missing data imputation, and late integration. RESULTS: We significantly classified patients from controls with a balanced accuracy of 64.2% (95% CI = [51.7, 76.7]) for the single algorithm approach and 63.1% (95% CI = [50.4, 75.8]) for the ensemble approach. Post hoc analyses showed that the classification primarily was driven by the cognitive data. Neither approach predicted short- and long-term clinical response. To validate our methodological framework based on simulated data, we selected the best, a medium, and the most poorly performing algorithm on the simulated data and applied them to the real data. We found that the ranking of the algorithms was kept in the real data. DISCUSSION: Our rigorous modelling framework incorporating simulated data and parallel pipelines discriminated patients from controls, but our extensive, multimodal neuropsychiatric data from antipsychotic-naïve schizophrenia patients were not predictive of the clinical outcome. Nevertheless, our novel approach holds promise as an important step to obtain reliable, unbiased results with modest sample sizes when independent replication samples are not available.