Cargando…

Considering patient clinical history impacts performance of machine learning models in predicting course of multiple sclerosis

Multiple Sclerosis (MS) progresses at an unpredictable rate, but predictions on the disease course in each patient would be extremely useful to tailor therapy to the individual needs. We explore different machine learning (ML) approaches to predict whether a patient will shift from the initial Relap...

Descripción completa

Detalles Bibliográficos
Autores principales: Seccia, Ruggiero, Gammelli, Daniele, Dominici, Fabio, Romano, Silvia, Landi, Anna Chiara, Salvetti, Marco, Tacchella, Andrea, Zaccaria, Andrea, Crisanti, Andrea, Grassi, Francesca, Palagi, Laura
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7083323/
https://www.ncbi.nlm.nih.gov/pubmed/32196512
http://dx.doi.org/10.1371/journal.pone.0230219
_version_ 1783508508283502592
author Seccia, Ruggiero
Gammelli, Daniele
Dominici, Fabio
Romano, Silvia
Landi, Anna Chiara
Salvetti, Marco
Tacchella, Andrea
Zaccaria, Andrea
Crisanti, Andrea
Grassi, Francesca
Palagi, Laura
author_facet Seccia, Ruggiero
Gammelli, Daniele
Dominici, Fabio
Romano, Silvia
Landi, Anna Chiara
Salvetti, Marco
Tacchella, Andrea
Zaccaria, Andrea
Crisanti, Andrea
Grassi, Francesca
Palagi, Laura
author_sort Seccia, Ruggiero
collection PubMed
description Multiple Sclerosis (MS) progresses at an unpredictable rate, but predictions on the disease course in each patient would be extremely useful to tailor therapy to the individual needs. We explore different machine learning (ML) approaches to predict whether a patient will shift from the initial Relapsing-Remitting (RR) to the Secondary Progressive (SP) form of the disease, using only “real world” data available in clinical routine. The clinical records of 1624 outpatients (207 in the SP phase) attending the MS service of Sant’Andrea hospital, Rome, Italy, were used. Predictions at 180, 360 or 720 days from the last visit were obtained considering either the data of the last available visit (Visit-Oriented setting), comparing four classical ML methods (Random Forest, Support Vector Machine, K-Nearest Neighbours and AdaBoost) or the whole clinical history of each patient (History-Oriented setting), using a Recurrent Neural Network model, specifically designed for historical data. Missing values were handled by removing either all clinical records presenting at least one missing parameter (Feature-saving approach) or the 3 clinical parameters which contained missing values (Record-saving approach). The performances of the classifiers were rated using common indicators, such as Recall (or Sensitivity) and Precision (or Positive predictive value). In the visit-oriented setting, the Record-saving approach yielded Recall values from 70% to 100%, but low Precision (5% to 10%), which however increased to 50% when considering only predictions for which the model returned a probability above a given “confidence threshold”. For the History-oriented setting, both indicators increased as prediction time lengthened, reaching values of 67% (Recall) and 42% (Precision) at 720 days. We show how “real world” data can be effectively used to forecast the evolution of MS, leading to high Recall values and propose innovative approaches to improve Precision towards clinically useful values.
format Online
Article
Text
id pubmed-7083323
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70833232020-03-30 Considering patient clinical history impacts performance of machine learning models in predicting course of multiple sclerosis Seccia, Ruggiero Gammelli, Daniele Dominici, Fabio Romano, Silvia Landi, Anna Chiara Salvetti, Marco Tacchella, Andrea Zaccaria, Andrea Crisanti, Andrea Grassi, Francesca Palagi, Laura PLoS One Research Article Multiple Sclerosis (MS) progresses at an unpredictable rate, but predictions on the disease course in each patient would be extremely useful to tailor therapy to the individual needs. We explore different machine learning (ML) approaches to predict whether a patient will shift from the initial Relapsing-Remitting (RR) to the Secondary Progressive (SP) form of the disease, using only “real world” data available in clinical routine. The clinical records of 1624 outpatients (207 in the SP phase) attending the MS service of Sant’Andrea hospital, Rome, Italy, were used. Predictions at 180, 360 or 720 days from the last visit were obtained considering either the data of the last available visit (Visit-Oriented setting), comparing four classical ML methods (Random Forest, Support Vector Machine, K-Nearest Neighbours and AdaBoost) or the whole clinical history of each patient (History-Oriented setting), using a Recurrent Neural Network model, specifically designed for historical data. Missing values were handled by removing either all clinical records presenting at least one missing parameter (Feature-saving approach) or the 3 clinical parameters which contained missing values (Record-saving approach). The performances of the classifiers were rated using common indicators, such as Recall (or Sensitivity) and Precision (or Positive predictive value). In the visit-oriented setting, the Record-saving approach yielded Recall values from 70% to 100%, but low Precision (5% to 10%), which however increased to 50% when considering only predictions for which the model returned a probability above a given “confidence threshold”. For the History-oriented setting, both indicators increased as prediction time lengthened, reaching values of 67% (Recall) and 42% (Precision) at 720 days. We show how “real world” data can be effectively used to forecast the evolution of MS, leading to high Recall values and propose innovative approaches to improve Precision towards clinically useful values. Public Library of Science 2020-03-20 /pmc/articles/PMC7083323/ /pubmed/32196512 http://dx.doi.org/10.1371/journal.pone.0230219 Text en © 2020 Seccia et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Seccia, Ruggiero
Gammelli, Daniele
Dominici, Fabio
Romano, Silvia
Landi, Anna Chiara
Salvetti, Marco
Tacchella, Andrea
Zaccaria, Andrea
Crisanti, Andrea
Grassi, Francesca
Palagi, Laura
Considering patient clinical history impacts performance of machine learning models in predicting course of multiple sclerosis
title Considering patient clinical history impacts performance of machine learning models in predicting course of multiple sclerosis
title_full Considering patient clinical history impacts performance of machine learning models in predicting course of multiple sclerosis
title_fullStr Considering patient clinical history impacts performance of machine learning models in predicting course of multiple sclerosis
title_full_unstemmed Considering patient clinical history impacts performance of machine learning models in predicting course of multiple sclerosis
title_short Considering patient clinical history impacts performance of machine learning models in predicting course of multiple sclerosis
title_sort considering patient clinical history impacts performance of machine learning models in predicting course of multiple sclerosis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7083323/
https://www.ncbi.nlm.nih.gov/pubmed/32196512
http://dx.doi.org/10.1371/journal.pone.0230219
work_keys_str_mv AT secciaruggiero consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis
AT gammellidaniele consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis
AT dominicifabio consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis
AT romanosilvia consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis
AT landiannachiara consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis
AT salvettimarco consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis
AT tacchellaandrea consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis
AT zaccariaandrea consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis
AT crisantiandrea consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis
AT grassifrancesca consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis
AT palagilaura consideringpatientclinicalhistoryimpactsperformanceofmachinelearningmodelsinpredictingcourseofmultiplesclerosis