Cargando…

Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features

Purpose: The purpose of this study is classifying multiple sclerosis (MS) patients in the four clinical forms as defined by the McDonald criteria using machine learning algorithms trained on clinical data combined with lesion loads and magnetic resonance metabolic features. Materials and Methods: Ei...

Descripción completa

Detalles Bibliográficos
Autores principales: Ion-Mărgineanu, Adrian, Kocevar, Gabriel, Stamile, Claudio, Sima, Diana M., Durand-Dubief, Françoise, Van Huffel, Sabine, Sappey-Marinier, Dominique
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5504183/
https://www.ncbi.nlm.nih.gov/pubmed/28744195
http://dx.doi.org/10.3389/fnins.2017.00398
_version_ 1783249234968969216
author Ion-Mărgineanu, Adrian
Kocevar, Gabriel
Stamile, Claudio
Sima, Diana M.
Durand-Dubief, Françoise
Van Huffel, Sabine
Sappey-Marinier, Dominique
author_facet Ion-Mărgineanu, Adrian
Kocevar, Gabriel
Stamile, Claudio
Sima, Diana M.
Durand-Dubief, Françoise
Van Huffel, Sabine
Sappey-Marinier, Dominique
author_sort Ion-Mărgineanu, Adrian
collection PubMed
description Purpose: The purpose of this study is classifying multiple sclerosis (MS) patients in the four clinical forms as defined by the McDonald criteria using machine learning algorithms trained on clinical data combined with lesion loads and magnetic resonance metabolic features. Materials and Methods: Eighty-seven MS patients [12 Clinically Isolated Syndrome (CIS), 30 Relapse Remitting (RR), 17 Primary Progressive (PP), and 28 Secondary Progressive (SP)] and 18 healthy controls were included in this study. Longitudinal data available for each MS patient included clinical (e.g., age, disease duration, Expanded Disability Status Scale), conventional magnetic resonance imaging and spectroscopic imaging. We extract N-acetyl-aspartate (NAA), Choline (Cho), and Creatine (Cre) concentrations, and we compute three features for each spectroscopic grid by averaging metabolite ratios (NAA/Cho, NAA/Cre, Cho/Cre) over good quality voxels. We built linear mixed-effects models to test for statistically significant differences between MS forms. We test nine binary classification tasks on clinical data, lesion loads, and metabolic features, using a leave-one-patient-out cross-validation method based on 100 random patient-based bootstrap selections. We compute F1-scores and BAR values after tuning Linear Discriminant Analysis (LDA), Support Vector Machines with gaussian kernel (SVM-rbf), and Random Forests. Results: Statistically significant differences were found between the disease starting points of each MS form using four different response variables: Lesion Load, NAA/Cre, NAA/Cho, and Cho/Cre ratios. Training SVM-rbf on clinical and lesion loads yields F1-scores of 71–72% for CIS vs. RR and CIS vs. RR+SP, respectively. For RR vs. PP we obtained good classification results (maximum F1-score of 85%) after training LDA on clinical and metabolic features, while for RR vs. SP we obtained slightly higher classification results (maximum F1-score of 87%) after training LDA and SVM-rbf on clinical, lesion loads and metabolic features. Conclusions: Our results suggest that metabolic features are better at differentiating between relapsing-remitting and primary progressive forms, while lesion loads are better at differentiating between relapsing-remitting and secondary progressive forms. Therefore, combining clinical data with magnetic resonance lesion loads and metabolic features can improve the discrimination between relapsing-remitting and progressive forms.
format Online
Article
Text
id pubmed-5504183
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-55041832017-07-25 Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features Ion-Mărgineanu, Adrian Kocevar, Gabriel Stamile, Claudio Sima, Diana M. Durand-Dubief, Françoise Van Huffel, Sabine Sappey-Marinier, Dominique Front Neurosci Neuroscience Purpose: The purpose of this study is classifying multiple sclerosis (MS) patients in the four clinical forms as defined by the McDonald criteria using machine learning algorithms trained on clinical data combined with lesion loads and magnetic resonance metabolic features. Materials and Methods: Eighty-seven MS patients [12 Clinically Isolated Syndrome (CIS), 30 Relapse Remitting (RR), 17 Primary Progressive (PP), and 28 Secondary Progressive (SP)] and 18 healthy controls were included in this study. Longitudinal data available for each MS patient included clinical (e.g., age, disease duration, Expanded Disability Status Scale), conventional magnetic resonance imaging and spectroscopic imaging. We extract N-acetyl-aspartate (NAA), Choline (Cho), and Creatine (Cre) concentrations, and we compute three features for each spectroscopic grid by averaging metabolite ratios (NAA/Cho, NAA/Cre, Cho/Cre) over good quality voxels. We built linear mixed-effects models to test for statistically significant differences between MS forms. We test nine binary classification tasks on clinical data, lesion loads, and metabolic features, using a leave-one-patient-out cross-validation method based on 100 random patient-based bootstrap selections. We compute F1-scores and BAR values after tuning Linear Discriminant Analysis (LDA), Support Vector Machines with gaussian kernel (SVM-rbf), and Random Forests. Results: Statistically significant differences were found between the disease starting points of each MS form using four different response variables: Lesion Load, NAA/Cre, NAA/Cho, and Cho/Cre ratios. Training SVM-rbf on clinical and lesion loads yields F1-scores of 71–72% for CIS vs. RR and CIS vs. RR+SP, respectively. For RR vs. PP we obtained good classification results (maximum F1-score of 85%) after training LDA on clinical and metabolic features, while for RR vs. SP we obtained slightly higher classification results (maximum F1-score of 87%) after training LDA and SVM-rbf on clinical, lesion loads and metabolic features. Conclusions: Our results suggest that metabolic features are better at differentiating between relapsing-remitting and primary progressive forms, while lesion loads are better at differentiating between relapsing-remitting and secondary progressive forms. Therefore, combining clinical data with magnetic resonance lesion loads and metabolic features can improve the discrimination between relapsing-remitting and progressive forms. Frontiers Media S.A. 2017-07-11 /pmc/articles/PMC5504183/ /pubmed/28744195 http://dx.doi.org/10.3389/fnins.2017.00398 Text en Copyright © 2017 Ion-Mărgineanu, Kocevar, Stamile, Sima, Durand-Dubief, Van Huffel and Sappey-Marinier. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Ion-Mărgineanu, Adrian
Kocevar, Gabriel
Stamile, Claudio
Sima, Diana M.
Durand-Dubief, Françoise
Van Huffel, Sabine
Sappey-Marinier, Dominique
Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features
title Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features
title_full Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features
title_fullStr Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features
title_full_unstemmed Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features
title_short Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features
title_sort machine learning approach for classifying multiple sclerosis courses by combining clinical data with lesion loads and magnetic resonance metabolic features
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5504183/
https://www.ncbi.nlm.nih.gov/pubmed/28744195
http://dx.doi.org/10.3389/fnins.2017.00398
work_keys_str_mv AT ionmargineanuadrian machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures
AT kocevargabriel machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures
AT stamileclaudio machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures
AT simadianam machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures
AT duranddubieffrancoise machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures
AT vanhuffelsabine machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures
AT sappeymarinierdominique machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures