Cargando…
Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features
Purpose: The purpose of this study is classifying multiple sclerosis (MS) patients in the four clinical forms as defined by the McDonald criteria using machine learning algorithms trained on clinical data combined with lesion loads and magnetic resonance metabolic features. Materials and Methods: Ei...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5504183/ https://www.ncbi.nlm.nih.gov/pubmed/28744195 http://dx.doi.org/10.3389/fnins.2017.00398 |
_version_ | 1783249234968969216 |
---|---|
author | Ion-Mărgineanu, Adrian Kocevar, Gabriel Stamile, Claudio Sima, Diana M. Durand-Dubief, Françoise Van Huffel, Sabine Sappey-Marinier, Dominique |
author_facet | Ion-Mărgineanu, Adrian Kocevar, Gabriel Stamile, Claudio Sima, Diana M. Durand-Dubief, Françoise Van Huffel, Sabine Sappey-Marinier, Dominique |
author_sort | Ion-Mărgineanu, Adrian |
collection | PubMed |
description | Purpose: The purpose of this study is classifying multiple sclerosis (MS) patients in the four clinical forms as defined by the McDonald criteria using machine learning algorithms trained on clinical data combined with lesion loads and magnetic resonance metabolic features. Materials and Methods: Eighty-seven MS patients [12 Clinically Isolated Syndrome (CIS), 30 Relapse Remitting (RR), 17 Primary Progressive (PP), and 28 Secondary Progressive (SP)] and 18 healthy controls were included in this study. Longitudinal data available for each MS patient included clinical (e.g., age, disease duration, Expanded Disability Status Scale), conventional magnetic resonance imaging and spectroscopic imaging. We extract N-acetyl-aspartate (NAA), Choline (Cho), and Creatine (Cre) concentrations, and we compute three features for each spectroscopic grid by averaging metabolite ratios (NAA/Cho, NAA/Cre, Cho/Cre) over good quality voxels. We built linear mixed-effects models to test for statistically significant differences between MS forms. We test nine binary classification tasks on clinical data, lesion loads, and metabolic features, using a leave-one-patient-out cross-validation method based on 100 random patient-based bootstrap selections. We compute F1-scores and BAR values after tuning Linear Discriminant Analysis (LDA), Support Vector Machines with gaussian kernel (SVM-rbf), and Random Forests. Results: Statistically significant differences were found between the disease starting points of each MS form using four different response variables: Lesion Load, NAA/Cre, NAA/Cho, and Cho/Cre ratios. Training SVM-rbf on clinical and lesion loads yields F1-scores of 71–72% for CIS vs. RR and CIS vs. RR+SP, respectively. For RR vs. PP we obtained good classification results (maximum F1-score of 85%) after training LDA on clinical and metabolic features, while for RR vs. SP we obtained slightly higher classification results (maximum F1-score of 87%) after training LDA and SVM-rbf on clinical, lesion loads and metabolic features. Conclusions: Our results suggest that metabolic features are better at differentiating between relapsing-remitting and primary progressive forms, while lesion loads are better at differentiating between relapsing-remitting and secondary progressive forms. Therefore, combining clinical data with magnetic resonance lesion loads and metabolic features can improve the discrimination between relapsing-remitting and progressive forms. |
format | Online Article Text |
id | pubmed-5504183 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-55041832017-07-25 Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features Ion-Mărgineanu, Adrian Kocevar, Gabriel Stamile, Claudio Sima, Diana M. Durand-Dubief, Françoise Van Huffel, Sabine Sappey-Marinier, Dominique Front Neurosci Neuroscience Purpose: The purpose of this study is classifying multiple sclerosis (MS) patients in the four clinical forms as defined by the McDonald criteria using machine learning algorithms trained on clinical data combined with lesion loads and magnetic resonance metabolic features. Materials and Methods: Eighty-seven MS patients [12 Clinically Isolated Syndrome (CIS), 30 Relapse Remitting (RR), 17 Primary Progressive (PP), and 28 Secondary Progressive (SP)] and 18 healthy controls were included in this study. Longitudinal data available for each MS patient included clinical (e.g., age, disease duration, Expanded Disability Status Scale), conventional magnetic resonance imaging and spectroscopic imaging. We extract N-acetyl-aspartate (NAA), Choline (Cho), and Creatine (Cre) concentrations, and we compute three features for each spectroscopic grid by averaging metabolite ratios (NAA/Cho, NAA/Cre, Cho/Cre) over good quality voxels. We built linear mixed-effects models to test for statistically significant differences between MS forms. We test nine binary classification tasks on clinical data, lesion loads, and metabolic features, using a leave-one-patient-out cross-validation method based on 100 random patient-based bootstrap selections. We compute F1-scores and BAR values after tuning Linear Discriminant Analysis (LDA), Support Vector Machines with gaussian kernel (SVM-rbf), and Random Forests. Results: Statistically significant differences were found between the disease starting points of each MS form using four different response variables: Lesion Load, NAA/Cre, NAA/Cho, and Cho/Cre ratios. Training SVM-rbf on clinical and lesion loads yields F1-scores of 71–72% for CIS vs. RR and CIS vs. RR+SP, respectively. For RR vs. PP we obtained good classification results (maximum F1-score of 85%) after training LDA on clinical and metabolic features, while for RR vs. SP we obtained slightly higher classification results (maximum F1-score of 87%) after training LDA and SVM-rbf on clinical, lesion loads and metabolic features. Conclusions: Our results suggest that metabolic features are better at differentiating between relapsing-remitting and primary progressive forms, while lesion loads are better at differentiating between relapsing-remitting and secondary progressive forms. Therefore, combining clinical data with magnetic resonance lesion loads and metabolic features can improve the discrimination between relapsing-remitting and progressive forms. Frontiers Media S.A. 2017-07-11 /pmc/articles/PMC5504183/ /pubmed/28744195 http://dx.doi.org/10.3389/fnins.2017.00398 Text en Copyright © 2017 Ion-Mărgineanu, Kocevar, Stamile, Sima, Durand-Dubief, Van Huffel and Sappey-Marinier. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Ion-Mărgineanu, Adrian Kocevar, Gabriel Stamile, Claudio Sima, Diana M. Durand-Dubief, Françoise Van Huffel, Sabine Sappey-Marinier, Dominique Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features |
title | Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features |
title_full | Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features |
title_fullStr | Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features |
title_full_unstemmed | Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features |
title_short | Machine Learning Approach for Classifying Multiple Sclerosis Courses by Combining Clinical Data with Lesion Loads and Magnetic Resonance Metabolic Features |
title_sort | machine learning approach for classifying multiple sclerosis courses by combining clinical data with lesion loads and magnetic resonance metabolic features |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5504183/ https://www.ncbi.nlm.nih.gov/pubmed/28744195 http://dx.doi.org/10.3389/fnins.2017.00398 |
work_keys_str_mv | AT ionmargineanuadrian machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures AT kocevargabriel machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures AT stamileclaudio machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures AT simadianam machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures AT duranddubieffrancoise machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures AT vanhuffelsabine machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures AT sappeymarinierdominique machinelearningapproachforclassifyingmultiplesclerosiscoursesbycombiningclinicaldatawithlesionloadsandmagneticresonancemetabolicfeatures |