Cargando…

Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies

Multicenter studies are needed to demonstrate the clinical potential value of radiomics as a prognostic tool. However, variability in scanner models, acquisition protocols and reconstruction settings are unavoidable and radiomic features are notoriously sensitive to these factors, which hinders pool...

Descripción completa

Detalles Bibliográficos
Autores principales: Da-ano, R., Masson, I., Lucia, F., Doré, M., Robin, P., Alfieri, J., Rousseau, C., Mervoyer, A., Reinhold, C., Castelli, J., De Crevoisier, R., Rameé, J. F., Pradier, O., Schick, U., Visvikis, D., Hatt, M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7314795/
https://www.ncbi.nlm.nih.gov/pubmed/32581221
http://dx.doi.org/10.1038/s41598-020-66110-w
_version_ 1783550131928301568
author Da-ano, R.
Masson, I.
Lucia, F.
Doré, M.
Robin, P.
Alfieri, J.
Rousseau, C.
Mervoyer, A.
Reinhold, C.
Castelli, J.
De Crevoisier, R.
Rameé, J. F.
Pradier, O.
Schick, U.
Visvikis, D.
Hatt, M.
author_facet Da-ano, R.
Masson, I.
Lucia, F.
Doré, M.
Robin, P.
Alfieri, J.
Rousseau, C.
Mervoyer, A.
Reinhold, C.
Castelli, J.
De Crevoisier, R.
Rameé, J. F.
Pradier, O.
Schick, U.
Visvikis, D.
Hatt, M.
author_sort Da-ano, R.
collection PubMed
description Multicenter studies are needed to demonstrate the clinical potential value of radiomics as a prognostic tool. However, variability in scanner models, acquisition protocols and reconstruction settings are unavoidable and radiomic features are notoriously sensitive to these factors, which hinders pooling them in a statistical analysis. A statistical harmonization method called ComBat was developed to deal with the “batch effect” in gene expression microarray data and was used in radiomics studies to deal with the “center-effect”. Our goal was to evaluate modifications in ComBat allowing for more flexibility in choosing a reference and improving robustness of the estimation. Two modified ComBat versions were evaluated: M-ComBat allows to transform all features distributions to a chosen reference, instead of the overall mean, providing more flexibility. B-ComBat adds bootstrap and Monte Carlo for improved robustness in the estimation. BM-ComBat combines both modifications. The four versions were compared regarding their ability to harmonize features in a multicenter context in two different clinical datasets. The first contains 119 locally advanced cervical cancer patients from 3 centers, with magnetic resonance imaging and positron emission tomography imaging. In that case ComBat was applied with 3 labels corresponding to each center. The second one contains 98 locally advanced laryngeal cancer patients from 5 centers with contrast-enhanced computed tomography. In that specific case, because imaging settings were highly heterogeneous even within each of the five centers, unsupervised clustering was used to determine two labels for applying ComBat. The impact of each harmonization was evaluated through three different machine learning pipelines for the modelling step in predicting the clinical outcomes, across two performance metrics (balanced accuracy and Matthews correlation coefficient). Before harmonization, almost all radiomic features had significantly different distributions between labels. These differences were successfully removed with all ComBat versions. The predictive ability of the radiomic models was always improved with harmonization and the improved ComBat provided the best results. This was observed consistently in both datasets, through all machine learning pipelines and performance metrics. The proposed modifications allow for more flexibility and robustness in the estimation. They also slightly but consistently improve the predictive power of resulting radiomic models.
format Online
Article
Text
id pubmed-7314795
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73147952020-06-26 Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies Da-ano, R. Masson, I. Lucia, F. Doré, M. Robin, P. Alfieri, J. Rousseau, C. Mervoyer, A. Reinhold, C. Castelli, J. De Crevoisier, R. Rameé, J. F. Pradier, O. Schick, U. Visvikis, D. Hatt, M. Sci Rep Article Multicenter studies are needed to demonstrate the clinical potential value of radiomics as a prognostic tool. However, variability in scanner models, acquisition protocols and reconstruction settings are unavoidable and radiomic features are notoriously sensitive to these factors, which hinders pooling them in a statistical analysis. A statistical harmonization method called ComBat was developed to deal with the “batch effect” in gene expression microarray data and was used in radiomics studies to deal with the “center-effect”. Our goal was to evaluate modifications in ComBat allowing for more flexibility in choosing a reference and improving robustness of the estimation. Two modified ComBat versions were evaluated: M-ComBat allows to transform all features distributions to a chosen reference, instead of the overall mean, providing more flexibility. B-ComBat adds bootstrap and Monte Carlo for improved robustness in the estimation. BM-ComBat combines both modifications. The four versions were compared regarding their ability to harmonize features in a multicenter context in two different clinical datasets. The first contains 119 locally advanced cervical cancer patients from 3 centers, with magnetic resonance imaging and positron emission tomography imaging. In that case ComBat was applied with 3 labels corresponding to each center. The second one contains 98 locally advanced laryngeal cancer patients from 5 centers with contrast-enhanced computed tomography. In that specific case, because imaging settings were highly heterogeneous even within each of the five centers, unsupervised clustering was used to determine two labels for applying ComBat. The impact of each harmonization was evaluated through three different machine learning pipelines for the modelling step in predicting the clinical outcomes, across two performance metrics (balanced accuracy and Matthews correlation coefficient). Before harmonization, almost all radiomic features had significantly different distributions between labels. These differences were successfully removed with all ComBat versions. The predictive ability of the radiomic models was always improved with harmonization and the improved ComBat provided the best results. This was observed consistently in both datasets, through all machine learning pipelines and performance metrics. The proposed modifications allow for more flexibility and robustness in the estimation. They also slightly but consistently improve the predictive power of resulting radiomic models. Nature Publishing Group UK 2020-06-24 /pmc/articles/PMC7314795/ /pubmed/32581221 http://dx.doi.org/10.1038/s41598-020-66110-w Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Da-ano, R.
Masson, I.
Lucia, F.
Doré, M.
Robin, P.
Alfieri, J.
Rousseau, C.
Mervoyer, A.
Reinhold, C.
Castelli, J.
De Crevoisier, R.
Rameé, J. F.
Pradier, O.
Schick, U.
Visvikis, D.
Hatt, M.
Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies
title Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies
title_full Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies
title_fullStr Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies
title_full_unstemmed Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies
title_short Performance comparison of modified ComBat for harmonization of radiomic features for multicenter studies
title_sort performance comparison of modified combat for harmonization of radiomic features for multicenter studies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7314795/
https://www.ncbi.nlm.nih.gov/pubmed/32581221
http://dx.doi.org/10.1038/s41598-020-66110-w
work_keys_str_mv AT daanor performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT massoni performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT luciaf performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT dorem performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT robinp performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT alfierij performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT rousseauc performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT mervoyera performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT reinholdc performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT castellij performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT decrevoisierr performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT rameejf performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT pradiero performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT schicku performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT visvikisd performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies
AT hattm performancecomparisonofmodifiedcombatforharmonizationofradiomicfeaturesformulticenterstudies