Cargando…

Learning a confidence score and the latent space of a new supervised autoencoder for diagnosis and prognosis in clinical metabolomic studies

BACKGROUND: Presently, there is a wide variety of classification methods and deep neural network approaches in bioinformatics. Deep neural networks have proven their effectiveness for classification tasks, and have outperformed classical methods, but they suffer from a lack of interpretability. Ther...

Descripción completa

Detalles Bibliográficos
Autores principales: Chardin, David, Gille, Cyprien, Pourcher, Thierry, Humbert, Olivier, Barlaud, Michel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9434875/
https://www.ncbi.nlm.nih.gov/pubmed/36050631
http://dx.doi.org/10.1186/s12859-022-04900-x
_version_ 1784780980696907776
author Chardin, David
Gille, Cyprien
Pourcher, Thierry
Humbert, Olivier
Barlaud, Michel
author_facet Chardin, David
Gille, Cyprien
Pourcher, Thierry
Humbert, Olivier
Barlaud, Michel
author_sort Chardin, David
collection PubMed
description BACKGROUND: Presently, there is a wide variety of classification methods and deep neural network approaches in bioinformatics. Deep neural networks have proven their effectiveness for classification tasks, and have outperformed classical methods, but they suffer from a lack of interpretability. Therefore, these innovative methods are not appropriate for decision support systems in healthcare. Indeed, to allow clinicians to make informed and well thought out decisions, the algorithm should provide the main pieces of information used to compute the predicted diagnosis and/or prognosis, as well as a confidence score for this prediction. METHODS: Herein, we used a new supervised autoencoder (SAE) approach for classification of clinical metabolomic data. This new method has the advantage of providing a confidence score for each prediction thanks to a softmax classifier and a meaningful latent space visualization and to include a new efficient feature selection method, with a structured constraint, which allows for biologically interpretable results. RESULTS: Experimental results on three metabolomics datasets of clinical samples illustrate the effectiveness of our SAE and its confidence score. The supervised autoencoder provides an accurate localization of the patients in the latent space, and an efficient confidence score. Experiments show that the SAE outperforms classical methods (PLS-DA, Random Forests, SVM, and neural networks (NN)). Furthermore, the metabolites selected by the SAE were found to be biologically relevant. CONCLUSION: In this paper, we describe a new efficient SAE method to support diagnostic or prognostic evaluation based on metabolomics analyses.
format Online
Article
Text
id pubmed-9434875
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94348752022-09-02 Learning a confidence score and the latent space of a new supervised autoencoder for diagnosis and prognosis in clinical metabolomic studies Chardin, David Gille, Cyprien Pourcher, Thierry Humbert, Olivier Barlaud, Michel BMC Bioinformatics Research BACKGROUND: Presently, there is a wide variety of classification methods and deep neural network approaches in bioinformatics. Deep neural networks have proven their effectiveness for classification tasks, and have outperformed classical methods, but they suffer from a lack of interpretability. Therefore, these innovative methods are not appropriate for decision support systems in healthcare. Indeed, to allow clinicians to make informed and well thought out decisions, the algorithm should provide the main pieces of information used to compute the predicted diagnosis and/or prognosis, as well as a confidence score for this prediction. METHODS: Herein, we used a new supervised autoencoder (SAE) approach for classification of clinical metabolomic data. This new method has the advantage of providing a confidence score for each prediction thanks to a softmax classifier and a meaningful latent space visualization and to include a new efficient feature selection method, with a structured constraint, which allows for biologically interpretable results. RESULTS: Experimental results on three metabolomics datasets of clinical samples illustrate the effectiveness of our SAE and its confidence score. The supervised autoencoder provides an accurate localization of the patients in the latent space, and an efficient confidence score. Experiments show that the SAE outperforms classical methods (PLS-DA, Random Forests, SVM, and neural networks (NN)). Furthermore, the metabolites selected by the SAE were found to be biologically relevant. CONCLUSION: In this paper, we describe a new efficient SAE method to support diagnostic or prognostic evaluation based on metabolomics analyses. BioMed Central 2022-09-01 /pmc/articles/PMC9434875/ /pubmed/36050631 http://dx.doi.org/10.1186/s12859-022-04900-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Chardin, David
Gille, Cyprien
Pourcher, Thierry
Humbert, Olivier
Barlaud, Michel
Learning a confidence score and the latent space of a new supervised autoencoder for diagnosis and prognosis in clinical metabolomic studies
title Learning a confidence score and the latent space of a new supervised autoencoder for diagnosis and prognosis in clinical metabolomic studies
title_full Learning a confidence score and the latent space of a new supervised autoencoder for diagnosis and prognosis in clinical metabolomic studies
title_fullStr Learning a confidence score and the latent space of a new supervised autoencoder for diagnosis and prognosis in clinical metabolomic studies
title_full_unstemmed Learning a confidence score and the latent space of a new supervised autoencoder for diagnosis and prognosis in clinical metabolomic studies
title_short Learning a confidence score and the latent space of a new supervised autoencoder for diagnosis and prognosis in clinical metabolomic studies
title_sort learning a confidence score and the latent space of a new supervised autoencoder for diagnosis and prognosis in clinical metabolomic studies
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9434875/
https://www.ncbi.nlm.nih.gov/pubmed/36050631
http://dx.doi.org/10.1186/s12859-022-04900-x
work_keys_str_mv AT chardindavid learningaconfidencescoreandthelatentspaceofanewsupervisedautoencoderfordiagnosisandprognosisinclinicalmetabolomicstudies
AT gillecyprien learningaconfidencescoreandthelatentspaceofanewsupervisedautoencoderfordiagnosisandprognosisinclinicalmetabolomicstudies
AT pourcherthierry learningaconfidencescoreandthelatentspaceofanewsupervisedautoencoderfordiagnosisandprognosisinclinicalmetabolomicstudies
AT humbertolivier learningaconfidencescoreandthelatentspaceofanewsupervisedautoencoderfordiagnosisandprognosisinclinicalmetabolomicstudies
AT barlaudmichel learningaconfidencescoreandthelatentspaceofanewsupervisedautoencoderfordiagnosisandprognosisinclinicalmetabolomicstudies