Cargando…

Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification

Rapid and accurate clinical diagnosis remains challenging. A component of diagnosis tool development is the design of effective classification models with Mass spectrometry (MS) data. Some Machine Learning approaches have been investigated but these models require time-consuming preprocessing steps...

Descripción completa

Detalles Bibliográficos
Autores principales: Seddiki, Khawla, Saudemont, Philippe, Precioso, Frédéric, Ogrinc, Nina, Wisztorski, Maxence, Salzet, Michel, Fournier, Isabelle, Droit, Arnaud
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7644674/
https://www.ncbi.nlm.nih.gov/pubmed/33154370
http://dx.doi.org/10.1038/s41467-020-19354-z
_version_ 1783606502411468800
author Seddiki, Khawla
Saudemont, Philippe
Precioso, Frédéric
Ogrinc, Nina
Wisztorski, Maxence
Salzet, Michel
Fournier, Isabelle
Droit, Arnaud
author_facet Seddiki, Khawla
Saudemont, Philippe
Precioso, Frédéric
Ogrinc, Nina
Wisztorski, Maxence
Salzet, Michel
Fournier, Isabelle
Droit, Arnaud
author_sort Seddiki, Khawla
collection PubMed
description Rapid and accurate clinical diagnosis remains challenging. A component of diagnosis tool development is the design of effective classification models with Mass spectrometry (MS) data. Some Machine Learning approaches have been investigated but these models require time-consuming preprocessing steps to remove artifacts, making them unsuitable for rapid analysis. Convolutional Neural Networks (CNNs) have been found to perform well under such circumstances since they can learn representations from raw data. However, their effectiveness decreases when the number of available training samples is small, which is a common situation in medicine. In this work, we investigate transfer learning on 1D-CNNs, then we develop a cumulative learning method when transfer learning is not powerful enough. We propose to train the same model through several classification tasks over various small datasets to accumulate knowledge in the resulting representation. By using rat brain as the initial training dataset, a cumulative learning approach can have a classification accuracy exceeding 98% for 1D clinical MS-data. We show the use of cumulative learning using datasets generated in different biological contexts, on different organisms, and acquired by different instruments. Here we show a promising strategy for improving MS data classification accuracy when only small numbers of samples are available.
format Online
Article
Text
id pubmed-7644674
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-76446742020-11-10 Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification Seddiki, Khawla Saudemont, Philippe Precioso, Frédéric Ogrinc, Nina Wisztorski, Maxence Salzet, Michel Fournier, Isabelle Droit, Arnaud Nat Commun Article Rapid and accurate clinical diagnosis remains challenging. A component of diagnosis tool development is the design of effective classification models with Mass spectrometry (MS) data. Some Machine Learning approaches have been investigated but these models require time-consuming preprocessing steps to remove artifacts, making them unsuitable for rapid analysis. Convolutional Neural Networks (CNNs) have been found to perform well under such circumstances since they can learn representations from raw data. However, their effectiveness decreases when the number of available training samples is small, which is a common situation in medicine. In this work, we investigate transfer learning on 1D-CNNs, then we develop a cumulative learning method when transfer learning is not powerful enough. We propose to train the same model through several classification tasks over various small datasets to accumulate knowledge in the resulting representation. By using rat brain as the initial training dataset, a cumulative learning approach can have a classification accuracy exceeding 98% for 1D clinical MS-data. We show the use of cumulative learning using datasets generated in different biological contexts, on different organisms, and acquired by different instruments. Here we show a promising strategy for improving MS data classification accuracy when only small numbers of samples are available. Nature Publishing Group UK 2020-11-05 /pmc/articles/PMC7644674/ /pubmed/33154370 http://dx.doi.org/10.1038/s41467-020-19354-z Text en © The Author(s) 2020, corrected publication 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Seddiki, Khawla
Saudemont, Philippe
Precioso, Frédéric
Ogrinc, Nina
Wisztorski, Maxence
Salzet, Michel
Fournier, Isabelle
Droit, Arnaud
Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification
title Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification
title_full Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification
title_fullStr Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification
title_full_unstemmed Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification
title_short Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification
title_sort cumulative learning enables convolutional neural network representations for small mass spectrometry data classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7644674/
https://www.ncbi.nlm.nih.gov/pubmed/33154370
http://dx.doi.org/10.1038/s41467-020-19354-z
work_keys_str_mv AT seddikikhawla cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification
AT saudemontphilippe cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification
AT preciosofrederic cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification
AT ogrincnina cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification
AT wisztorskimaxence cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification
AT salzetmichel cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification
AT fournierisabelle cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification
AT droitarnaud cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification