Cargando…
Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification
Rapid and accurate clinical diagnosis remains challenging. A component of diagnosis tool development is the design of effective classification models with Mass spectrometry (MS) data. Some Machine Learning approaches have been investigated but these models require time-consuming preprocessing steps...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7644674/ https://www.ncbi.nlm.nih.gov/pubmed/33154370 http://dx.doi.org/10.1038/s41467-020-19354-z |
_version_ | 1783606502411468800 |
---|---|
author | Seddiki, Khawla Saudemont, Philippe Precioso, Frédéric Ogrinc, Nina Wisztorski, Maxence Salzet, Michel Fournier, Isabelle Droit, Arnaud |
author_facet | Seddiki, Khawla Saudemont, Philippe Precioso, Frédéric Ogrinc, Nina Wisztorski, Maxence Salzet, Michel Fournier, Isabelle Droit, Arnaud |
author_sort | Seddiki, Khawla |
collection | PubMed |
description | Rapid and accurate clinical diagnosis remains challenging. A component of diagnosis tool development is the design of effective classification models with Mass spectrometry (MS) data. Some Machine Learning approaches have been investigated but these models require time-consuming preprocessing steps to remove artifacts, making them unsuitable for rapid analysis. Convolutional Neural Networks (CNNs) have been found to perform well under such circumstances since they can learn representations from raw data. However, their effectiveness decreases when the number of available training samples is small, which is a common situation in medicine. In this work, we investigate transfer learning on 1D-CNNs, then we develop a cumulative learning method when transfer learning is not powerful enough. We propose to train the same model through several classification tasks over various small datasets to accumulate knowledge in the resulting representation. By using rat brain as the initial training dataset, a cumulative learning approach can have a classification accuracy exceeding 98% for 1D clinical MS-data. We show the use of cumulative learning using datasets generated in different biological contexts, on different organisms, and acquired by different instruments. Here we show a promising strategy for improving MS data classification accuracy when only small numbers of samples are available. |
format | Online Article Text |
id | pubmed-7644674 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-76446742020-11-10 Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification Seddiki, Khawla Saudemont, Philippe Precioso, Frédéric Ogrinc, Nina Wisztorski, Maxence Salzet, Michel Fournier, Isabelle Droit, Arnaud Nat Commun Article Rapid and accurate clinical diagnosis remains challenging. A component of diagnosis tool development is the design of effective classification models with Mass spectrometry (MS) data. Some Machine Learning approaches have been investigated but these models require time-consuming preprocessing steps to remove artifacts, making them unsuitable for rapid analysis. Convolutional Neural Networks (CNNs) have been found to perform well under such circumstances since they can learn representations from raw data. However, their effectiveness decreases when the number of available training samples is small, which is a common situation in medicine. In this work, we investigate transfer learning on 1D-CNNs, then we develop a cumulative learning method when transfer learning is not powerful enough. We propose to train the same model through several classification tasks over various small datasets to accumulate knowledge in the resulting representation. By using rat brain as the initial training dataset, a cumulative learning approach can have a classification accuracy exceeding 98% for 1D clinical MS-data. We show the use of cumulative learning using datasets generated in different biological contexts, on different organisms, and acquired by different instruments. Here we show a promising strategy for improving MS data classification accuracy when only small numbers of samples are available. Nature Publishing Group UK 2020-11-05 /pmc/articles/PMC7644674/ /pubmed/33154370 http://dx.doi.org/10.1038/s41467-020-19354-z Text en © The Author(s) 2020, corrected publication 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Seddiki, Khawla Saudemont, Philippe Precioso, Frédéric Ogrinc, Nina Wisztorski, Maxence Salzet, Michel Fournier, Isabelle Droit, Arnaud Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification |
title | Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification |
title_full | Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification |
title_fullStr | Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification |
title_full_unstemmed | Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification |
title_short | Cumulative learning enables convolutional neural network representations for small mass spectrometry data classification |
title_sort | cumulative learning enables convolutional neural network representations for small mass spectrometry data classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7644674/ https://www.ncbi.nlm.nih.gov/pubmed/33154370 http://dx.doi.org/10.1038/s41467-020-19354-z |
work_keys_str_mv | AT seddikikhawla cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification AT saudemontphilippe cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification AT preciosofrederic cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification AT ogrincnina cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification AT wisztorskimaxence cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification AT salzetmichel cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification AT fournierisabelle cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification AT droitarnaud cumulativelearningenablesconvolutionalneuralnetworkrepresentationsforsmallmassspectrometrydataclassification |