Cargando…

MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks

BACKGROUND: Mass spectrometry (MS) has become a promising analytical technique to acquire proteomics information for the characterization of biological samples. Nevertheless, most studies focus on the final proteins identified through a suite of algorithms by using partial MS spectra to compare with...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Shisheng, Zhu, Hongwen, Zhou, Hu, Cheng, Jingqiu, Yang, Hao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7539376/
https://www.ncbi.nlm.nih.gov/pubmed/33028193
http://dx.doi.org/10.1186/s12859-020-03783-0
_version_ 1783591042318073856
author Wang, Shisheng
Zhu, Hongwen
Zhou, Hu
Cheng, Jingqiu
Yang, Hao
author_facet Wang, Shisheng
Zhu, Hongwen
Zhou, Hu
Cheng, Jingqiu
Yang, Hao
author_sort Wang, Shisheng
collection PubMed
description BACKGROUND: Mass spectrometry (MS) has become a promising analytical technique to acquire proteomics information for the characterization of biological samples. Nevertheless, most studies focus on the final proteins identified through a suite of algorithms by using partial MS spectra to compare with the sequence database, while the pattern recognition and classification of raw mass-spectrometric data remain unresolved. RESULTS: We developed an open-source and comprehensive platform, named MSpectraAI, for analyzing large-scale MS data through deep neural networks (DNNs); this system involves spectral-feature swath extraction, classification, and visualization. Moreover, this platform allows users to create their own DNN model by using Keras. To evaluate this tool, we collected the publicly available proteomics datasets of six tumor types (a total of 7,997,805 mass spectra) from the ProteomeXchange consortium and classified the samples based on the spectra profiling. The results suggest that MSpectraAI can distinguish different types of samples based on the fingerprint spectrum and achieve better prediction accuracy in MS1 level (average 0.967). CONCLUSION: This study deciphers proteome profiling of raw mass spectrometry data and broadens the promising application of the classification and prediction of proteomics data from multi-tumor samples using deep learning methods. MSpectraAI also shows a better performance compared to the other classical machine learning approaches.
format Online
Article
Text
id pubmed-7539376
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-75393762020-10-08 MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks Wang, Shisheng Zhu, Hongwen Zhou, Hu Cheng, Jingqiu Yang, Hao BMC Bioinformatics Software BACKGROUND: Mass spectrometry (MS) has become a promising analytical technique to acquire proteomics information for the characterization of biological samples. Nevertheless, most studies focus on the final proteins identified through a suite of algorithms by using partial MS spectra to compare with the sequence database, while the pattern recognition and classification of raw mass-spectrometric data remain unresolved. RESULTS: We developed an open-source and comprehensive platform, named MSpectraAI, for analyzing large-scale MS data through deep neural networks (DNNs); this system involves spectral-feature swath extraction, classification, and visualization. Moreover, this platform allows users to create their own DNN model by using Keras. To evaluate this tool, we collected the publicly available proteomics datasets of six tumor types (a total of 7,997,805 mass spectra) from the ProteomeXchange consortium and classified the samples based on the spectra profiling. The results suggest that MSpectraAI can distinguish different types of samples based on the fingerprint spectrum and achieve better prediction accuracy in MS1 level (average 0.967). CONCLUSION: This study deciphers proteome profiling of raw mass spectrometry data and broadens the promising application of the classification and prediction of proteomics data from multi-tumor samples using deep learning methods. MSpectraAI also shows a better performance compared to the other classical machine learning approaches. BioMed Central 2020-10-07 /pmc/articles/PMC7539376/ /pubmed/33028193 http://dx.doi.org/10.1186/s12859-020-03783-0 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Wang, Shisheng
Zhu, Hongwen
Zhou, Hu
Cheng, Jingqiu
Yang, Hao
MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks
title MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks
title_full MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks
title_fullStr MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks
title_full_unstemmed MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks
title_short MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks
title_sort mspectraai: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7539376/
https://www.ncbi.nlm.nih.gov/pubmed/33028193
http://dx.doi.org/10.1186/s12859-020-03783-0
work_keys_str_mv AT wangshisheng mspectraaiapowerfulplatformfordecipheringproteomeprofilingofmultitumormassspectrometrydatabyusingdeepneuralnetworks
AT zhuhongwen mspectraaiapowerfulplatformfordecipheringproteomeprofilingofmultitumormassspectrometrydatabyusingdeepneuralnetworks
AT zhouhu mspectraaiapowerfulplatformfordecipheringproteomeprofilingofmultitumormassspectrometrydatabyusingdeepneuralnetworks
AT chengjingqiu mspectraaiapowerfulplatformfordecipheringproteomeprofilingofmultitumormassspectrometrydatabyusingdeepneuralnetworks
AT yanghao mspectraaiapowerfulplatformfordecipheringproteomeprofilingofmultitumormassspectrometrydatabyusingdeepneuralnetworks