Cargando…

In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics

Data-independent acquisition (DIA) is an emerging technology for quantitative proteomic analysis of large cohorts of samples. However, sample-specific spectral libraries built by data-dependent acquisition (DDA) experiments are required prior to DIA analysis, which is time-consuming and limits the i...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Yi, Liu, Xiaohui, Shen, Chengpin, Lin, Yu, Yang, Pengyuan, Qiao, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6952453/
https://www.ncbi.nlm.nih.gov/pubmed/31919359
http://dx.doi.org/10.1038/s41467-019-13866-z
_version_ 1783486453216444416
author Yang, Yi
Liu, Xiaohui
Shen, Chengpin
Lin, Yu
Yang, Pengyuan
Qiao, Liang
author_facet Yang, Yi
Liu, Xiaohui
Shen, Chengpin
Lin, Yu
Yang, Pengyuan
Qiao, Liang
author_sort Yang, Yi
collection PubMed
description Data-independent acquisition (DIA) is an emerging technology for quantitative proteomic analysis of large cohorts of samples. However, sample-specific spectral libraries built by data-dependent acquisition (DDA) experiments are required prior to DIA analysis, which is time-consuming and limits the identification/quantification by DIA to the peptides identified by DDA. Herein, we propose DeepDIA, a deep learning-based approach to generate in silico spectral libraries for DIA analysis. We demonstrate that the quality of in silico libraries predicted by instrument-specific models using DeepDIA is comparable to that of experimental libraries, and outperforms libraries generated by global models. With peptide detectability prediction, in silico libraries can be built directly from protein sequence databases. We further illustrate that DeepDIA can break through the limitation of DDA on peptide/protein detection, and enhance DIA analysis on human serum samples compared to the state-of-the-art protocol using a DDA library. We expect this work expanding the toolbox for DIA proteomics.
format Online
Article
Text
id pubmed-6952453
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-69524532020-01-13 In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics Yang, Yi Liu, Xiaohui Shen, Chengpin Lin, Yu Yang, Pengyuan Qiao, Liang Nat Commun Article Data-independent acquisition (DIA) is an emerging technology for quantitative proteomic analysis of large cohorts of samples. However, sample-specific spectral libraries built by data-dependent acquisition (DDA) experiments are required prior to DIA analysis, which is time-consuming and limits the identification/quantification by DIA to the peptides identified by DDA. Herein, we propose DeepDIA, a deep learning-based approach to generate in silico spectral libraries for DIA analysis. We demonstrate that the quality of in silico libraries predicted by instrument-specific models using DeepDIA is comparable to that of experimental libraries, and outperforms libraries generated by global models. With peptide detectability prediction, in silico libraries can be built directly from protein sequence databases. We further illustrate that DeepDIA can break through the limitation of DDA on peptide/protein detection, and enhance DIA analysis on human serum samples compared to the state-of-the-art protocol using a DDA library. We expect this work expanding the toolbox for DIA proteomics. Nature Publishing Group UK 2020-01-09 /pmc/articles/PMC6952453/ /pubmed/31919359 http://dx.doi.org/10.1038/s41467-019-13866-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Yang, Yi
Liu, Xiaohui
Shen, Chengpin
Lin, Yu
Yang, Pengyuan
Qiao, Liang
In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics
title In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics
title_full In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics
title_fullStr In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics
title_full_unstemmed In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics
title_short In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics
title_sort in silico spectral libraries by deep learning facilitate data-independent acquisition proteomics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6952453/
https://www.ncbi.nlm.nih.gov/pubmed/31919359
http://dx.doi.org/10.1038/s41467-019-13866-z
work_keys_str_mv AT yangyi insilicospectrallibrariesbydeeplearningfacilitatedataindependentacquisitionproteomics
AT liuxiaohui insilicospectrallibrariesbydeeplearningfacilitatedataindependentacquisitionproteomics
AT shenchengpin insilicospectrallibrariesbydeeplearningfacilitatedataindependentacquisitionproteomics
AT linyu insilicospectrallibrariesbydeeplearningfacilitatedataindependentacquisitionproteomics
AT yangpengyuan insilicospectrallibrariesbydeeplearningfacilitatedataindependentacquisitionproteomics
AT qiaoliang insilicospectrallibrariesbydeeplearningfacilitatedataindependentacquisitionproteomics