Cargando…

Parallel Factor Analysis Enables Quantification and Identification of Highly Convolved Data-Independent-Acquired Protein Spectra

High-throughput data-independent acquisition (DIA) is the method of choice for quantitative proteomics, combining the best practices of targeted and shotgun approaches. The resultant DIA spectra are, however, highly convolved and with no direct precursor-fragment correspondence, complicating biologi...

Descripción completa

Detalles Bibliográficos
Autores principales: Buric, Filip, Zrimec, Jan, Zelezniak, Aleksej
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7733873/
https://www.ncbi.nlm.nih.gov/pubmed/33336195
http://dx.doi.org/10.1016/j.patter.2020.100137
_version_ 1783622353123540992
author Buric, Filip
Zrimec, Jan
Zelezniak, Aleksej
author_facet Buric, Filip
Zrimec, Jan
Zelezniak, Aleksej
author_sort Buric, Filip
collection PubMed
description High-throughput data-independent acquisition (DIA) is the method of choice for quantitative proteomics, combining the best practices of targeted and shotgun approaches. The resultant DIA spectra are, however, highly convolved and with no direct precursor-fragment correspondence, complicating biological sample analysis. Here, we present CANDIA (canonical decomposition of data-independent-acquired spectra), a GPU-powered unsupervised multiway factor analysis framework that deconvolves multispectral scans to individual analyte spectra, chromatographic profiles, and sample abundances, using parallel factor analysis. The deconvolved spectra can be annotated with traditional database search engines or used as high-quality input for de novo sequencing methods. We demonstrate that spectral libraries generated with CANDIA substantially reduce the false discovery rate underlying the validation of spectral quantification. CANDIA covers up to 33 times more total ion current than library-based approaches, which typically use less than 5% of total recorded ions, thus allowing quantification and identification of signals from unexplored DIA spectra.
format Online
Article
Text
id pubmed-7733873
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-77338732020-12-16 Parallel Factor Analysis Enables Quantification and Identification of Highly Convolved Data-Independent-Acquired Protein Spectra Buric, Filip Zrimec, Jan Zelezniak, Aleksej Patterns (N Y) Article High-throughput data-independent acquisition (DIA) is the method of choice for quantitative proteomics, combining the best practices of targeted and shotgun approaches. The resultant DIA spectra are, however, highly convolved and with no direct precursor-fragment correspondence, complicating biological sample analysis. Here, we present CANDIA (canonical decomposition of data-independent-acquired spectra), a GPU-powered unsupervised multiway factor analysis framework that deconvolves multispectral scans to individual analyte spectra, chromatographic profiles, and sample abundances, using parallel factor analysis. The deconvolved spectra can be annotated with traditional database search engines or used as high-quality input for de novo sequencing methods. We demonstrate that spectral libraries generated with CANDIA substantially reduce the false discovery rate underlying the validation of spectral quantification. CANDIA covers up to 33 times more total ion current than library-based approaches, which typically use less than 5% of total recorded ions, thus allowing quantification and identification of signals from unexplored DIA spectra. Elsevier 2020-11-05 /pmc/articles/PMC7733873/ /pubmed/33336195 http://dx.doi.org/10.1016/j.patter.2020.100137 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Buric, Filip
Zrimec, Jan
Zelezniak, Aleksej
Parallel Factor Analysis Enables Quantification and Identification of Highly Convolved Data-Independent-Acquired Protein Spectra
title Parallel Factor Analysis Enables Quantification and Identification of Highly Convolved Data-Independent-Acquired Protein Spectra
title_full Parallel Factor Analysis Enables Quantification and Identification of Highly Convolved Data-Independent-Acquired Protein Spectra
title_fullStr Parallel Factor Analysis Enables Quantification and Identification of Highly Convolved Data-Independent-Acquired Protein Spectra
title_full_unstemmed Parallel Factor Analysis Enables Quantification and Identification of Highly Convolved Data-Independent-Acquired Protein Spectra
title_short Parallel Factor Analysis Enables Quantification and Identification of Highly Convolved Data-Independent-Acquired Protein Spectra
title_sort parallel factor analysis enables quantification and identification of highly convolved data-independent-acquired protein spectra
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7733873/
https://www.ncbi.nlm.nih.gov/pubmed/33336195
http://dx.doi.org/10.1016/j.patter.2020.100137
work_keys_str_mv AT buricfilip parallelfactoranalysisenablesquantificationandidentificationofhighlyconvolveddataindependentacquiredproteinspectra
AT zrimecjan parallelfactoranalysisenablesquantificationandidentificationofhighlyconvolveddataindependentacquiredproteinspectra
AT zelezniakaleksej parallelfactoranalysisenablesquantificationandidentificationofhighlyconvolveddataindependentacquiredproteinspectra