Cargando…

DeepPep: Deep proteome inference from peptide profiles

Protein inference, the identification of the protein set that is the origin of a given peptide profile, is a fundamental challenge in proteomics. We present DeepPep, a deep-convolutional neural network framework that predicts the protein set from a proteomics mixture, given the sequence universe of...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Minseung, Eetemadi, Ameen, Tagkopoulos, Ilias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5600403/
https://www.ncbi.nlm.nih.gov/pubmed/28873403
http://dx.doi.org/10.1371/journal.pcbi.1005661
_version_ 1783264236677365760
author Kim, Minseung
Eetemadi, Ameen
Tagkopoulos, Ilias
author_facet Kim, Minseung
Eetemadi, Ameen
Tagkopoulos, Ilias
author_sort Kim, Minseung
collection PubMed
description Protein inference, the identification of the protein set that is the origin of a given peptide profile, is a fundamental challenge in proteomics. We present DeepPep, a deep-convolutional neural network framework that predicts the protein set from a proteomics mixture, given the sequence universe of possible proteins and a target peptide profile. In its core, DeepPep quantifies the change in probabilistic score of peptide-spectrum matches in the presence or absence of a specific protein, hence selecting as candidate proteins with the largest impact to the peptide profile. Application of the method across datasets argues for its competitive predictive ability (AUC of 0.80±0.18, AUPR of 0.84±0.28) in inferring proteins without need of peptide detectability on which the most competitive methods rely. We find that the convolutional neural network architecture outperforms the traditional artificial neural network architectures without convolution layers in protein inference. We expect that similar deep learning architectures that allow learning nonlinear patterns can be further extended to problems in metagenome profiling and cell type inference. The source code of DeepPep and the benchmark datasets used in this study are available at https://deeppep.github.io/DeepPep/.
format Online
Article
Text
id pubmed-5600403
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56004032017-09-22 DeepPep: Deep proteome inference from peptide profiles Kim, Minseung Eetemadi, Ameen Tagkopoulos, Ilias PLoS Comput Biol Research Article Protein inference, the identification of the protein set that is the origin of a given peptide profile, is a fundamental challenge in proteomics. We present DeepPep, a deep-convolutional neural network framework that predicts the protein set from a proteomics mixture, given the sequence universe of possible proteins and a target peptide profile. In its core, DeepPep quantifies the change in probabilistic score of peptide-spectrum matches in the presence or absence of a specific protein, hence selecting as candidate proteins with the largest impact to the peptide profile. Application of the method across datasets argues for its competitive predictive ability (AUC of 0.80±0.18, AUPR of 0.84±0.28) in inferring proteins without need of peptide detectability on which the most competitive methods rely. We find that the convolutional neural network architecture outperforms the traditional artificial neural network architectures without convolution layers in protein inference. We expect that similar deep learning architectures that allow learning nonlinear patterns can be further extended to problems in metagenome profiling and cell type inference. The source code of DeepPep and the benchmark datasets used in this study are available at https://deeppep.github.io/DeepPep/. Public Library of Science 2017-09-05 /pmc/articles/PMC5600403/ /pubmed/28873403 http://dx.doi.org/10.1371/journal.pcbi.1005661 Text en © 2017 Kim et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kim, Minseung
Eetemadi, Ameen
Tagkopoulos, Ilias
DeepPep: Deep proteome inference from peptide profiles
title DeepPep: Deep proteome inference from peptide profiles
title_full DeepPep: Deep proteome inference from peptide profiles
title_fullStr DeepPep: Deep proteome inference from peptide profiles
title_full_unstemmed DeepPep: Deep proteome inference from peptide profiles
title_short DeepPep: Deep proteome inference from peptide profiles
title_sort deeppep: deep proteome inference from peptide profiles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5600403/
https://www.ncbi.nlm.nih.gov/pubmed/28873403
http://dx.doi.org/10.1371/journal.pcbi.1005661
work_keys_str_mv AT kimminseung deeppepdeepproteomeinferencefrompeptideprofiles
AT eetemadiameen deeppepdeepproteomeinferencefrompeptideprofiles
AT tagkopoulosilias deeppepdeepproteomeinferencefrompeptideprofiles