Cargando…

Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs

In bottom-up proteomics, proteins are enzymatically digested into peptides before measurement with mass spectrometry. The relationship between proteins and their corresponding peptides can be represented by bipartite graphs. We conduct a comprehensive analysis of bipartite graphs using quantified pe...

Descripción completa

Detalles Bibliográficos
Autores principales: Schork, Karin, Turewicz, Michael, Uszkoreit, Julian, Rahnenführer, Jörg, Eisenacher, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9586388/
https://www.ncbi.nlm.nih.gov/pubmed/36269744
http://dx.doi.org/10.1371/journal.pone.0276401
_version_ 1784813683333922816
author Schork, Karin
Turewicz, Michael
Uszkoreit, Julian
Rahnenführer, Jörg
Eisenacher, Martin
author_facet Schork, Karin
Turewicz, Michael
Uszkoreit, Julian
Rahnenführer, Jörg
Eisenacher, Martin
author_sort Schork, Karin
collection PubMed
description In bottom-up proteomics, proteins are enzymatically digested into peptides before measurement with mass spectrometry. The relationship between proteins and their corresponding peptides can be represented by bipartite graphs. We conduct a comprehensive analysis of bipartite graphs using quantified peptides from measured data sets as well as theoretical peptides from an in silico digestion of the corresponding complete taxonomic protein sequence databases. The aim of this study is to characterize and structure the different types of graphs that occur and to compare them between data sets. We observed a large influence of the accepted minimum peptide length during in silico digestion. When changing from theoretical peptides to measured ones, the graph structures are subject to two opposite effects. On the one hand, the graphs based on measured peptides are on average smaller and less complex compared to graphs using theoretical peptides. On the other hand, the proportion of protein nodes without unique peptides, which are a complicated case for protein inference and quantification, is considerably larger for measured data. Additionally, the proportion of graphs containing at least one protein node without unique peptides rises when going from database to quantitative level. The fraction of shared peptides and proteins without unique peptides as well as the complexity and size of the graphs highly depends on the data set and organism. Large differences between the structures of bipartite peptide-protein graphs have been observed between database and quantitative level as well as between analyzed species. In the analyzed measured data sets, the proportion of protein nodes without unique peptides ranged from 6.4% to 55.0%. This highlights the need for novel methods that can quantify proteins without unique peptides. The knowledge about the structure of the bipartite peptide-protein graphs gained in this study will be useful for the development of such algorithms.
format Online
Article
Text
id pubmed-9586388
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-95863882022-10-22 Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs Schork, Karin Turewicz, Michael Uszkoreit, Julian Rahnenführer, Jörg Eisenacher, Martin PLoS One Research Article In bottom-up proteomics, proteins are enzymatically digested into peptides before measurement with mass spectrometry. The relationship between proteins and their corresponding peptides can be represented by bipartite graphs. We conduct a comprehensive analysis of bipartite graphs using quantified peptides from measured data sets as well as theoretical peptides from an in silico digestion of the corresponding complete taxonomic protein sequence databases. The aim of this study is to characterize and structure the different types of graphs that occur and to compare them between data sets. We observed a large influence of the accepted minimum peptide length during in silico digestion. When changing from theoretical peptides to measured ones, the graph structures are subject to two opposite effects. On the one hand, the graphs based on measured peptides are on average smaller and less complex compared to graphs using theoretical peptides. On the other hand, the proportion of protein nodes without unique peptides, which are a complicated case for protein inference and quantification, is considerably larger for measured data. Additionally, the proportion of graphs containing at least one protein node without unique peptides rises when going from database to quantitative level. The fraction of shared peptides and proteins without unique peptides as well as the complexity and size of the graphs highly depends on the data set and organism. Large differences between the structures of bipartite peptide-protein graphs have been observed between database and quantitative level as well as between analyzed species. In the analyzed measured data sets, the proportion of protein nodes without unique peptides ranged from 6.4% to 55.0%. This highlights the need for novel methods that can quantify proteins without unique peptides. The knowledge about the structure of the bipartite peptide-protein graphs gained in this study will be useful for the development of such algorithms. Public Library of Science 2022-10-21 /pmc/articles/PMC9586388/ /pubmed/36269744 http://dx.doi.org/10.1371/journal.pone.0276401 Text en © 2022 Schork et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Schork, Karin
Turewicz, Michael
Uszkoreit, Julian
Rahnenführer, Jörg
Eisenacher, Martin
Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs
title Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs
title_full Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs
title_fullStr Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs
title_full_unstemmed Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs
title_short Characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs
title_sort characterization of peptide-protein relationships in protein ambiguity groups via bipartite graphs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9586388/
https://www.ncbi.nlm.nih.gov/pubmed/36269744
http://dx.doi.org/10.1371/journal.pone.0276401
work_keys_str_mv AT schorkkarin characterizationofpeptideproteinrelationshipsinproteinambiguitygroupsviabipartitegraphs
AT turewiczmichael characterizationofpeptideproteinrelationshipsinproteinambiguitygroupsviabipartitegraphs
AT uszkoreitjulian characterizationofpeptideproteinrelationshipsinproteinambiguitygroupsviabipartitegraphs
AT rahnenfuhrerjorg characterizationofpeptideproteinrelationshipsinproteinambiguitygroupsviabipartitegraphs
AT eisenachermartin characterizationofpeptideproteinrelationshipsinproteinambiguitygroupsviabipartitegraphs