Cargando…

A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics

Metaproteomic studies adopt the common bottom-up proteomics approach to investigate the protein composition and the dynamics of protein expression in microbial communities. When matched metagenomic and/or metatranscriptomic data of the microbial communities are available, metaproteomic data analyses...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Haixu, Li, Sujun, Ye, Yuzhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5137872/
https://www.ncbi.nlm.nih.gov/pubmed/27918579
http://dx.doi.org/10.1371/journal.pcbi.1005224
_version_ 1782471971388260352
author Tang, Haixu
Li, Sujun
Ye, Yuzhen
author_facet Tang, Haixu
Li, Sujun
Ye, Yuzhen
author_sort Tang, Haixu
collection PubMed
description Metaproteomic studies adopt the common bottom-up proteomics approach to investigate the protein composition and the dynamics of protein expression in microbial communities. When matched metagenomic and/or metatranscriptomic data of the microbial communities are available, metaproteomic data analyses often employ a metagenome-guided approach, in which complete or fragmental protein-coding genes are first directly predicted from metagenomic (and/or metatranscriptomic) sequences or from their assemblies, and the resulting protein sequences are then used as the reference database for peptide/protein identification from MS/MS spectra. This approach is often limited because protein coding genes predicted from metagenomes are incomplete and fragmental. In this paper, we present a graph-centric approach to improving metagenome-guided peptide and protein identification in metaproteomics. Our method exploits the de Bruijn graph structure reported by metagenome assembly algorithms to generate a comprehensive database of protein sequences encoded in the community. We tested our method using several public metaproteomic datasets with matched metagenomic and metatranscriptomic sequencing data acquired from complex microbial communities in a biological wastewater treatment plant. The results showed that many more peptides and proteins can be identified when assembly graphs were utilized, improving the characterization of the proteins expressed in the microbial communities. The additional proteins we identified contribute to the characterization of important pathways such as those involved in degradation of chemical hazards. Our tools are released as open-source software on github at https://github.com/COL-IU/Graph2Pro.
format Online
Article
Text
id pubmed-5137872
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-51378722016-12-21 A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics Tang, Haixu Li, Sujun Ye, Yuzhen PLoS Comput Biol Research Article Metaproteomic studies adopt the common bottom-up proteomics approach to investigate the protein composition and the dynamics of protein expression in microbial communities. When matched metagenomic and/or metatranscriptomic data of the microbial communities are available, metaproteomic data analyses often employ a metagenome-guided approach, in which complete or fragmental protein-coding genes are first directly predicted from metagenomic (and/or metatranscriptomic) sequences or from their assemblies, and the resulting protein sequences are then used as the reference database for peptide/protein identification from MS/MS spectra. This approach is often limited because protein coding genes predicted from metagenomes are incomplete and fragmental. In this paper, we present a graph-centric approach to improving metagenome-guided peptide and protein identification in metaproteomics. Our method exploits the de Bruijn graph structure reported by metagenome assembly algorithms to generate a comprehensive database of protein sequences encoded in the community. We tested our method using several public metaproteomic datasets with matched metagenomic and metatranscriptomic sequencing data acquired from complex microbial communities in a biological wastewater treatment plant. The results showed that many more peptides and proteins can be identified when assembly graphs were utilized, improving the characterization of the proteins expressed in the microbial communities. The additional proteins we identified contribute to the characterization of important pathways such as those involved in degradation of chemical hazards. Our tools are released as open-source software on github at https://github.com/COL-IU/Graph2Pro. Public Library of Science 2016-12-05 /pmc/articles/PMC5137872/ /pubmed/27918579 http://dx.doi.org/10.1371/journal.pcbi.1005224 Text en © 2016 Tang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Tang, Haixu
Li, Sujun
Ye, Yuzhen
A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics
title A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics
title_full A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics
title_fullStr A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics
title_full_unstemmed A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics
title_short A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics
title_sort graph-centric approach for metagenome-guided peptide and protein identification in metaproteomics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5137872/
https://www.ncbi.nlm.nih.gov/pubmed/27918579
http://dx.doi.org/10.1371/journal.pcbi.1005224
work_keys_str_mv AT tanghaixu agraphcentricapproachformetagenomeguidedpeptideandproteinidentificationinmetaproteomics
AT lisujun agraphcentricapproachformetagenomeguidedpeptideandproteinidentificationinmetaproteomics
AT yeyuzhen agraphcentricapproachformetagenomeguidedpeptideandproteinidentificationinmetaproteomics
AT tanghaixu graphcentricapproachformetagenomeguidedpeptideandproteinidentificationinmetaproteomics
AT lisujun graphcentricapproachformetagenomeguidedpeptideandproteinidentificationinmetaproteomics
AT yeyuzhen graphcentricapproachformetagenomeguidedpeptideandproteinidentificationinmetaproteomics