Cargando…

Expert System for Computer-assisted Annotation of MS/MS Spectra

An important step in mass spectrometry (MS)-based proteomics is the identification of peptides by their fragment spectra. Regardless of the identification score achieved, almost all tandem-MS (MS/MS) spectra contain remaining peaks that are not assigned by the search engine. These peaks may be expla...

Descripción completa

Detalles Bibliográficos
Autores principales: Neuhauser, Nadin, Michalski, Annette, Cox, Jürgen, Mann, Matthias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The American Society for Biochemistry and Molecular Biology 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494176/
https://www.ncbi.nlm.nih.gov/pubmed/22888147
http://dx.doi.org/10.1074/mcp.M112.020271
_version_ 1782249373228335104
author Neuhauser, Nadin
Michalski, Annette
Cox, Jürgen
Mann, Matthias
author_facet Neuhauser, Nadin
Michalski, Annette
Cox, Jürgen
Mann, Matthias
author_sort Neuhauser, Nadin
collection PubMed
description An important step in mass spectrometry (MS)-based proteomics is the identification of peptides by their fragment spectra. Regardless of the identification score achieved, almost all tandem-MS (MS/MS) spectra contain remaining peaks that are not assigned by the search engine. These peaks may be explainable by human experts but the scale of modern proteomics experiments makes this impractical. In computer science, Expert Systems are a mature technology to implement a list of rules generated by interviews with practitioners. We here develop such an Expert System, making use of literature knowledge as well as a large body of high mass accuracy and pure fragmentation spectra. Interestingly, we find that even with high mass accuracy data, rule sets can quickly become too complex, leading to over-annotation. Therefore we establish a rigorous false discovery rate, calculated by random insertion of peaks from a large collection of other MS/MS spectra, and use it to develop an optimized knowledge base. This rule set correctly annotates almost all peaks of medium or high abundance. For high resolution HCD data, median intensity coverage of fragment peaks in MS/MS spectra increases from 58% by search engine annotation alone to 86%. The resulting annotation performance surpasses a human expert, especially on complex spectra such as those of larger phosphorylated peptides. Our system is also applicable to high resolution collision-induced dissociation data. It is available both as a part of MaxQuant and via a webserver that only requires an MS/MS spectrum and the corresponding peptides sequence, and which outputs publication quality, annotated MS/MS spectra (www.biochem.mpg.de/mann/tools/). It provides expert knowledge to beginners in the field of MS-based proteomics and helps advanced users to focus on unusual and possibly novel types of fragment ions.
format Online
Article
Text
id pubmed-3494176
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher The American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-34941762012-11-09 Expert System for Computer-assisted Annotation of MS/MS Spectra Neuhauser, Nadin Michalski, Annette Cox, Jürgen Mann, Matthias Mol Cell Proteomics Technological Innovation and Resources An important step in mass spectrometry (MS)-based proteomics is the identification of peptides by their fragment spectra. Regardless of the identification score achieved, almost all tandem-MS (MS/MS) spectra contain remaining peaks that are not assigned by the search engine. These peaks may be explainable by human experts but the scale of modern proteomics experiments makes this impractical. In computer science, Expert Systems are a mature technology to implement a list of rules generated by interviews with practitioners. We here develop such an Expert System, making use of literature knowledge as well as a large body of high mass accuracy and pure fragmentation spectra. Interestingly, we find that even with high mass accuracy data, rule sets can quickly become too complex, leading to over-annotation. Therefore we establish a rigorous false discovery rate, calculated by random insertion of peaks from a large collection of other MS/MS spectra, and use it to develop an optimized knowledge base. This rule set correctly annotates almost all peaks of medium or high abundance. For high resolution HCD data, median intensity coverage of fragment peaks in MS/MS spectra increases from 58% by search engine annotation alone to 86%. The resulting annotation performance surpasses a human expert, especially on complex spectra such as those of larger phosphorylated peptides. Our system is also applicable to high resolution collision-induced dissociation data. It is available both as a part of MaxQuant and via a webserver that only requires an MS/MS spectrum and the corresponding peptides sequence, and which outputs publication quality, annotated MS/MS spectra (www.biochem.mpg.de/mann/tools/). It provides expert knowledge to beginners in the field of MS-based proteomics and helps advanced users to focus on unusual and possibly novel types of fragment ions. The American Society for Biochemistry and Molecular Biology 2012-11 2012-08-10 /pmc/articles/PMC3494176/ /pubmed/22888147 http://dx.doi.org/10.1074/mcp.M112.020271 Text en © 2012 by The American Society for Biochemistry and Molecular Biology, Inc. Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) applies to Author Choice Articles
spellingShingle Technological Innovation and Resources
Neuhauser, Nadin
Michalski, Annette
Cox, Jürgen
Mann, Matthias
Expert System for Computer-assisted Annotation of MS/MS Spectra
title Expert System for Computer-assisted Annotation of MS/MS Spectra
title_full Expert System for Computer-assisted Annotation of MS/MS Spectra
title_fullStr Expert System for Computer-assisted Annotation of MS/MS Spectra
title_full_unstemmed Expert System for Computer-assisted Annotation of MS/MS Spectra
title_short Expert System for Computer-assisted Annotation of MS/MS Spectra
title_sort expert system for computer-assisted annotation of ms/ms spectra
topic Technological Innovation and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3494176/
https://www.ncbi.nlm.nih.gov/pubmed/22888147
http://dx.doi.org/10.1074/mcp.M112.020271
work_keys_str_mv AT neuhausernadin expertsystemforcomputerassistedannotationofmsmsspectra
AT michalskiannette expertsystemforcomputerassistedannotationofmsmsspectra
AT coxjurgen expertsystemforcomputerassistedannotationofmsmsspectra
AT mannmatthias expertsystemforcomputerassistedannotationofmsmsspectra