Cargando…

The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry

Crosslinking mass spectrometry (XL-MS) is becoming an increasingly popular technique for modeling protein monomers and complexes. The distance restraints garnered from these experiments can be used alone or as part of an integrative modeling approach, incorporating data from many sources. However, m...

Descripción completa

Detalles Bibliográficos
Autores principales: Bullock, Joshua Matthew Allen, Schwab, Jannik, Thalassinos, Konstantinos, Topf, Maya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The American Society for Biochemistry and Molecular Biology 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937519/
https://www.ncbi.nlm.nih.gov/pubmed/27150526
http://dx.doi.org/10.1074/mcp.M116.058560
_version_ 1782441720017846272
author Bullock, Joshua Matthew Allen
Schwab, Jannik
Thalassinos, Konstantinos
Topf, Maya
author_facet Bullock, Joshua Matthew Allen
Schwab, Jannik
Thalassinos, Konstantinos
Topf, Maya
author_sort Bullock, Joshua Matthew Allen
collection PubMed
description Crosslinking mass spectrometry (XL-MS) is becoming an increasingly popular technique for modeling protein monomers and complexes. The distance restraints garnered from these experiments can be used alone or as part of an integrative modeling approach, incorporating data from many sources. However, modeling practices are varied and the difference in their usefulness is not clear. Here, we develop a new scoring procedure for models based on crosslink data—Matched and Nonaccessible Crosslink score (MNXL). We compare its performance with that of other commonly-used scoring functions (Number of Violations and Sum of Violation Distances) on a benchmark of 14 protein domains, each with 300 corresponding models (at various levels of quality) and associated, previously published, experimental crosslinks (XLdb). The distances between crosslinked lysines are calculated either as Euclidean distances or Solvent Accessible Surface Distances (SASD) using a newly-developed method (Jwalk). MNXL takes into account whether a crosslink is nonaccessible, i.e. an experimentally observed crosslink has no corresponding SASD in a model due to buried lysines. This metric alone is shown to have a significant impact on modeling performance and is a concept that is not considered at present if only Euclidean distances are used. Additionally, a comparison between modeling with SASD or Euclidean distance shows that SASD is superior, even when factoring out the effect of the nonaccessible crosslinks. Our benchmarking also shows that MNXL outperforms the other tested scoring functions in terms of precision and correlation to Cα-RMSD from the crystal structure. We finally test the MNXL at different levels of crosslink recovery (i.e. the percentage of crosslinks experimentally observed out of all theoretical ones) and set a target recovery of ∼20% after which the performance plateaus.
format Online
Article
Text
id pubmed-4937519
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher The American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-49375192016-07-19 The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry Bullock, Joshua Matthew Allen Schwab, Jannik Thalassinos, Konstantinos Topf, Maya Mol Cell Proteomics Technological Innovation and Resources Crosslinking mass spectrometry (XL-MS) is becoming an increasingly popular technique for modeling protein monomers and complexes. The distance restraints garnered from these experiments can be used alone or as part of an integrative modeling approach, incorporating data from many sources. However, modeling practices are varied and the difference in their usefulness is not clear. Here, we develop a new scoring procedure for models based on crosslink data—Matched and Nonaccessible Crosslink score (MNXL). We compare its performance with that of other commonly-used scoring functions (Number of Violations and Sum of Violation Distances) on a benchmark of 14 protein domains, each with 300 corresponding models (at various levels of quality) and associated, previously published, experimental crosslinks (XLdb). The distances between crosslinked lysines are calculated either as Euclidean distances or Solvent Accessible Surface Distances (SASD) using a newly-developed method (Jwalk). MNXL takes into account whether a crosslink is nonaccessible, i.e. an experimentally observed crosslink has no corresponding SASD in a model due to buried lysines. This metric alone is shown to have a significant impact on modeling performance and is a concept that is not considered at present if only Euclidean distances are used. Additionally, a comparison between modeling with SASD or Euclidean distance shows that SASD is superior, even when factoring out the effect of the nonaccessible crosslinks. Our benchmarking also shows that MNXL outperforms the other tested scoring functions in terms of precision and correlation to Cα-RMSD from the crystal structure. We finally test the MNXL at different levels of crosslink recovery (i.e. the percentage of crosslinks experimentally observed out of all theoretical ones) and set a target recovery of ∼20% after which the performance plateaus. The American Society for Biochemistry and Molecular Biology 2016-07 2016-05-05 /pmc/articles/PMC4937519/ /pubmed/27150526 http://dx.doi.org/10.1074/mcp.M116.058560 Text en © 2016 by The American Society for Biochemistry and Molecular Biology, Inc. Author's Choice—Final version free via Creative Commons CC-BY license (http://creativecommons.org/licenses/by/4.0) .
spellingShingle Technological Innovation and Resources
Bullock, Joshua Matthew Allen
Schwab, Jannik
Thalassinos, Konstantinos
Topf, Maya
The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry
title The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry
title_full The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry
title_fullStr The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry
title_full_unstemmed The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry
title_short The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry
title_sort importance of non-accessible crosslinks and solvent accessible surface distance in modeling proteins with restraints from crosslinking mass spectrometry
topic Technological Innovation and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4937519/
https://www.ncbi.nlm.nih.gov/pubmed/27150526
http://dx.doi.org/10.1074/mcp.M116.058560
work_keys_str_mv AT bullockjoshuamatthewallen theimportanceofnonaccessiblecrosslinksandsolventaccessiblesurfacedistanceinmodelingproteinswithrestraintsfromcrosslinkingmassspectrometry
AT schwabjannik theimportanceofnonaccessiblecrosslinksandsolventaccessiblesurfacedistanceinmodelingproteinswithrestraintsfromcrosslinkingmassspectrometry
AT thalassinoskonstantinos theimportanceofnonaccessiblecrosslinksandsolventaccessiblesurfacedistanceinmodelingproteinswithrestraintsfromcrosslinkingmassspectrometry
AT topfmaya theimportanceofnonaccessiblecrosslinksandsolventaccessiblesurfacedistanceinmodelingproteinswithrestraintsfromcrosslinkingmassspectrometry
AT bullockjoshuamatthewallen importanceofnonaccessiblecrosslinksandsolventaccessiblesurfacedistanceinmodelingproteinswithrestraintsfromcrosslinkingmassspectrometry
AT schwabjannik importanceofnonaccessiblecrosslinksandsolventaccessiblesurfacedistanceinmodelingproteinswithrestraintsfromcrosslinkingmassspectrometry
AT thalassinoskonstantinos importanceofnonaccessiblecrosslinksandsolventaccessiblesurfacedistanceinmodelingproteinswithrestraintsfromcrosslinkingmassspectrometry
AT topfmaya importanceofnonaccessiblecrosslinksandsolventaccessiblesurfacedistanceinmodelingproteinswithrestraintsfromcrosslinkingmassspectrometry