Cargando…

Identifying direct contacts between protein complex subunits from their conditional dependence in proteomics datasets

Determining the three dimensional arrangement of proteins in a complex is highly beneficial for uncovering mechanistic function and interpreting genetic variation in coding genes comprising protein complexes. There are several methods for determining co-complex interactions between proteins, among t...

Descripción completa

Detalles Bibliográficos
Autores principales: Drew, Kevin, Müller, Christian L., Bonneau, Richard, Marcotte, Edward M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638211/
https://www.ncbi.nlm.nih.gov/pubmed/29023445
http://dx.doi.org/10.1371/journal.pcbi.1005625
_version_ 1783270707227000832
author Drew, Kevin
Müller, Christian L.
Bonneau, Richard
Marcotte, Edward M.
author_facet Drew, Kevin
Müller, Christian L.
Bonneau, Richard
Marcotte, Edward M.
author_sort Drew, Kevin
collection PubMed
description Determining the three dimensional arrangement of proteins in a complex is highly beneficial for uncovering mechanistic function and interpreting genetic variation in coding genes comprising protein complexes. There are several methods for determining co-complex interactions between proteins, among them co-fractionation / mass spectrometry (CF-MS), but it remains difficult to identify directly contacting subunits within a multi-protein complex. Correlation analysis of CF-MS profiles shows promise in detecting protein complexes as a whole but is limited in its ability to infer direct physical contacts among proteins in sub-complexes. To identify direct protein-protein contacts within human protein complexes we learn a sparse conditional dependency graph from approximately 3,000 CF-MS experiments on human cell lines. We show substantial performance gains in estimating direct interactions compared to correlation analysis on a benchmark of large protein complexes with solved three-dimensional structures. We demonstrate the method’s value in determining the three dimensional arrangement of proteins by making predictions for complexes without known structure (the exocyst and tRNA multi-synthetase complex) and by establishing evidence for the structural position of a recently discovered component of the core human EKC/KEOPS complex, GON7/C14ORF142, providing a more complete 3D model of the complex. Direct contact prediction provides easily calculable additional structural information for large-scale protein complex mapping studies and should be broadly applicable across organisms as more CF-MS datasets become available.
format Online
Article
Text
id pubmed-5638211
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56382112017-11-03 Identifying direct contacts between protein complex subunits from their conditional dependence in proteomics datasets Drew, Kevin Müller, Christian L. Bonneau, Richard Marcotte, Edward M. PLoS Comput Biol Research Article Determining the three dimensional arrangement of proteins in a complex is highly beneficial for uncovering mechanistic function and interpreting genetic variation in coding genes comprising protein complexes. There are several methods for determining co-complex interactions between proteins, among them co-fractionation / mass spectrometry (CF-MS), but it remains difficult to identify directly contacting subunits within a multi-protein complex. Correlation analysis of CF-MS profiles shows promise in detecting protein complexes as a whole but is limited in its ability to infer direct physical contacts among proteins in sub-complexes. To identify direct protein-protein contacts within human protein complexes we learn a sparse conditional dependency graph from approximately 3,000 CF-MS experiments on human cell lines. We show substantial performance gains in estimating direct interactions compared to correlation analysis on a benchmark of large protein complexes with solved three-dimensional structures. We demonstrate the method’s value in determining the three dimensional arrangement of proteins by making predictions for complexes without known structure (the exocyst and tRNA multi-synthetase complex) and by establishing evidence for the structural position of a recently discovered component of the core human EKC/KEOPS complex, GON7/C14ORF142, providing a more complete 3D model of the complex. Direct contact prediction provides easily calculable additional structural information for large-scale protein complex mapping studies and should be broadly applicable across organisms as more CF-MS datasets become available. Public Library of Science 2017-10-12 /pmc/articles/PMC5638211/ /pubmed/29023445 http://dx.doi.org/10.1371/journal.pcbi.1005625 Text en © 2017 Drew et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Drew, Kevin
Müller, Christian L.
Bonneau, Richard
Marcotte, Edward M.
Identifying direct contacts between protein complex subunits from their conditional dependence in proteomics datasets
title Identifying direct contacts between protein complex subunits from their conditional dependence in proteomics datasets
title_full Identifying direct contacts between protein complex subunits from their conditional dependence in proteomics datasets
title_fullStr Identifying direct contacts between protein complex subunits from their conditional dependence in proteomics datasets
title_full_unstemmed Identifying direct contacts between protein complex subunits from their conditional dependence in proteomics datasets
title_short Identifying direct contacts between protein complex subunits from their conditional dependence in proteomics datasets
title_sort identifying direct contacts between protein complex subunits from their conditional dependence in proteomics datasets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5638211/
https://www.ncbi.nlm.nih.gov/pubmed/29023445
http://dx.doi.org/10.1371/journal.pcbi.1005625
work_keys_str_mv AT drewkevin identifyingdirectcontactsbetweenproteincomplexsubunitsfromtheirconditionaldependenceinproteomicsdatasets
AT mullerchristianl identifyingdirectcontactsbetweenproteincomplexsubunitsfromtheirconditionaldependenceinproteomicsdatasets
AT bonneaurichard identifyingdirectcontactsbetweenproteincomplexsubunitsfromtheirconditionaldependenceinproteomicsdatasets
AT marcotteedwardm identifyingdirectcontactsbetweenproteincomplexsubunitsfromtheirconditionaldependenceinproteomicsdatasets