Cargando…

Network-assisted protein identification and data interpretation in shotgun proteomics

Protein assembly and biological interpretation of the assembled protein lists are critical steps in shotgun proteomics data analysis. Although most biological functions arise from interactions among proteins, current protein assembly pipelines treat proteins as independent entities. Usually, only in...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jing, Zimmerman, Lisa J, Park, Byung-Hoon, Tabb, David L, Liebler, Daniel C, Zhang, Bing
Formato: Texto
Lenguaje:English
Publicado: Nature Publishing Group 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2736651/
https://www.ncbi.nlm.nih.gov/pubmed/19690572
http://dx.doi.org/10.1038/msb.2009.54
_version_ 1782171353182371840
author Li, Jing
Zimmerman, Lisa J
Park, Byung-Hoon
Tabb, David L
Liebler, Daniel C
Zhang, Bing
author_facet Li, Jing
Zimmerman, Lisa J
Park, Byung-Hoon
Tabb, David L
Liebler, Daniel C
Zhang, Bing
author_sort Li, Jing
collection PubMed
description Protein assembly and biological interpretation of the assembled protein lists are critical steps in shotgun proteomics data analysis. Although most biological functions arise from interactions among proteins, current protein assembly pipelines treat proteins as independent entities. Usually, only individual proteins with strong experimental evidence, that is, confident proteins, are reported, whereas many possible proteins of biological interest are eliminated. We have developed a clique-enrichment approach (CEA) to rescue eliminated proteins by incorporating the relationship among proteins as embedded in a protein interaction network. In several data sets tested, CEA increased protein identification by 8–23% with an estimated accuracy of 85%. Rescued proteins were supported by existing literature or transcriptome profiling studies at similar levels as confident proteins and at a significantly higher level than abandoned ones. Applying CEA on a breast cancer data set, rescued proteins coded by well-known breast cancer genes. In addition, CEA generated a network view of the proteins and helped show the modular organization of proteins that may underpin the molecular mechanisms of the disease.
format Text
id pubmed-2736651
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-27366512009-09-02 Network-assisted protein identification and data interpretation in shotgun proteomics Li, Jing Zimmerman, Lisa J Park, Byung-Hoon Tabb, David L Liebler, Daniel C Zhang, Bing Mol Syst Biol Report Protein assembly and biological interpretation of the assembled protein lists are critical steps in shotgun proteomics data analysis. Although most biological functions arise from interactions among proteins, current protein assembly pipelines treat proteins as independent entities. Usually, only individual proteins with strong experimental evidence, that is, confident proteins, are reported, whereas many possible proteins of biological interest are eliminated. We have developed a clique-enrichment approach (CEA) to rescue eliminated proteins by incorporating the relationship among proteins as embedded in a protein interaction network. In several data sets tested, CEA increased protein identification by 8–23% with an estimated accuracy of 85%. Rescued proteins were supported by existing literature or transcriptome profiling studies at similar levels as confident proteins and at a significantly higher level than abandoned ones. Applying CEA on a breast cancer data set, rescued proteins coded by well-known breast cancer genes. In addition, CEA generated a network view of the proteins and helped show the modular organization of proteins that may underpin the molecular mechanisms of the disease. Nature Publishing Group 2009-08-18 /pmc/articles/PMC2736651/ /pubmed/19690572 http://dx.doi.org/10.1038/msb.2009.54 Text en Copyright © 2009, EMBO and Nature Publishing Group http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution Licence, which permits distribution and reproduction in any medium, provided the original author and source are credited. Creation of derivative works is permitted but the resulting work may be distributed only under the same or similar licence to this one. This licence does not permit commercial exploitation without specific permission.
spellingShingle Report
Li, Jing
Zimmerman, Lisa J
Park, Byung-Hoon
Tabb, David L
Liebler, Daniel C
Zhang, Bing
Network-assisted protein identification and data interpretation in shotgun proteomics
title Network-assisted protein identification and data interpretation in shotgun proteomics
title_full Network-assisted protein identification and data interpretation in shotgun proteomics
title_fullStr Network-assisted protein identification and data interpretation in shotgun proteomics
title_full_unstemmed Network-assisted protein identification and data interpretation in shotgun proteomics
title_short Network-assisted protein identification and data interpretation in shotgun proteomics
title_sort network-assisted protein identification and data interpretation in shotgun proteomics
topic Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2736651/
https://www.ncbi.nlm.nih.gov/pubmed/19690572
http://dx.doi.org/10.1038/msb.2009.54
work_keys_str_mv AT lijing networkassistedproteinidentificationanddatainterpretationinshotgunproteomics
AT zimmermanlisaj networkassistedproteinidentificationanddatainterpretationinshotgunproteomics
AT parkbyunghoon networkassistedproteinidentificationanddatainterpretationinshotgunproteomics
AT tabbdavidl networkassistedproteinidentificationanddatainterpretationinshotgunproteomics
AT lieblerdanielc networkassistedproteinidentificationanddatainterpretationinshotgunproteomics
AT zhangbing networkassistedproteinidentificationanddatainterpretationinshotgunproteomics