Cargando…

Method for Rapid Protein Identification in a Large Database

Protein identification is an integral part of proteomics research. The available tools to identify proteins in tandem mass spectrometry experiments are not optimized to face current challenges in terms of identification scale and speed owing to the exponential growth of the protein database and the...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Wenli, Zhao, Xiaofang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3755435/
https://www.ncbi.nlm.nih.gov/pubmed/24000323
http://dx.doi.org/10.1155/2013/414069
_version_ 1782281992063156224
author Zhang, Wenli
Zhao, Xiaofang
author_facet Zhang, Wenli
Zhao, Xiaofang
author_sort Zhang, Wenli
collection PubMed
description Protein identification is an integral part of proteomics research. The available tools to identify proteins in tandem mass spectrometry experiments are not optimized to face current challenges in terms of identification scale and speed owing to the exponential growth of the protein database and the accelerated generation of mass spectrometry data, as well as the demand for nonspecific digestion and post-modifications in complex-sample identification. As a result, a rapid method is required to mitigate such complexity and computation challenges. This paper thus aims to present an open method to prevent enzyme and modification specificity on a large database. This paper designed and developed a distributed program to facilitate application to computer resources. With this optimization, nearly linear speedup and real-time support are achieved on a large database with nonspecific digestion, thus enabling testing with two classical large protein databases in a 20-blade cluster. This work aids in the discovery of more significant biological results, such as modification sites, and enables the identification of more complex samples, such as metaproteomics samples.
format Online
Article
Text
id pubmed-3755435
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-37554352013-09-02 Method for Rapid Protein Identification in a Large Database Zhang, Wenli Zhao, Xiaofang Biomed Res Int Research Article Protein identification is an integral part of proteomics research. The available tools to identify proteins in tandem mass spectrometry experiments are not optimized to face current challenges in terms of identification scale and speed owing to the exponential growth of the protein database and the accelerated generation of mass spectrometry data, as well as the demand for nonspecific digestion and post-modifications in complex-sample identification. As a result, a rapid method is required to mitigate such complexity and computation challenges. This paper thus aims to present an open method to prevent enzyme and modification specificity on a large database. This paper designed and developed a distributed program to facilitate application to computer resources. With this optimization, nearly linear speedup and real-time support are achieved on a large database with nonspecific digestion, thus enabling testing with two classical large protein databases in a 20-blade cluster. This work aids in the discovery of more significant biological results, such as modification sites, and enables the identification of more complex samples, such as metaproteomics samples. Hindawi Publishing Corporation 2013 2013-08-13 /pmc/articles/PMC3755435/ /pubmed/24000323 http://dx.doi.org/10.1155/2013/414069 Text en Copyright © 2013 W. Zhang and X. Zhao. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Wenli
Zhao, Xiaofang
Method for Rapid Protein Identification in a Large Database
title Method for Rapid Protein Identification in a Large Database
title_full Method for Rapid Protein Identification in a Large Database
title_fullStr Method for Rapid Protein Identification in a Large Database
title_full_unstemmed Method for Rapid Protein Identification in a Large Database
title_short Method for Rapid Protein Identification in a Large Database
title_sort method for rapid protein identification in a large database
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3755435/
https://www.ncbi.nlm.nih.gov/pubmed/24000323
http://dx.doi.org/10.1155/2013/414069
work_keys_str_mv AT zhangwenli methodforrapidproteinidentificationinalargedatabase
AT zhaoxiaofang methodforrapidproteinidentificationinalargedatabase