Cargando…

APP: an Automated Proteomics Pipeline for the analysis of mass spectrometry data based on multiple open access tools

BACKGROUND: Mass spectrometry analyses of complex protein samples yield large amounts of data and specific expertise is needed for data analysis, in addition to a dedicated computer infrastructure. Furthermore, the identification of proteins and their specific properties require the use of multiple...

Descripción completa

Detalles Bibliográficos
Autores principales:	Malm, Erik K, Srivastava, Vaibhav, Sundqvist, Gustav, Bulone, Vincent
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2014
Materias:	Software
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4314934/ https://www.ncbi.nlm.nih.gov/pubmed/25547515 http://dx.doi.org/10.1186/s12859-014-0441-8

_version_	1782355399418052608
author	Malm, Erik K Srivastava, Vaibhav Sundqvist, Gustav Bulone, Vincent
author_facet	Malm, Erik K Srivastava, Vaibhav Sundqvist, Gustav Bulone, Vincent
author_sort	Malm, Erik K
collection	PubMed
description	BACKGROUND: Mass spectrometry analyses of complex protein samples yield large amounts of data and specific expertise is needed for data analysis, in addition to a dedicated computer infrastructure. Furthermore, the identification of proteins and their specific properties require the use of multiple independent bioinformatics tools and several database search algorithms to process the same datasets. In order to facilitate and increase the speed of data analysis, there is a need for an integrated platform that would allow a comprehensive profiling of thousands of peptides and proteins in a single process through the simultaneous exploitation of multiple complementary algorithms. RESULTS: We have established a new proteomics pipeline designated as APP that fulfills these objectives using a complete series of tools freely available from open sources. APP automates the processing of proteomics tasks such as peptide identification, validation and quantitation from LC-MS/MS data and allows easy integration of many separate proteomics tools. Distributed processing is at the core of APP, allowing the processing of very large datasets using any combination of Windows/Linux physical or virtual computing resources. CONCLUSIONS: APP provides distributed computing nodes that are simple to set up, greatly relieving the need for separate IT competence when handling large datasets. The modular nature of APP allows complex workflows to be managed and distributed, speeding up throughput and setup. Additionally, APP logs execution information on all executed tasks and generated results, simplifying information management and validation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0441-8) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-4314934
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-43149342015-02-04 APP: an Automated Proteomics Pipeline for the analysis of mass spectrometry data based on multiple open access tools Malm, Erik K Srivastava, Vaibhav Sundqvist, Gustav Bulone, Vincent BMC Bioinformatics Software BACKGROUND: Mass spectrometry analyses of complex protein samples yield large amounts of data and specific expertise is needed for data analysis, in addition to a dedicated computer infrastructure. Furthermore, the identification of proteins and their specific properties require the use of multiple independent bioinformatics tools and several database search algorithms to process the same datasets. In order to facilitate and increase the speed of data analysis, there is a need for an integrated platform that would allow a comprehensive profiling of thousands of peptides and proteins in a single process through the simultaneous exploitation of multiple complementary algorithms. RESULTS: We have established a new proteomics pipeline designated as APP that fulfills these objectives using a complete series of tools freely available from open sources. APP automates the processing of proteomics tasks such as peptide identification, validation and quantitation from LC-MS/MS data and allows easy integration of many separate proteomics tools. Distributed processing is at the core of APP, allowing the processing of very large datasets using any combination of Windows/Linux physical or virtual computing resources. CONCLUSIONS: APP provides distributed computing nodes that are simple to set up, greatly relieving the need for separate IT competence when handling large datasets. The modular nature of APP allows complex workflows to be managed and distributed, speeding up throughput and setup. Additionally, APP logs execution information on all executed tasks and generated results, simplifying information management and validation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0441-8) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-30 /pmc/articles/PMC4314934/ /pubmed/25547515 http://dx.doi.org/10.1186/s12859-014-0441-8 Text en © Malm et al.; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Software Malm, Erik K Srivastava, Vaibhav Sundqvist, Gustav Bulone, Vincent APP: an Automated Proteomics Pipeline for the analysis of mass spectrometry data based on multiple open access tools
title	APP: an Automated Proteomics Pipeline for the analysis of mass spectrometry data based on multiple open access tools
title_full	APP: an Automated Proteomics Pipeline for the analysis of mass spectrometry data based on multiple open access tools
title_fullStr	APP: an Automated Proteomics Pipeline for the analysis of mass spectrometry data based on multiple open access tools
title_full_unstemmed	APP: an Automated Proteomics Pipeline for the analysis of mass spectrometry data based on multiple open access tools
title_short	APP: an Automated Proteomics Pipeline for the analysis of mass spectrometry data based on multiple open access tools
title_sort	app: an automated proteomics pipeline for the analysis of mass spectrometry data based on multiple open access tools
topic	Software
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4314934/ https://www.ncbi.nlm.nih.gov/pubmed/25547515 http://dx.doi.org/10.1186/s12859-014-0441-8
work_keys_str_mv	AT malmerikk appanautomatedproteomicspipelinefortheanalysisofmassspectrometrydatabasedonmultipleopenaccesstools AT srivastavavaibhav appanautomatedproteomicspipelinefortheanalysisofmassspectrometrydatabasedonmultipleopenaccesstools AT sundqvistgustav appanautomatedproteomicspipelinefortheanalysisofmassspectrometrydatabasedonmultipleopenaccesstools AT bulonevincent appanautomatedproteomicspipelinefortheanalysisofmassspectrometrydatabasedonmultipleopenaccesstools

APP: an Automated Proteomics Pipeline for the analysis of mass spectrometry data based on multiple open access tools

Ejemplares similares