Cargando…

PatternLab for proteomics: a tool for differential shotgun proteomics

BACKGROUND: A goal of proteomics is to distinguish between states of a biological system by identifying protein expression differences. Liu et al. demonstrated a method to perform semi-relative protein quantitation in shotgun proteomics data by correlating the number of tandem mass spectra obtained...

Descripción completa

Detalles Bibliográficos
Autores principales: Carvalho, Paulo C, Fischer, Juliana SG, Chen, Emily I, Yates, John R, Barbosa, Valmir C
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2488363/
https://www.ncbi.nlm.nih.gov/pubmed/18644148
http://dx.doi.org/10.1186/1471-2105-9-316
_version_ 1782158124631719936
author Carvalho, Paulo C
Fischer, Juliana SG
Chen, Emily I
Yates, John R
Barbosa, Valmir C
author_facet Carvalho, Paulo C
Fischer, Juliana SG
Chen, Emily I
Yates, John R
Barbosa, Valmir C
author_sort Carvalho, Paulo C
collection PubMed
description BACKGROUND: A goal of proteomics is to distinguish between states of a biological system by identifying protein expression differences. Liu et al. demonstrated a method to perform semi-relative protein quantitation in shotgun proteomics data by correlating the number of tandem mass spectra obtained for each protein, or "spectral count", with its abundance in a mixture; however, two issues have remained open: how to normalize spectral counting data and how to efficiently pinpoint differences between profiles. Moreover, Chen et al. recently showed how to increase the number of identified proteins in shotgun proteomics by analyzing samples with different MS-compatible detergents while performing proteolytic digestion. The latter introduced new challenges as seen from the data analysis perspective, since replicate readings are not acquired. RESULTS: To address the open issues above, we present a program termed PatternLab for proteomics. This program implements existing strategies and adds two new methods to pinpoint differences in protein profiles. The first method, ACFold, addresses experiments with less than three replicates from each state or having assays acquired by different protocols as described by Chen et al. ACFold uses a combined criterion based on expression fold changes, the AC test, and the false-discovery rate, and can supply a "bird's-eye view" of differentially expressed proteins. The other method addresses experimental designs having multiple readings from each state and is referred to as nSVM (natural support vector machine) because of its roots in evolutionary computing and in statistical learning theory. Our observations suggest that nSVM's niche comprises projects that select a minimum set of proteins for classification purposes; for example, the development of an early detection kit for a given pathology. We demonstrate the effectiveness of each method on experimental data and confront them with existing strategies. CONCLUSION: PatternLab offers an easy and unified access to a variety of feature selection and normalization strategies, each having its own niche. Additionally, graphing tools are available to aid in the analysis of high throughput experimental data. PatternLab is available at .
format Text
id pubmed-2488363
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24883632008-07-29 PatternLab for proteomics: a tool for differential shotgun proteomics Carvalho, Paulo C Fischer, Juliana SG Chen, Emily I Yates, John R Barbosa, Valmir C BMC Bioinformatics Software BACKGROUND: A goal of proteomics is to distinguish between states of a biological system by identifying protein expression differences. Liu et al. demonstrated a method to perform semi-relative protein quantitation in shotgun proteomics data by correlating the number of tandem mass spectra obtained for each protein, or "spectral count", with its abundance in a mixture; however, two issues have remained open: how to normalize spectral counting data and how to efficiently pinpoint differences between profiles. Moreover, Chen et al. recently showed how to increase the number of identified proteins in shotgun proteomics by analyzing samples with different MS-compatible detergents while performing proteolytic digestion. The latter introduced new challenges as seen from the data analysis perspective, since replicate readings are not acquired. RESULTS: To address the open issues above, we present a program termed PatternLab for proteomics. This program implements existing strategies and adds two new methods to pinpoint differences in protein profiles. The first method, ACFold, addresses experiments with less than three replicates from each state or having assays acquired by different protocols as described by Chen et al. ACFold uses a combined criterion based on expression fold changes, the AC test, and the false-discovery rate, and can supply a "bird's-eye view" of differentially expressed proteins. The other method addresses experimental designs having multiple readings from each state and is referred to as nSVM (natural support vector machine) because of its roots in evolutionary computing and in statistical learning theory. Our observations suggest that nSVM's niche comprises projects that select a minimum set of proteins for classification purposes; for example, the development of an early detection kit for a given pathology. We demonstrate the effectiveness of each method on experimental data and confront them with existing strategies. CONCLUSION: PatternLab offers an easy and unified access to a variety of feature selection and normalization strategies, each having its own niche. Additionally, graphing tools are available to aid in the analysis of high throughput experimental data. PatternLab is available at . BioMed Central 2008-07-21 /pmc/articles/PMC2488363/ /pubmed/18644148 http://dx.doi.org/10.1186/1471-2105-9-316 Text en Copyright © 2008 Carvalho et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Carvalho, Paulo C
Fischer, Juliana SG
Chen, Emily I
Yates, John R
Barbosa, Valmir C
PatternLab for proteomics: a tool for differential shotgun proteomics
title PatternLab for proteomics: a tool for differential shotgun proteomics
title_full PatternLab for proteomics: a tool for differential shotgun proteomics
title_fullStr PatternLab for proteomics: a tool for differential shotgun proteomics
title_full_unstemmed PatternLab for proteomics: a tool for differential shotgun proteomics
title_short PatternLab for proteomics: a tool for differential shotgun proteomics
title_sort patternlab for proteomics: a tool for differential shotgun proteomics
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2488363/
https://www.ncbi.nlm.nih.gov/pubmed/18644148
http://dx.doi.org/10.1186/1471-2105-9-316
work_keys_str_mv AT carvalhopauloc patternlabforproteomicsatoolfordifferentialshotgunproteomics
AT fischerjulianasg patternlabforproteomicsatoolfordifferentialshotgunproteomics
AT chenemilyi patternlabforproteomicsatoolfordifferentialshotgunproteomics
AT yatesjohnr patternlabforproteomicsatoolfordifferentialshotgunproteomics
AT barbosavalmirc patternlabforproteomicsatoolfordifferentialshotgunproteomics