Cargando…

Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins

BACKGROUND: Biomarker discovery datasets created using mass spectrum protein profiling of complex mixtures of proteins contain many peaks that represent the same protein with different charge states. Correlated variables such as these can confound the statistical analyses of proteomic data. Previous...

Descripción completa

Detalles Bibliográficos
Autores principales: Whitin, John C, Rangan, Srinivasa, Cohen, Harvey J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3847147/
https://www.ncbi.nlm.nih.gov/pubmed/24010718
http://dx.doi.org/10.1186/1756-0500-6-358
_version_ 1782293550404206592
author Whitin, John C
Rangan, Srinivasa
Cohen, Harvey J
author_facet Whitin, John C
Rangan, Srinivasa
Cohen, Harvey J
author_sort Whitin, John C
collection PubMed
description BACKGROUND: Biomarker discovery datasets created using mass spectrum protein profiling of complex mixtures of proteins contain many peaks that represent the same protein with different charge states. Correlated variables such as these can confound the statistical analyses of proteomic data. Previously we developed an algorithm that clustered mass spectrum peaks that were biologically or technically correlated. Here we demonstrate an algorithm that clusters correlated technical aliases only. RESULTS: In this paper, we propose a preprocessing algorithm that can be used for grouping technical aliases in mass spectrometry protein profiling data. The stringency of the variance allowed for clustering is customizable, thereby affecting the number of peaks that are clustered. Subsequent analysis of the clusters, instead of individual peaks, helps reduce difficulties associated with technically-correlated data, and can aid more efficient biomarker identification. CONCLUSIONS: This software can be used to pre-process and thereby decrease the complexity of protein profiling proteomics data, thus simplifying the subsequent analysis of biomarkers by decreasing the number of tests. The software is also a practical tool for identifying which features to investigate further by purification, identification and confirmation.
format Online
Article
Text
id pubmed-3847147
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38471472013-12-04 Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins Whitin, John C Rangan, Srinivasa Cohen, Harvey J BMC Res Notes Research Article BACKGROUND: Biomarker discovery datasets created using mass spectrum protein profiling of complex mixtures of proteins contain many peaks that represent the same protein with different charge states. Correlated variables such as these can confound the statistical analyses of proteomic data. Previously we developed an algorithm that clustered mass spectrum peaks that were biologically or technically correlated. Here we demonstrate an algorithm that clusters correlated technical aliases only. RESULTS: In this paper, we propose a preprocessing algorithm that can be used for grouping technical aliases in mass spectrometry protein profiling data. The stringency of the variance allowed for clustering is customizable, thereby affecting the number of peaks that are clustered. Subsequent analysis of the clusters, instead of individual peaks, helps reduce difficulties associated with technically-correlated data, and can aid more efficient biomarker identification. CONCLUSIONS: This software can be used to pre-process and thereby decrease the complexity of protein profiling proteomics data, thus simplifying the subsequent analysis of biomarkers by decreasing the number of tests. The software is also a practical tool for identifying which features to investigate further by purification, identification and confirmation. BioMed Central 2013-09-08 /pmc/articles/PMC3847147/ /pubmed/24010718 http://dx.doi.org/10.1186/1756-0500-6-358 Text en Copyright © 2013 Whitin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Whitin, John C
Rangan, Srinivasa
Cohen, Harvey J
Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins
title Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins
title_full Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins
title_fullStr Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins
title_full_unstemmed Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins
title_short Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins
title_sort identifying technical aliases in seldi mass spectra of complex mixtures of proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3847147/
https://www.ncbi.nlm.nih.gov/pubmed/24010718
http://dx.doi.org/10.1186/1756-0500-6-358
work_keys_str_mv AT whitinjohnc identifyingtechnicalaliasesinseldimassspectraofcomplexmixturesofproteins
AT rangansrinivasa identifyingtechnicalaliasesinseldimassspectraofcomplexmixturesofproteins
AT cohenharveyj identifyingtechnicalaliasesinseldimassspectraofcomplexmixturesofproteins