Cargando…
Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins
BACKGROUND: Biomarker discovery datasets created using mass spectrum protein profiling of complex mixtures of proteins contain many peaks that represent the same protein with different charge states. Correlated variables such as these can confound the statistical analyses of proteomic data. Previous...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3847147/ https://www.ncbi.nlm.nih.gov/pubmed/24010718 http://dx.doi.org/10.1186/1756-0500-6-358 |
_version_ | 1782293550404206592 |
---|---|
author | Whitin, John C Rangan, Srinivasa Cohen, Harvey J |
author_facet | Whitin, John C Rangan, Srinivasa Cohen, Harvey J |
author_sort | Whitin, John C |
collection | PubMed |
description | BACKGROUND: Biomarker discovery datasets created using mass spectrum protein profiling of complex mixtures of proteins contain many peaks that represent the same protein with different charge states. Correlated variables such as these can confound the statistical analyses of proteomic data. Previously we developed an algorithm that clustered mass spectrum peaks that were biologically or technically correlated. Here we demonstrate an algorithm that clusters correlated technical aliases only. RESULTS: In this paper, we propose a preprocessing algorithm that can be used for grouping technical aliases in mass spectrometry protein profiling data. The stringency of the variance allowed for clustering is customizable, thereby affecting the number of peaks that are clustered. Subsequent analysis of the clusters, instead of individual peaks, helps reduce difficulties associated with technically-correlated data, and can aid more efficient biomarker identification. CONCLUSIONS: This software can be used to pre-process and thereby decrease the complexity of protein profiling proteomics data, thus simplifying the subsequent analysis of biomarkers by decreasing the number of tests. The software is also a practical tool for identifying which features to investigate further by purification, identification and confirmation. |
format | Online Article Text |
id | pubmed-3847147 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38471472013-12-04 Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins Whitin, John C Rangan, Srinivasa Cohen, Harvey J BMC Res Notes Research Article BACKGROUND: Biomarker discovery datasets created using mass spectrum protein profiling of complex mixtures of proteins contain many peaks that represent the same protein with different charge states. Correlated variables such as these can confound the statistical analyses of proteomic data. Previously we developed an algorithm that clustered mass spectrum peaks that were biologically or technically correlated. Here we demonstrate an algorithm that clusters correlated technical aliases only. RESULTS: In this paper, we propose a preprocessing algorithm that can be used for grouping technical aliases in mass spectrometry protein profiling data. The stringency of the variance allowed for clustering is customizable, thereby affecting the number of peaks that are clustered. Subsequent analysis of the clusters, instead of individual peaks, helps reduce difficulties associated with technically-correlated data, and can aid more efficient biomarker identification. CONCLUSIONS: This software can be used to pre-process and thereby decrease the complexity of protein profiling proteomics data, thus simplifying the subsequent analysis of biomarkers by decreasing the number of tests. The software is also a practical tool for identifying which features to investigate further by purification, identification and confirmation. BioMed Central 2013-09-08 /pmc/articles/PMC3847147/ /pubmed/24010718 http://dx.doi.org/10.1186/1756-0500-6-358 Text en Copyright © 2013 Whitin et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Whitin, John C Rangan, Srinivasa Cohen, Harvey J Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins |
title | Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins |
title_full | Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins |
title_fullStr | Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins |
title_full_unstemmed | Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins |
title_short | Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins |
title_sort | identifying technical aliases in seldi mass spectra of complex mixtures of proteins |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3847147/ https://www.ncbi.nlm.nih.gov/pubmed/24010718 http://dx.doi.org/10.1186/1756-0500-6-358 |
work_keys_str_mv | AT whitinjohnc identifyingtechnicalaliasesinseldimassspectraofcomplexmixturesofproteins AT rangansrinivasa identifyingtechnicalaliasesinseldimassspectraofcomplexmixturesofproteins AT cohenharveyj identifyingtechnicalaliasesinseldimassspectraofcomplexmixturesofproteins |