Cargando…

A high performance profile-biomarker diagnosis for mass spectral profiles

BACKGROUND: Although mass spectrometry based proteomics demonstrates an exciting promise in complex diseases diagnosis, it remains an important research field rather than an applicable clinical routine for its diagnostic accuracy and data reproducibility. Relatively less investigation has been done...

Descripción completa

Detalles Bibliográficos
Autor principal: Han, Henry
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287485/
https://www.ncbi.nlm.nih.gov/pubmed/22784576
http://dx.doi.org/10.1186/1752-0509-5-S2-S5
_version_ 1782224673937817600
author Han, Henry
author_facet Han, Henry
author_sort Han, Henry
collection PubMed
description BACKGROUND: Although mass spectrometry based proteomics demonstrates an exciting promise in complex diseases diagnosis, it remains an important research field rather than an applicable clinical routine for its diagnostic accuracy and data reproducibility. Relatively less investigation has been done yet in attaining high-performance proteomic pattern classification compared with the amount of endeavours in enhancing data reproducibility. METHODS: In this study, we present a novel machine learning approach to achieve a clinical level disease diagnosis for mass spectral data. We propose multi-resolution independent component analysis, a novel feature selection algorithm to tackle the large dimensionality of mass spectra, by following our local and global feature selection framework. We also develop high-performance classifiers by embedding multi-resolution independent component analysis in linear discriminant analysis and support vector machines. RESULTS: Our multi-resolution independent component based support vector machines not only achieve clinical level classification accuracy, but also overcome the weakness in traditional peak-selection based biomarker discovery. In addition to rigorous theoretical analysis, we demonstrate our method’s superiority by comparing it with nine state-of-the-art classification and regression algorithms on six heterogeneous mass spectral profiles. CONCLUSIONS: Our work not only suggests an alternative direction from machine learning to accelerate mass spectral proteomic technologies into a clinical routine by treating an input profile as a ‘profile-biomarker’, but also has positive impacts on large scale ‘omics' data mining. Related source codes and data sets can be found at: https://sites.google.com/site/heyaumbioinformatics/home/proteomics
format Online
Article
Text
id pubmed-3287485
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32874852012-02-28 A high performance profile-biomarker diagnosis for mass spectral profiles Han, Henry BMC Syst Biol Proceedings BACKGROUND: Although mass spectrometry based proteomics demonstrates an exciting promise in complex diseases diagnosis, it remains an important research field rather than an applicable clinical routine for its diagnostic accuracy and data reproducibility. Relatively less investigation has been done yet in attaining high-performance proteomic pattern classification compared with the amount of endeavours in enhancing data reproducibility. METHODS: In this study, we present a novel machine learning approach to achieve a clinical level disease diagnosis for mass spectral data. We propose multi-resolution independent component analysis, a novel feature selection algorithm to tackle the large dimensionality of mass spectra, by following our local and global feature selection framework. We also develop high-performance classifiers by embedding multi-resolution independent component analysis in linear discriminant analysis and support vector machines. RESULTS: Our multi-resolution independent component based support vector machines not only achieve clinical level classification accuracy, but also overcome the weakness in traditional peak-selection based biomarker discovery. In addition to rigorous theoretical analysis, we demonstrate our method’s superiority by comparing it with nine state-of-the-art classification and regression algorithms on six heterogeneous mass spectral profiles. CONCLUSIONS: Our work not only suggests an alternative direction from machine learning to accelerate mass spectral proteomic technologies into a clinical routine by treating an input profile as a ‘profile-biomarker’, but also has positive impacts on large scale ‘omics' data mining. Related source codes and data sets can be found at: https://sites.google.com/site/heyaumbioinformatics/home/proteomics BioMed Central 2011-12-14 /pmc/articles/PMC3287485/ /pubmed/22784576 http://dx.doi.org/10.1186/1752-0509-5-S2-S5 Text en Copyright ©2011 Han; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Han, Henry
A high performance profile-biomarker diagnosis for mass spectral profiles
title A high performance profile-biomarker diagnosis for mass spectral profiles
title_full A high performance profile-biomarker diagnosis for mass spectral profiles
title_fullStr A high performance profile-biomarker diagnosis for mass spectral profiles
title_full_unstemmed A high performance profile-biomarker diagnosis for mass spectral profiles
title_short A high performance profile-biomarker diagnosis for mass spectral profiles
title_sort high performance profile-biomarker diagnosis for mass spectral profiles
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287485/
https://www.ncbi.nlm.nih.gov/pubmed/22784576
http://dx.doi.org/10.1186/1752-0509-5-S2-S5
work_keys_str_mv AT hanhenry ahighperformanceprofilebiomarkerdiagnosisformassspectralprofiles
AT hanhenry highperformanceprofilebiomarkerdiagnosisformassspectralprofiles