Cargando…

A Hybrid Feature Subset Selection Algorithm for Analysis of High Correlation Proteomic Data

Pathological changes within an organ can be reflected as proteomic patterns in biological fluids such as plasma, serum, and urine. The surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (SELDI-TOF MS) has been used to generate proteomic profiles from biological fluids....

Descripción completa

Detalles Bibliográficos
Autores principales: Kordy, Hussain Montazery, Baygi, Mohammad Hossein Miran, Moradi, Mohammad Hassan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Medknow Publications & Media Pvt Ltd 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3660712/
https://www.ncbi.nlm.nih.gov/pubmed/23717808
_version_ 1782270597201395712
author Kordy, Hussain Montazery
Baygi, Mohammad Hossein Miran
Moradi, Mohammad Hassan
author_facet Kordy, Hussain Montazery
Baygi, Mohammad Hossein Miran
Moradi, Mohammad Hassan
author_sort Kordy, Hussain Montazery
collection PubMed
description Pathological changes within an organ can be reflected as proteomic patterns in biological fluids such as plasma, serum, and urine. The surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (SELDI-TOF MS) has been used to generate proteomic profiles from biological fluids. Mass spectrometry yields redundant noisy data that the most data points are irrelevant features for differentiating between cancer and normal cases. In this paper, we have proposed a hybrid feature subset selection algorithm based on maximum-discrimination and minimum-correlation coupled with peak scoring criteria. Our algorithm has been applied to two independent SELDI-TOF MS datasets of ovarian cancer obtained from the NCI-FDA clinical proteomics databank. The proposed algorithm has used to extract a set of proteins as potential biomarkers in each dataset. We applied the linear discriminate analysis to identify the important biomarkers. The selected biomarkers have been able to successfully diagnose the ovarian cancer patients from the noncancer control group with an accuracy of 100%, a sensitivity of 100%, and a specificity of 100% in the two datasets. The hybrid algorithm has the advantage that increases reproducibility of selected biomarkers and able to find a small set of proteins with high discrimination power.
format Online
Article
Text
id pubmed-3660712
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Medknow Publications & Media Pvt Ltd
record_format MEDLINE/PubMed
spelling pubmed-36607122013-05-28 A Hybrid Feature Subset Selection Algorithm for Analysis of High Correlation Proteomic Data Kordy, Hussain Montazery Baygi, Mohammad Hossein Miran Moradi, Mohammad Hassan J Med Signals Sens Original Article Pathological changes within an organ can be reflected as proteomic patterns in biological fluids such as plasma, serum, and urine. The surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (SELDI-TOF MS) has been used to generate proteomic profiles from biological fluids. Mass spectrometry yields redundant noisy data that the most data points are irrelevant features for differentiating between cancer and normal cases. In this paper, we have proposed a hybrid feature subset selection algorithm based on maximum-discrimination and minimum-correlation coupled with peak scoring criteria. Our algorithm has been applied to two independent SELDI-TOF MS datasets of ovarian cancer obtained from the NCI-FDA clinical proteomics databank. The proposed algorithm has used to extract a set of proteins as potential biomarkers in each dataset. We applied the linear discriminate analysis to identify the important biomarkers. The selected biomarkers have been able to successfully diagnose the ovarian cancer patients from the noncancer control group with an accuracy of 100%, a sensitivity of 100%, and a specificity of 100% in the two datasets. The hybrid algorithm has the advantage that increases reproducibility of selected biomarkers and able to find a small set of proteins with high discrimination power. Medknow Publications & Media Pvt Ltd 2012 /pmc/articles/PMC3660712/ /pubmed/23717808 Text en Copyright: © Journal of Medical Signals and Sensors http://creativecommons.org/licenses/by-nc-sa/3.0 This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Kordy, Hussain Montazery
Baygi, Mohammad Hossein Miran
Moradi, Mohammad Hassan
A Hybrid Feature Subset Selection Algorithm for Analysis of High Correlation Proteomic Data
title A Hybrid Feature Subset Selection Algorithm for Analysis of High Correlation Proteomic Data
title_full A Hybrid Feature Subset Selection Algorithm for Analysis of High Correlation Proteomic Data
title_fullStr A Hybrid Feature Subset Selection Algorithm for Analysis of High Correlation Proteomic Data
title_full_unstemmed A Hybrid Feature Subset Selection Algorithm for Analysis of High Correlation Proteomic Data
title_short A Hybrid Feature Subset Selection Algorithm for Analysis of High Correlation Proteomic Data
title_sort hybrid feature subset selection algorithm for analysis of high correlation proteomic data
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3660712/
https://www.ncbi.nlm.nih.gov/pubmed/23717808
work_keys_str_mv AT kordyhussainmontazery ahybridfeaturesubsetselectionalgorithmforanalysisofhighcorrelationproteomicdata
AT baygimohammadhosseinmiran ahybridfeaturesubsetselectionalgorithmforanalysisofhighcorrelationproteomicdata
AT moradimohammadhassan ahybridfeaturesubsetselectionalgorithmforanalysisofhighcorrelationproteomicdata
AT kordyhussainmontazery hybridfeaturesubsetselectionalgorithmforanalysisofhighcorrelationproteomicdata
AT baygimohammadhosseinmiran hybridfeaturesubsetselectionalgorithmforanalysisofhighcorrelationproteomicdata
AT moradimohammadhassan hybridfeaturesubsetselectionalgorithmforanalysisofhighcorrelationproteomicdata