Cargando…

Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning

Microarrays have now gone from obscurity to being almost ubiquitous in biological research. At the same time, the statistical methodology for microarray analysis has progressed from simple visual assessments of results to novel algorithms for analyzing changes in expression profiles. In a micro-RNA...

Descripción completa

Detalles Bibliográficos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: IEEE 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848046/
https://www.ncbi.nlm.nih.gov/pubmed/27170887
http://dx.doi.org/10.1109/JTEHM.2014.2375820
_version_ 1782429290667704320
collection PubMed
description Microarrays have now gone from obscurity to being almost ubiquitous in biological research. At the same time, the statistical methodology for microarray analysis has progressed from simple visual assessments of results to novel algorithms for analyzing changes in expression profiles. In a micro-RNA (miRNA) or gene-expression profiling experiment, the expression levels of thousands of genes/miRNAs are simultaneously monitored to study the effects of certain treatments, diseases, and developmental stages on their expressions. Microarray-based gene expression profiling can be used to identify genes, whose expressions are changed in response to pathogens or other organisms by comparing gene expression in infected to that in uninfected cells or tissues. Recent studies have revealed that patterns of altered microarray expression profiles in cancer can serve as molecular biomarkers for tumor diagnosis, prognosis of disease-specific outcomes, and prediction of therapeutic responses. Microarray data sets containing expression profiles of a number of miRNAs or genes are used to identify biomarkers, which have dysregulation in normal and malignant tissues. However, small sample size remains a bottleneck to design successful classification methods. On the other hand, adequate number of microarray data that do not have clinical knowledge can be employed as additional source of information. In this paper, a combination of kernelized fuzzy rough set (KFRS) and semisupervised support vector machine (S(3)VM) is proposed for predicting cancer biomarkers from one miRNA and three gene expression data sets. Biomarkers are discovered employing three feature selection methods, including KFRS. The effectiveness of the proposed KFRS and S(3)VM combination on the microarray data sets is demonstrated, and the cancer biomarkers identified from miRNA data are reported. Furthermore, biological significance tests are conducted for miRNA cancer biomarkers.
format Online
Article
Text
id pubmed-4848046
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher IEEE
record_format MEDLINE/PubMed
spelling pubmed-48480462016-05-11 Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning IEEE J Transl Eng Health Med Article Microarrays have now gone from obscurity to being almost ubiquitous in biological research. At the same time, the statistical methodology for microarray analysis has progressed from simple visual assessments of results to novel algorithms for analyzing changes in expression profiles. In a micro-RNA (miRNA) or gene-expression profiling experiment, the expression levels of thousands of genes/miRNAs are simultaneously monitored to study the effects of certain treatments, diseases, and developmental stages on their expressions. Microarray-based gene expression profiling can be used to identify genes, whose expressions are changed in response to pathogens or other organisms by comparing gene expression in infected to that in uninfected cells or tissues. Recent studies have revealed that patterns of altered microarray expression profiles in cancer can serve as molecular biomarkers for tumor diagnosis, prognosis of disease-specific outcomes, and prediction of therapeutic responses. Microarray data sets containing expression profiles of a number of miRNAs or genes are used to identify biomarkers, which have dysregulation in normal and malignant tissues. However, small sample size remains a bottleneck to design successful classification methods. On the other hand, adequate number of microarray data that do not have clinical knowledge can be employed as additional source of information. In this paper, a combination of kernelized fuzzy rough set (KFRS) and semisupervised support vector machine (S(3)VM) is proposed for predicting cancer biomarkers from one miRNA and three gene expression data sets. Biomarkers are discovered employing three feature selection methods, including KFRS. The effectiveness of the proposed KFRS and S(3)VM combination on the microarray data sets is demonstrated, and the cancer biomarkers identified from miRNA data are reported. Furthermore, biological significance tests are conducted for miRNA cancer biomarkers. IEEE 2014-12-02 /pmc/articles/PMC4848046/ /pubmed/27170887 http://dx.doi.org/10.1109/JTEHM.2014.2375820 Text en 2168-2372 © 2014 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
spellingShingle Article
Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning
title Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning
title_full Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning
title_fullStr Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning
title_full_unstemmed Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning
title_short Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning
title_sort identifying cancer biomarkers from microarray data using feature selection and semisupervised learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848046/
https://www.ncbi.nlm.nih.gov/pubmed/27170887
http://dx.doi.org/10.1109/JTEHM.2014.2375820
work_keys_str_mv AT identifyingcancerbiomarkersfrommicroarraydatausingfeatureselectionandsemisupervisedlearning
AT identifyingcancerbiomarkersfrommicroarraydatausingfeatureselectionandsemisupervisedlearning