Cargando…
Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning
Microarrays have now gone from obscurity to being almost ubiquitous in biological research. At the same time, the statistical methodology for microarray analysis has progressed from simple visual assessments of results to novel algorithms for analyzing changes in expression profiles. In a micro-RNA...
Formato: | Online Artículo Texto |
---|---|
Lenguaje: | English |
Publicado: |
IEEE
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848046/ https://www.ncbi.nlm.nih.gov/pubmed/27170887 http://dx.doi.org/10.1109/JTEHM.2014.2375820 |
_version_ | 1782429290667704320 |
---|---|
collection | PubMed |
description | Microarrays have now gone from obscurity to being almost ubiquitous in biological research. At the same time, the statistical methodology for microarray analysis has progressed from simple visual assessments of results to novel algorithms for analyzing changes in expression profiles. In a micro-RNA (miRNA) or gene-expression profiling experiment, the expression levels of thousands of genes/miRNAs are simultaneously monitored to study the effects of certain treatments, diseases, and developmental stages on their expressions. Microarray-based gene expression profiling can be used to identify genes, whose expressions are changed in response to pathogens or other organisms by comparing gene expression in infected to that in uninfected cells or tissues. Recent studies have revealed that patterns of altered microarray expression profiles in cancer can serve as molecular biomarkers for tumor diagnosis, prognosis of disease-specific outcomes, and prediction of therapeutic responses. Microarray data sets containing expression profiles of a number of miRNAs or genes are used to identify biomarkers, which have dysregulation in normal and malignant tissues. However, small sample size remains a bottleneck to design successful classification methods. On the other hand, adequate number of microarray data that do not have clinical knowledge can be employed as additional source of information. In this paper, a combination of kernelized fuzzy rough set (KFRS) and semisupervised support vector machine (S(3)VM) is proposed for predicting cancer biomarkers from one miRNA and three gene expression data sets. Biomarkers are discovered employing three feature selection methods, including KFRS. The effectiveness of the proposed KFRS and S(3)VM combination on the microarray data sets is demonstrated, and the cancer biomarkers identified from miRNA data are reported. Furthermore, biological significance tests are conducted for miRNA cancer biomarkers. |
format | Online Article Text |
id | pubmed-4848046 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | IEEE |
record_format | MEDLINE/PubMed |
spelling | pubmed-48480462016-05-11 Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning IEEE J Transl Eng Health Med Article Microarrays have now gone from obscurity to being almost ubiquitous in biological research. At the same time, the statistical methodology for microarray analysis has progressed from simple visual assessments of results to novel algorithms for analyzing changes in expression profiles. In a micro-RNA (miRNA) or gene-expression profiling experiment, the expression levels of thousands of genes/miRNAs are simultaneously monitored to study the effects of certain treatments, diseases, and developmental stages on their expressions. Microarray-based gene expression profiling can be used to identify genes, whose expressions are changed in response to pathogens or other organisms by comparing gene expression in infected to that in uninfected cells or tissues. Recent studies have revealed that patterns of altered microarray expression profiles in cancer can serve as molecular biomarkers for tumor diagnosis, prognosis of disease-specific outcomes, and prediction of therapeutic responses. Microarray data sets containing expression profiles of a number of miRNAs or genes are used to identify biomarkers, which have dysregulation in normal and malignant tissues. However, small sample size remains a bottleneck to design successful classification methods. On the other hand, adequate number of microarray data that do not have clinical knowledge can be employed as additional source of information. In this paper, a combination of kernelized fuzzy rough set (KFRS) and semisupervised support vector machine (S(3)VM) is proposed for predicting cancer biomarkers from one miRNA and three gene expression data sets. Biomarkers are discovered employing three feature selection methods, including KFRS. The effectiveness of the proposed KFRS and S(3)VM combination on the microarray data sets is demonstrated, and the cancer biomarkers identified from miRNA data are reported. Furthermore, biological significance tests are conducted for miRNA cancer biomarkers. IEEE 2014-12-02 /pmc/articles/PMC4848046/ /pubmed/27170887 http://dx.doi.org/10.1109/JTEHM.2014.2375820 Text en 2168-2372 © 2014 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information. |
spellingShingle | Article Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning |
title | Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning |
title_full | Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning |
title_fullStr | Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning |
title_full_unstemmed | Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning |
title_short | Identifying Cancer Biomarkers From Microarray Data Using Feature Selection and Semisupervised Learning |
title_sort | identifying cancer biomarkers from microarray data using feature selection and semisupervised learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4848046/ https://www.ncbi.nlm.nih.gov/pubmed/27170887 http://dx.doi.org/10.1109/JTEHM.2014.2375820 |
work_keys_str_mv | AT identifyingcancerbiomarkersfrommicroarraydatausingfeatureselectionandsemisupervisedlearning AT identifyingcancerbiomarkersfrommicroarraydatausingfeatureselectionandsemisupervisedlearning |