Cargando…
A feature selection method based on multiple kernel learning with expression profiles of different types
BACKGROUND: With the development of high-throughput technology, the researchers can acquire large number of expression data with different types from several public databases. Because most of these data have small number of samples and hundreds or thousands features, how to extract informative featu...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5288949/ https://www.ncbi.nlm.nih.gov/pubmed/28184251 http://dx.doi.org/10.1186/s13040-017-0124-x |
_version_ | 1782504424401272832 |
---|---|
author | Du, Wei Cao, Zhongbo Song, Tianci Li, Ying Liang, Yanchun |
author_facet | Du, Wei Cao, Zhongbo Song, Tianci Li, Ying Liang, Yanchun |
author_sort | Du, Wei |
collection | PubMed |
description | BACKGROUND: With the development of high-throughput technology, the researchers can acquire large number of expression data with different types from several public databases. Because most of these data have small number of samples and hundreds or thousands features, how to extract informative features from expression data effectively and robustly using feature selection technique is challenging and crucial. So far, a mass of many feature selection approaches have been proposed and applied to analyse expression data of different types. However, most of these methods only are limited to measure the performances on one single type of expression data by accuracy or error rate of classification. RESULTS: In this article, we propose a hybrid feature selection method based on Multiple Kernel Learning (MKL) and evaluate the performance on expression datasets of different types. Firstly, the relevance between features and classifying samples is measured by using the optimizing function of MKL. In this step, an iterative gradient descent process is used to perform the optimization both on the parameters of Support Vector Machine (SVM) and kernel confidence. Then, a set of relevant features is selected by sorting the optimizing function of each feature. Furthermore, we apply an embedded scheme of forward selection to detect the compact feature subsets from the relevant feature set. CONCLUSIONS: We not only compare the classification accuracy with other methods, but also compare the stability, similarity and consistency of different algorithms. The proposed method has a satisfactory capability of feature selection for analysing expression datasets of different types using different performance measurements. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-017-0124-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5288949 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52889492017-02-09 A feature selection method based on multiple kernel learning with expression profiles of different types Du, Wei Cao, Zhongbo Song, Tianci Li, Ying Liang, Yanchun BioData Min Methodology BACKGROUND: With the development of high-throughput technology, the researchers can acquire large number of expression data with different types from several public databases. Because most of these data have small number of samples and hundreds or thousands features, how to extract informative features from expression data effectively and robustly using feature selection technique is challenging and crucial. So far, a mass of many feature selection approaches have been proposed and applied to analyse expression data of different types. However, most of these methods only are limited to measure the performances on one single type of expression data by accuracy or error rate of classification. RESULTS: In this article, we propose a hybrid feature selection method based on Multiple Kernel Learning (MKL) and evaluate the performance on expression datasets of different types. Firstly, the relevance between features and classifying samples is measured by using the optimizing function of MKL. In this step, an iterative gradient descent process is used to perform the optimization both on the parameters of Support Vector Machine (SVM) and kernel confidence. Then, a set of relevant features is selected by sorting the optimizing function of each feature. Furthermore, we apply an embedded scheme of forward selection to detect the compact feature subsets from the relevant feature set. CONCLUSIONS: We not only compare the classification accuracy with other methods, but also compare the stability, similarity and consistency of different algorithms. The proposed method has a satisfactory capability of feature selection for analysing expression datasets of different types using different performance measurements. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-017-0124-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-02-02 /pmc/articles/PMC5288949/ /pubmed/28184251 http://dx.doi.org/10.1186/s13040-017-0124-x Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Du, Wei Cao, Zhongbo Song, Tianci Li, Ying Liang, Yanchun A feature selection method based on multiple kernel learning with expression profiles of different types |
title | A feature selection method based on multiple kernel learning with expression profiles of different types |
title_full | A feature selection method based on multiple kernel learning with expression profiles of different types |
title_fullStr | A feature selection method based on multiple kernel learning with expression profiles of different types |
title_full_unstemmed | A feature selection method based on multiple kernel learning with expression profiles of different types |
title_short | A feature selection method based on multiple kernel learning with expression profiles of different types |
title_sort | feature selection method based on multiple kernel learning with expression profiles of different types |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5288949/ https://www.ncbi.nlm.nih.gov/pubmed/28184251 http://dx.doi.org/10.1186/s13040-017-0124-x |
work_keys_str_mv | AT duwei afeatureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes AT caozhongbo afeatureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes AT songtianci afeatureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes AT liying afeatureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes AT liangyanchun afeatureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes AT duwei featureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes AT caozhongbo featureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes AT songtianci featureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes AT liying featureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes AT liangyanchun featureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes |