Cargando…

A feature selection method based on multiple kernel learning with expression profiles of different types

BACKGROUND: With the development of high-throughput technology, the researchers can acquire large number of expression data with different types from several public databases. Because most of these data have small number of samples and hundreds or thousands features, how to extract informative featu...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Wei, Cao, Zhongbo, Song, Tianci, Li, Ying, Liang, Yanchun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5288949/
https://www.ncbi.nlm.nih.gov/pubmed/28184251
http://dx.doi.org/10.1186/s13040-017-0124-x
_version_ 1782504424401272832
author Du, Wei
Cao, Zhongbo
Song, Tianci
Li, Ying
Liang, Yanchun
author_facet Du, Wei
Cao, Zhongbo
Song, Tianci
Li, Ying
Liang, Yanchun
author_sort Du, Wei
collection PubMed
description BACKGROUND: With the development of high-throughput technology, the researchers can acquire large number of expression data with different types from several public databases. Because most of these data have small number of samples and hundreds or thousands features, how to extract informative features from expression data effectively and robustly using feature selection technique is challenging and crucial. So far, a mass of many feature selection approaches have been proposed and applied to analyse expression data of different types. However, most of these methods only are limited to measure the performances on one single type of expression data by accuracy or error rate of classification. RESULTS: In this article, we propose a hybrid feature selection method based on Multiple Kernel Learning (MKL) and evaluate the performance on expression datasets of different types. Firstly, the relevance between features and classifying samples is measured by using the optimizing function of MKL. In this step, an iterative gradient descent process is used to perform the optimization both on the parameters of Support Vector Machine (SVM) and kernel confidence. Then, a set of relevant features is selected by sorting the optimizing function of each feature. Furthermore, we apply an embedded scheme of forward selection to detect the compact feature subsets from the relevant feature set. CONCLUSIONS: We not only compare the classification accuracy with other methods, but also compare the stability, similarity and consistency of different algorithms. The proposed method has a satisfactory capability of feature selection for analysing expression datasets of different types using different performance measurements. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-017-0124-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5288949
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52889492017-02-09 A feature selection method based on multiple kernel learning with expression profiles of different types Du, Wei Cao, Zhongbo Song, Tianci Li, Ying Liang, Yanchun BioData Min Methodology BACKGROUND: With the development of high-throughput technology, the researchers can acquire large number of expression data with different types from several public databases. Because most of these data have small number of samples and hundreds or thousands features, how to extract informative features from expression data effectively and robustly using feature selection technique is challenging and crucial. So far, a mass of many feature selection approaches have been proposed and applied to analyse expression data of different types. However, most of these methods only are limited to measure the performances on one single type of expression data by accuracy or error rate of classification. RESULTS: In this article, we propose a hybrid feature selection method based on Multiple Kernel Learning (MKL) and evaluate the performance on expression datasets of different types. Firstly, the relevance between features and classifying samples is measured by using the optimizing function of MKL. In this step, an iterative gradient descent process is used to perform the optimization both on the parameters of Support Vector Machine (SVM) and kernel confidence. Then, a set of relevant features is selected by sorting the optimizing function of each feature. Furthermore, we apply an embedded scheme of forward selection to detect the compact feature subsets from the relevant feature set. CONCLUSIONS: We not only compare the classification accuracy with other methods, but also compare the stability, similarity and consistency of different algorithms. The proposed method has a satisfactory capability of feature selection for analysing expression datasets of different types using different performance measurements. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-017-0124-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-02-02 /pmc/articles/PMC5288949/ /pubmed/28184251 http://dx.doi.org/10.1186/s13040-017-0124-x Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Du, Wei
Cao, Zhongbo
Song, Tianci
Li, Ying
Liang, Yanchun
A feature selection method based on multiple kernel learning with expression profiles of different types
title A feature selection method based on multiple kernel learning with expression profiles of different types
title_full A feature selection method based on multiple kernel learning with expression profiles of different types
title_fullStr A feature selection method based on multiple kernel learning with expression profiles of different types
title_full_unstemmed A feature selection method based on multiple kernel learning with expression profiles of different types
title_short A feature selection method based on multiple kernel learning with expression profiles of different types
title_sort feature selection method based on multiple kernel learning with expression profiles of different types
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5288949/
https://www.ncbi.nlm.nih.gov/pubmed/28184251
http://dx.doi.org/10.1186/s13040-017-0124-x
work_keys_str_mv AT duwei afeatureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes
AT caozhongbo afeatureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes
AT songtianci afeatureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes
AT liying afeatureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes
AT liangyanchun afeatureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes
AT duwei featureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes
AT caozhongbo featureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes
AT songtianci featureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes
AT liying featureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes
AT liangyanchun featureselectionmethodbasedonmultiplekernellearningwithexpressionprofilesofdifferenttypes