Cargando…

An Approach for Identifying Cytokines Based on a Novel Ensemble Classifier

Biology is meaningful and important to identify cytokines and investigate their various functions and biochemical mechanisms. However, several issues remain, including the large scale of benchmark datasets, serious imbalance of data, and discovery of new gene families. In this paper, we employ the m...

Descripción completa

Detalles Bibliográficos
Autores principales: Zou, Quan, Wang, Zhen, Guan, Xinjun, Liu, Bin, Wu, Yunfeng, Lin, Ziyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3763580/
https://www.ncbi.nlm.nih.gov/pubmed/24027761
http://dx.doi.org/10.1155/2013/686090
_version_ 1782283040181977088
author Zou, Quan
Wang, Zhen
Guan, Xinjun
Liu, Bin
Wu, Yunfeng
Lin, Ziyu
author_facet Zou, Quan
Wang, Zhen
Guan, Xinjun
Liu, Bin
Wu, Yunfeng
Lin, Ziyu
author_sort Zou, Quan
collection PubMed
description Biology is meaningful and important to identify cytokines and investigate their various functions and biochemical mechanisms. However, several issues remain, including the large scale of benchmark datasets, serious imbalance of data, and discovery of new gene families. In this paper, we employ the machine learning approach based on a novel ensemble classifier to predict cytokines. We directly selected amino acids sequences as research objects. First, we pretreated the benchmark data accurately. Next, we analyzed the physicochemical properties and distribution of whole amino acids and then extracted a group of 120-dimensional (120D) valid features to represent sequences. Third, in the view of the serious imbalance in benchmark datasets, we utilized a sampling approach based on the synthetic minority oversampling technique algorithm and K-means clustering undersampling algorithm to rebuild the training set. Finally, we built a library for dynamic selection and circulating combination based on clustering (LibD3C) and employed the new training set to realize cytokine classification. Experiments showed that the geometric mean of sensitivity and specificity obtained through our approach is as high as 93.3%, which proves that our approach is effective for identifying cytokines.
format Online
Article
Text
id pubmed-3763580
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-37635802013-09-11 An Approach for Identifying Cytokines Based on a Novel Ensemble Classifier Zou, Quan Wang, Zhen Guan, Xinjun Liu, Bin Wu, Yunfeng Lin, Ziyu Biomed Res Int Research Article Biology is meaningful and important to identify cytokines and investigate their various functions and biochemical mechanisms. However, several issues remain, including the large scale of benchmark datasets, serious imbalance of data, and discovery of new gene families. In this paper, we employ the machine learning approach based on a novel ensemble classifier to predict cytokines. We directly selected amino acids sequences as research objects. First, we pretreated the benchmark data accurately. Next, we analyzed the physicochemical properties and distribution of whole amino acids and then extracted a group of 120-dimensional (120D) valid features to represent sequences. Third, in the view of the serious imbalance in benchmark datasets, we utilized a sampling approach based on the synthetic minority oversampling technique algorithm and K-means clustering undersampling algorithm to rebuild the training set. Finally, we built a library for dynamic selection and circulating combination based on clustering (LibD3C) and employed the new training set to realize cytokine classification. Experiments showed that the geometric mean of sensitivity and specificity obtained through our approach is as high as 93.3%, which proves that our approach is effective for identifying cytokines. Hindawi Publishing Corporation 2013 2013-08-21 /pmc/articles/PMC3763580/ /pubmed/24027761 http://dx.doi.org/10.1155/2013/686090 Text en Copyright © 2013 Quan Zou et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zou, Quan
Wang, Zhen
Guan, Xinjun
Liu, Bin
Wu, Yunfeng
Lin, Ziyu
An Approach for Identifying Cytokines Based on a Novel Ensemble Classifier
title An Approach for Identifying Cytokines Based on a Novel Ensemble Classifier
title_full An Approach for Identifying Cytokines Based on a Novel Ensemble Classifier
title_fullStr An Approach for Identifying Cytokines Based on a Novel Ensemble Classifier
title_full_unstemmed An Approach for Identifying Cytokines Based on a Novel Ensemble Classifier
title_short An Approach for Identifying Cytokines Based on a Novel Ensemble Classifier
title_sort approach for identifying cytokines based on a novel ensemble classifier
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3763580/
https://www.ncbi.nlm.nih.gov/pubmed/24027761
http://dx.doi.org/10.1155/2013/686090
work_keys_str_mv AT zouquan anapproachforidentifyingcytokinesbasedonanovelensembleclassifier
AT wangzhen anapproachforidentifyingcytokinesbasedonanovelensembleclassifier
AT guanxinjun anapproachforidentifyingcytokinesbasedonanovelensembleclassifier
AT liubin anapproachforidentifyingcytokinesbasedonanovelensembleclassifier
AT wuyunfeng anapproachforidentifyingcytokinesbasedonanovelensembleclassifier
AT linziyu anapproachforidentifyingcytokinesbasedonanovelensembleclassifier
AT zouquan approachforidentifyingcytokinesbasedonanovelensembleclassifier
AT wangzhen approachforidentifyingcytokinesbasedonanovelensembleclassifier
AT guanxinjun approachforidentifyingcytokinesbasedonanovelensembleclassifier
AT liubin approachforidentifyingcytokinesbasedonanovelensembleclassifier
AT wuyunfeng approachforidentifyingcytokinesbasedonanovelensembleclassifier
AT linziyu approachforidentifyingcytokinesbasedonanovelensembleclassifier